LinusWangg

followers

following

stars

LinusWangg's starred repositories

Cal-QL

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Language:Python6500

ShiArthur03

Language:MATLABGPL-3.01032600

air-conditioner

❄️ Yun Portable Air Conditoner. 云空调，便携小空调，为你的夏日带去清凉！

Language:TypeScriptMIT342500

rust-snake-ai-ratatui

Neural network learns to play snake in a terminal, built in Rust with Ratatui

Language:RustMIT34800

CodeFuse-Query

Query-Based Code Analysis Engine

Language:JavaApache-2.018000

NCISurvey

Neural Code Intelligence Survey 2024; Reading lists and resources

MIT19300

SeqIns

The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LAVIS

Language:Jupyter Notebook2900

ENVISIONS

A Neural-Symbolic Self-Training Framework

Language:C9200

self-translate

Do Multilingual Language Models Think Better in English?

Language:Jupyter NotebookMIT3900

DPC

Official Implementation of AAAI'24 paper "Dirichlet-Based Prediction Calibration for Learning with Noisy Labels"

Language:Python500

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookMIT15600

RL-state_mask

Language:Python800

HESS

Language:Python700

ddim

Denoising Diffusion Implicit Models

Language:PythonMIT134900

cordcloud-action

❤ Auto check in Cord Cloud site by GitHub Action | GitHub Action 实现 Cord Cloud 帐号自动续命

Language:PythonMIT3800

TALAR

[NeurIPS'23] Official code for "Natural Language-conditioned Reinforcement Learning with Task-related Language Development and Translation", NeurIPS 2023.

200

peg

Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.

Language:PythonMIT7100

rust-playground

The Rust Playground

Language:RustApache-2.0120800

RepL4RL

Representation Learning for RL

ai-town

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

Language:TypeScriptMIT725500

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookApache-2.0686800

GPT-Critic

GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems

Language:Python1000

DUC

[ICLR 2023 Spotlight] Code release for "Dirichlet-based Uncertainty Calibration for Active Domain Adaptation"

Language:Python2500

message-pusher

搭建专属于你的消息推送服务，支持多种消息推送方式，支持 Markdown，基于 Golang 仅单可执行文件，开箱即用

Language:JavaScriptMIT249800

align_sd

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

Language:PythonApache-2.025800

Imitating-Human-Behaviour-w-Diffusion

Code for ICLR 2023 paper "Imitating Human Behaviour with Diffusion Models"

Language:PythonMIT11900

SiMPL

Language:Python4600

decision-diffuser

Language:Python26200

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

Apache-2.069500

Software-Engineer

2018 NUAA Software-Engineer------SchoolRun for Android

Language:Java100