LinusWangg's starred repositories

Cal-QL

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Language:PythonStargazers:65Issues:0Issues:0
Language:MATLABLicense:GPL-3.0Stargazers:10326Issues:0Issues:0

air-conditioner

❄️ Yun Portable Air Conditoner. 云空调,便携小空调,为你的夏日带去清凉!

Language:TypeScriptLicense:MITStargazers:3425Issues:0Issues:0

rust-snake-ai-ratatui

Neural network learns to play snake in a terminal, built in Rust with Ratatui

Language:RustLicense:MITStargazers:348Issues:0Issues:0

CodeFuse-Query

Query-Based Code Analysis Engine

Language:JavaLicense:Apache-2.0Stargazers:180Issues:0Issues:0

NCISurvey

Neural Code Intelligence Survey 2024; Reading lists and resources

License:MITStargazers:193Issues:0Issues:0

SeqIns

The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LAVIS

Language:Jupyter NotebookStargazers:29Issues:0Issues:0

ENVISIONS

A Neural-Symbolic Self-Training Framework

Language:CStargazers:92Issues:0Issues:0

self-translate

Do Multilingual Language Models Think Better in English?

Language:Jupyter NotebookLicense:MITStargazers:39Issues:0Issues:0

DPC

Official Implementation of AAAI'24 paper "Dirichlet-Based Prediction Calibration for Learning with Noisy Labels"

Language:PythonStargazers:5Issues:0Issues:0

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookLicense:MITStargazers:156Issues:0Issues:0
Language:PythonStargazers:8Issues:0Issues:0
Language:PythonStargazers:7Issues:0Issues:0

ddim

Denoising Diffusion Implicit Models

Language:PythonLicense:MITStargazers:1349Issues:0Issues:0

cordcloud-action

❤ Auto check in Cord Cloud site by GitHub Action | GitHub Action 实现 Cord Cloud 帐号自动续命

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

TALAR

[NeurIPS'23] Official code for "Natural Language-conditioned Reinforcement Learning with Task-related Language Development and Translation", NeurIPS 2023.

Stargazers:2Issues:0Issues:0

peg

Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.

Language:PythonLicense:MITStargazers:71Issues:0Issues:0

rust-playground

The Rust Playground

Language:RustLicense:Apache-2.0Stargazers:1208Issues:0Issues:0

RepL4RL

Representation Learning for RL

Stargazers:111Issues:0Issues:0

ai-town

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

Language:TypeScriptLicense:MITStargazers:7255Issues:0Issues:0

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6868Issues:0Issues:0

GPT-Critic

GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems

Language:PythonStargazers:10Issues:0Issues:0

DUC

[ICLR 2023 Spotlight] Code release for "Dirichlet-based Uncertainty Calibration for Active Domain Adaptation"

Language:PythonStargazers:25Issues:0Issues:0

message-pusher

搭建专属于你的消息推送服务,支持多种消息推送方式,支持 Markdown,基于 Golang 仅单可执行文件,开箱即用

Language:JavaScriptLicense:MITStargazers:2498Issues:0Issues:0

align_sd

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

Language:PythonLicense:Apache-2.0Stargazers:258Issues:0Issues:0

Imitating-Human-Behaviour-w-Diffusion

Code for ICLR 2023 paper "Imitating Human Behaviour with Diffusion Models"

Language:PythonLicense:MITStargazers:119Issues:0Issues:0
Language:PythonStargazers:46Issues:0Issues:0
Language:PythonStargazers:262Issues:0Issues:0

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

License:Apache-2.0Stargazers:695Issues:0Issues:0

Software-Engineer

2018 NUAA Software-Engineer------SchoolRun for Android

Language:JavaStargazers:1Issues:0Issues:0