StaminaTang's repositories
Awesome-3D-Detectors
Paper list of awesome 3D detection methods
awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
Byzantine-Federeated-RL
Code for the NeurIPS 2021 paper on Federated Reinforcement Learning with Byzantine Resilience
combined-experience-replay
A Deeper Look at Experience Replay (Zhang and Sutton, 2017)
CORRO
CORRO code
ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
Deep_Learning-Notebook
Paper list
deeprm
Resource Management with Deep Reinforcement Learning (HotNets '16)
EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
eop
Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022
Genet
The repository of the Genet project.
google-research
Google Research
gpt_academic
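A practical interactive interface for ChatGPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm2. Compatible with ERNIE Bot, moss, llama2, rwkv, claude2, Tongyi Qianwen, InternLM, iFlytek Spark, and others.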
Hands-on-RL
https://hrl.boyuai.com/
HuRL
Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper
HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
insightface
State-of-the-art 2D and 3D Face Analysis Project
interreplay
Repository that implements a variety of interpolated experience replay algorithms for continuous control tasks.
Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
machine-learning-and-simulation
All the handwritten notes 📝 and source code files 🖥️ used in my YouTube Videos on Machine Learning & Simulation (https://www.youtube.com/channel/UCh0P7KwJhuQ4vrzc3IRuw4Q)
OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
Reinforcement_Learning_With_Non-Cumulative_Objective
This repository contains code for our TMLCN paper "Reinforcement Learning With Non-Cumulative Objective".
rl-atari-tennis
Play the Atari Tennis game with DQN
TorchPQ
Efficient implementations of Product Quantization and its variants using PyTorch and CUDA
transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, code, datasets, applications, tutorials.