StaminaTang

StaminaTang's repositories

Awesome-3D-Detectors

Paper list of awesome 3D detection methods

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

License: Apache-2.0

Byzantine-Federeated-RL

Code for the NeurIPS 2021 paper on Federated Reinforcement Learning with Byzantine Resilience

Language: Python

cc-afbc

Advantage-Filtered Behavioral Cloning for Offline Continuous Control

Language: Python | License: MIT

combined-experience-replay

A Deeper Look at Experience Replay (Zhang and Sutton, 2017)

Language: Python | License: MIT

CORRO

CORRO code

Language: Python

CSRL

Language: Jupyter Notebook

ddpo

Code for the paper "Training Diffusion Models with Reinforcement Learning"

Language: Python | License: MIT

ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

License: MIT

deep_control

Deep Reinforcement Learning for Continuous Control in PyTorch

Deep_Learning-Notebook

Paper list

deeprm

Resource Management with Deep Reinforcement Learning (HotNets '16)

License: MIT

EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022

Language: Jupyter Notebook | License: MIT

Genet

The repository of the Genet project.

Language: Python

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0

gpt_academic

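Provides a practical interactive interface for ChatGPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and support for local models such as chatglm2. Compatible with ERNIE Bot (文心一言), moss, llama2, rwkv, claude2, Tongyi Qianwen (通义千问), InternLM (书生), iFlytek Spark (讯飞星火), and others.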

Language: Python | License: GPL-3.0

Hands-on-RL

https://hrl.boyuai.com/

Language: Jupyter Notebook | License: Apache-2.0

HuRL

Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper

Language: Python | License: MIT

HyQ

Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python | License: MIT

interreplay

Repository that implements a variety of interpolated experience replay algorithms for continuous control tasks.

Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

Language: Python | License: MIT

machine-learning-and-simulation

All the handwritten notes 📝 and source code files 🖥️ used in my YouTube Videos on Machine Learning & Simulation (https://www.youtube.com/channel/UCh0P7KwJhuQ4vrzc3IRuw4Q)

Language: Python | License: MIT

OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

Language: Python | License: MIT

open-interpreter

OpenAI's Code Interpreter in your terminal, running locally

Language: Python | License: MIT

Reinforcement_Learning_With_Non-Cumulative_Objective

This repository contains code for our TMLCN paper "Reinforcement Learning With Non-Cumulative Objective".

rl-atari-tennis

Play the Atari Tennis game with DQN

Language: Python

TorchPQ

Efficient implementations of Product Quantization and its variants using PyTorch and CUDA

Language: Cuda | License: MIT

transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, and tutorials.

Language: Python | License: MIT