StaminaTang

StaminaTang's repositories

Awesome-3D-Detectors

Paper list of awesome 3D detection methods

Stargazers: 0 · Issues: 0 · Issues: 0

awesome-diffusion-model-in-rl

A curated list of resources on diffusion models in RL (continually updated)

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

Byzantine-Federeated-RL

Code for the NeurIPS 2021 paper on Federated Reinforcement Learning with Byzantine Resilience

Stargazers: 0 · Issues: 0 · Issues: 0

cc-afbc

Advantage-Filtered Behavioral Cloning for Offline Continuous Control

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

combined-experience-replay

A Deeper Look at Experience Replay (Zhang and Sutton, 2017)

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

CORRO

CORRO code

Stargazers: 0 · Issues: 0 · Issues: 0

ddpo

Code for the paper "Training Diffusion Models with Reinforcement Learning"

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

deep_control

Deep Reinforcement Learning for Continuous Control in PyTorch

Stargazers: 0 · Issues: 0 · Issues: 0

deeprm

Resource Management with Deep Reinforcement Learning (HotNets '16)

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Stargazers: 0 · Issues: 1 · Issues: 0

eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

Genet

The repository of the Genet project.

Stargazers: 0 · Issues: 0 · Issues: 0

google-research

Google Research

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

gpt_academic

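Provides a practical interactive interface for ChatGPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other languages; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm2. Compatible with ERNIE Bot (文心一言), moss, llama2, rwkv, claude2, Tongyi Qianwen (通义千问), InternLM (书生), iFlytek Spark (讯飞星火), and others.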

License: GPL-3.0 · Stargazers: 0 · Issues: 0 · Issues: 0

Hands-on-RL

https://hrl.boyuai.com/

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

HuRL

Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

HyQ

Official code repo for the paper "Hybrid RL: Using both offline and online data can make RL efficient".

Stargazers: 0 · Issues: 0 · Issues: 0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

interreplay

Repository that implements a variety of interpolated experience replay algorithms for continuous control tasks.

Stargazers: 0 · Issues: 0 · Issues: 0

Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

machine-learning-and-simulation

All the handwritten notes 📝 and source code files 🖥️ used in my YouTube Videos on Machine Learning & Simulation (https://www.youtube.com/channel/UCh0P7KwJhuQ4vrzc3IRuw4Q)

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

open-interpreter

OpenAI's Code Interpreter in your terminal, running locally

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

Reinforcement_Learning_With_Non-Cumulative_Objective

This repository contains code for our TMLCN paper "Reinforcement Learning With Non-Cumulative Objective".

Stargazers: 0 · Issues: 0 · Issues: 0

rl-atari-tennis

Play the Atari Tennis game using DQN

Stargazers: 0 · Issues: 0 · Issues: 0

TorchPQ

Efficient implementations of Product Quantization and its variants using PyTorch and CUDA

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, tutorials.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0