guoshicheng's repositories
transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
RLElement
An introduction to the various components of reinforcement learning
OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
rl-tutorials
basic algorithms of reinforcement learning
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.
my_notebook
Notebooks on RL, DL, ML, PyTorch, and Python
tianshou
An elegant PyTorch deep reinforcement learning library.
FinRL
FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥
easy-rl
A Chinese-language reinforcement learning tutorial (the "Mushroom Book"). Read online at https://datawhalechina.github.io/easy-rl/
awesome-DeepLearning
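Introductory, advanced, and specialized deep learning courses, academic case studies, industry practice cases, a deep learning knowledge encyclopedia, and an interview question bank. Courses, cases, and knowledge of Deep Learning and AI.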
gym
A toolkit for developing and comparing reinforcement learning algorithms.
genshin_auto_fish
An automatic fishing AI for Genshin Impact, based on deep reinforcement learning
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
pytorch-optimizer
torch-optimizer -- collection of optimizers for Pytorch
awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
Tinyhttpd
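Tinyhttpd is an ultra-lightweight HTTP server of fewer than 500 lines, written by J. David Blackstone in 1999. It is well suited for study and helps in understanding the essence of a server program. Official site: http://tinyhttpd.sourceforge.net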
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
ElegantRL
Lightweight and scalable deep reinforcement learning using PyTorch. 🔥
highway-env
A minimalist environment for decision-making in autonomous driving
paper
Reading List
Deep-Reinforcement-Learning-Hands-On-Second-Edition
Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....