Xidong Feng's repositories
LLM_Tree_Search
(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training
MELU_pytorch
An unofficial pytorch implementation of MELU
apollo_learning
Baidu Apollo Learning
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Deep-RL-Keras
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Language:Jupyter Notebook000
deepdrive
End-to-end simulation for self-driving cars
Language:PythonMIT000
DeepLearningFlappyBird
Flappy Bird hack using Deep Reinforcement Learning (Deep Q-learning).
Language:PythonMIT000
Language:Jupyter Notebook000
Language:Python000
Language:Jupyter Notebook000
metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
MIT000
Language:PythonMIT000
reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
Language:PythonMIT000