waterhorse1

followers

following

stars

University College London

https://waterhorse1.github.io/

Xidong Feng's repositories

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language:Python127 3 4

ChessGPT

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

Language:PythonApache-2.085 4 4

MELU_pytorch

An unofficial pytorch implementation of MELU

Language:Python41 4 8

NAC

(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.

Language:Jupyter Notebook25 2 1

CMML_pytorch

3 20

ha_ma_ppo

Language:PythonMIT1 30

waterhorse1.github.io

Language:HTMLMIT100

apollo_learning

Baidu Apollo Learning

MIT010

classification

Language:Python020

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language:PythonMIT010

Deep-RL-Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Language:Python010

MRI_RL

Language:Python010

chess_template

Language:Jupyter Notebook000

deepdrive

End-to-end simulation for self-driving cars

Language:PythonMIT000

DeepLearningFlappyBird

Flappy Bird hack using Deep Reinforcement Learning (Deep Q-learning).

Language:PythonMIT000

DRL-implementation

Language:Jupyter Notebook000

haddpg

020

meta_classification

000

Meta_Gradient

Language:Python000

Meta_Regression

Language:Jupyter Notebook000

metaworld

An open source robotics benchmark for meta- and multi-task reinforcement learning

MIT000

models

Models and examples built with TensorFlow

Language:PythonApache-2.0010

MRI_DDPG

Language:Jupyter Notebook020

pearl_lstm

Language:PythonMIT020

Pearl_relabel

Language:PythonMIT000

Promp_test

Language:PythonMIT020

Regression

020

reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

Language:PythonMIT000

torchopt

TorchOpt is a high-performance optimizer library built upon PyTorch for easy implementation of functional optimization and gradient-based meta-learning.

Language:PythonApache-2.0010