Xidong Feng (waterhorse1)

Company: University College London

Home Page: https://waterhorse1.github.io/

Xidong Feng's repositories

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

ChessGPT

(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling

Language: Python | License: Apache-2.0 | Stargazers: 85 | Issues: 4 | Issues: 4

MELU_pytorch

An unofficial pytorch implementation of MELU

NAC

(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.

Language: Jupyter Notebook | Stargazers: 25 | Issues: 2 | Issues: 1
Language: Python | License: MIT | Stargazers: 1 | Issues: 3 | Issues: 0
Language: HTML | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

apollo_learning

Baidu Apollo Learning

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Deep-RL-Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

deepdrive

End-to-end simulation for self-driving cars

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DeepLearningFlappyBird

Flappy Bird hack using Deep Reinforcement Learning (Deep Q-learning).

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

metaworld

An open source robotics benchmark for meta- and multi-task reinforcement learning

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

models

Models and examples built with TensorFlow

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0

reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

torchopt

TorchOpt is a high-performance optimizer library built upon PyTorch for easy implementation of functional optimization and gradient-based meta-learning.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0