Yan Song's repositories
Master-thesis
Policy gradient planning in MBRL using probabilistic models.
NLP-project
Abstractive Summarisation
Gibbs-sampler
coursework
Network-analysis
toy software
subcellular-location-prediction
bioinformatics
Language:JavaScriptMIT000
Language:PythonMIT000
Language:PythonMIT000
envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
Language:C++Apache-2.0000
gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:PythonNOASSERTION000
LLM_Tree_Search
The official implementation of paper: Alphazero-like Tree-Search can guide large language model decoding and training
000
ma-gym
A collection of multi agent environments based on OpenAI gym.
Language:PythonApache-2.0000
Language:PythonMIT000
minigrid-rl
RL experiments using mini grid gym environment
Language:Python000
Language:PythonMIT000
overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
MIT000
safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Apache-2.0000
000