Tian Xu's repositories
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:PythonMIT000
CS234
homework for CS234 2017
Language:Python000
CS234-1
My Solution to Assignments of CS234
Language:PythonMIT000
Language:PythonApache-2.0000
google-research
Google AI Research
Language:Jupyter NotebookApache-2.0000
000
Interview
Interview = 简历指南 + LeetCode + Kaggle
Language:Jupyter NotebookGPL-3.0000
Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
Language:PythonMIT000
mazelab
A customizable framework to create maze and gridworld environments
Language:Python000
000
policy_optimization
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
Language:Python000
ppo-dice
We propose a new way to make policy optimization more stable.
Language:PythonMIT000
probabilitydistributiontoolbox
Folklore facts on probability distribution learning, testing, and whatever-ing
Language:TeX000
Language:Python000
tensorflow_tutorials
From the basics to slightly more interesting applications of Tensorflow
Language:Jupyter NotebookNOASSERTION000