tianxusky

Tian Xu's repositories

Language: Python | 10 10

Language: Python | 100

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | 000

Homework for CS234 (2017)

Language: Python | 000

My solutions to the CS234 assignments

Language: Python | License: MIT | 000

Language: Python | License: Apache-2.0 | 000

Google AI Research

Language: Jupyter Notebook | License: Apache-2.0 | 000

Interview = Résumé Guide + LeetCode + Kaggle

Language: Jupyter Notebook | License: GPL-3.0 | 000

Implementations of selected inverse reinforcement learning algorithms.

Language: Python | License: MIT | 000

A customizable framework to create maze and gridworld environments

Language: Python | 000

Code for the paper "Policy Optimization in RLHF: The Impact of Out-of-preference Data"

Language: Python | 000

We propose a new way to make policy optimization more stable.

Language: Python | License: MIT | 000

Folklore facts on probability distribution learning, testing, and whatever-ing

Language: TeX | 000

Language: Python | 000

From the basics to slightly more interesting applications of TensorFlow

Language: Jupyter Notebook | License: NOASSERTION | 000