Haosheng Zou (邹昊晟)'s repositories
DeepSpeedExamples
SFT+RLHF on baichuan-7B
spinningup
An educational resource to help anyone learn deep reinforcement learning.
algorithm-pattern-python
Python version of algorithm-pattern
caffe_test
for viewing source codes on Windows
cse8340-learningJena
A program that uses Jena to build a model and query it with SPARQL.
reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.
tianshou
An elegant PyTorch deep reinforcement learning platform.
Duke-Tsinghua-MLSS-2017
Duke-Tsinghua Machine Learning Summer School 2017
GeniusInvokationSimulator
Genius Invokation Simulator in python;七圣召唤模拟器;
lab-report
lab reports using markdown
multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
overcoming-catastrophic
Implementation of "Overcoming catastrophic forgetting in neural networks" in Tensorflow