Haosheng Zou (邹昊晟)'s repositories

DeepSpeedExamples

SFT+RLHF on baichuan-7B

Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0

blog

learning blog towards a programmer

ELF

An End-To-End, Lightweight and Flexible Platform for Game Research

Language:C++License:NOASSERTIONStargazers:1Issues:1Issues:0

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language:PythonLicense:NOASSERTIONStargazers:1Issues:2Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

algorithm-pattern-python

Python version of algorithm-pattern

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

caffe_test

for viewing source codes on Windows

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

cse8340-learningJena

A program that uses Jena to build a model and query it with SPARQL.

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

cv

my latex code of curriculum viate

Language:TeXStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:0Issues:0

reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

tianshou

An elegant PyTorch deep reinforcement learning platform.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Duke-Tsinghua-MLSS-2017

Duke-Tsinghua Machine Learning Summer School 2017

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

GeniusInvokationSimulator

Genius Invokation Simulator in python;七圣召唤模拟器;

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

keras

Deep Learning for humans

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

lab-report

lab reports using markdown

Stargazers:0Issues:2Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

multiagent_mujoco

Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.

Language:PythonStargazers:0Issues:1Issues:0

overcoming-catastrophic

Implementation of "Overcoming catastrophic forgetting in neural networks" in Tensorflow

Language:Jupyter NotebookStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0

zhusuan

A Library for Bayesian Deep Learning, Generative Models, Based on Tensorflow

Language:PythonLicense:MITStargazers:0Issues:2Issues:0