HaoshengZou

Haosheng Zou (邹昊晟)'s repositories

DeepSpeedExamples

SFT+RLHF on baichuan-7B

Language:PythonApache-2.0500

blog

learning blog towards a programmer

1 20

ELF

An End-To-End, Lightweight and Flexible Platform for Game Research

Language:C++NOASSERTION1 10

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language:PythonNOASSERTION1 20

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonMIT1 10

algorithm-pattern-python

Python version of algorithm-pattern

Language:Python010

awesome-ml4co

Language:Python010

caffe_test

for viewing source codes on Windows

Language:C++NOASSERTION000

cse8340-learningJena

A program that uses Jena to build a model and query it with SPARQL.

Language:JavaNOASSERTION000

cv

my latex code of curriculum viate

Language:TeX020

DGM-kernel

Language:Python020

minGPT24

Language:Python000

reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.

Language:PythonMIT020

tianshou

An elegant PyTorch deep reinforcement learning platform.

Language:PythonMIT000

trl

Language:PythonApache-2.0000

Duke-Tsinghua-MLSS-2017

Duke-Tsinghua Machine Learning Summer School 2017

Language:Jupyter NotebookApache-2.0020

GeniusInvokationSimulator

Genius Invokation Simulator in python；七圣召唤模拟器；

Language:PythonMIT010

implementation-matters

Language:PythonMIT010

keras

Deep Learning for humans

Language:PythonNOASSERTION020

lab-report

lab reports using markdown

020

minGPT

Language:Jupyter NotebookMIT010

multiagent_mujoco

Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.

Language:Python010

overcoming-catastrophic

Implementation of "Overcoming catastrophic forgetting in neural networks" in Tensorflow

Language:Jupyter Notebook020

TStarBot1

Language:Python020

zhusuan

A Library for Bayesian Deep Learning, Generative Models, Based on Tensorflow

Language:PythonMIT020