sheng-han-zhang's repositories
alpaca-lora
Instruct-tune LLaMA on consumer hardware
awesome-game-ai
Awesome Game AI materials of Multi-Agent Reinforcement Learning
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
DeepRole
The code used to power DeepRole
DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
jynew
金庸群侠传3D重制版
lykos
Werewolf, the popular detective/social party game (a theme of Mafia)
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
mcts
An implementation of Monte Carlo Tree Search in python
mathematics_dataset
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
Megatron-LM
Ongoing research training transformer models at scale
melee-ai
Super Smash Bros. Melee (SSBM) AI
overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
PyIMDB
In-memory database for python like a Redis(?). It's my learning sandbox of grpc.
rl-baselines3-zoo
A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
sac-discrete.pytorch
A PyTorch implementation of SAC-Discrete.
shakespeare
The Complete Works of William Shakespeare hosted at http://shakespeare.mit.edu/
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
tianshou
An elegant PyTorch deep reinforcement learning platform.
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)