sheng-han-zhang's repositories

Language:PythonStargazers:1Issues:0Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-game-ai

Awesome Game AI materials of Multi-Agent Reinforcement Learning

License:MITStargazers:0Issues:0Issues:0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Language:PythonStargazers:0Issues:0Issues:0

Deep-Reinforcement-Learning-Hands-On

Hands-on Deep Reinforcement Learning, published by Packt

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepRole

The code used to power DeepRole

Language:C++Stargazers:0Issues:0Issues:0

DI-star

An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

jynew

金庸群侠传3D重制版

Language:C#License:NOASSERTIONStargazers:0Issues:0Issues:0

lykos

Werewolf, the popular detective/social party game (a theme of Mafia)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

Stargazers:0Issues:0Issues:0

mcts

An implementation of Monte Carlo Tree Search in python

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

mathematics_dataset

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

melee-ai

Super Smash Bros. Melee (SSBM) AI

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

License:MITStargazers:0Issues:0Issues:0

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

License:MITStargazers:0Issues:0Issues:0

PyIMDB

In-memory database for python like a Redis(?). It's my learning sandbox of grpc.

License:MITStargazers:0Issues:0Issues:0

rl-baselines3-zoo

A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

sac-discrete.pytorch

A PyTorch implementation of SAC-Discrete.

License:MITStargazers:0Issues:0Issues:0

shakespeare

The Complete Works of William Shakespeare hosted at http://shakespeare.mit.edu/

Stargazers:0Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

License:MITStargazers:0Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning platform.

License:MITStargazers:0Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

License:MITStargazers:0Issues:0Issues:0