shibei00

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Language:Python010

DouDiZhu

000

emacs-document

translate emacs documents to Chinese for convenient reference

000

football

Check out the new game server:

Language:PythonApache-2.0020

hok_env

Apache-2.0000

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language:C++NOASSERTION000

Learn-Vim

A book for learning the Vim editor the smart way.

NOASSERTION010

llama

Inference code for Llama models

NOASSERTION000

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

MIT000

oi-slides

我的信息学竞赛讲课课件

000

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++Apache-2.0020

poker-cfrm

A NLTH Poker Agent using Counterfactual Regret Minimization

Language:C++010

procgen

Procgen Benchmark: Procedurally Generated Game-Like Gym Environments

MIT000

python-mode

Vim python-mode. PyLint, Rope, Pydoc, breakpoints from box.

LGPL-3.0000

resume

个人中文简历 Latex 源码 https://hijiangtao.github.io/

MIT000

rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

MIT000

seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Language:PythonApache-2.0010

Seq2seqChatbots

A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.

Language:PythonMIT020

tetris_mcts

MCTS project for Tetris

Language:Python010

text-to-text-transfer-transformer

Apache-2.0000

tleague_projpage

000

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language:PythonApache-2.0000

trfl

TensorFlow Reinforcement Learning

Language:PythonApache-2.0010