shibei00's repositories
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
awesome-deep-rl
For deep RL and the future of AI.
ColossalAI
Making large AI models cheaper, faster and more accessible
DeepLearning-500-questions
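Deep Learning 500 Questions: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and other popular areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06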
DeepRL_Algorithms
Implementations of deep RL algorithms in PyTorch and TensorFlow 2, written to be easy to read and understand (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
emacs-document
Translates Emacs documents into Chinese for convenient reference
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
llama
Inference code for Llama models
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
oi-slides
My lecture slides for informatics olympiad (competitive programming) courses
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
poker-cfrm
An NLTH Poker Agent using Counterfactual Regret Minimization
procgen
Procgen Benchmark: Procedurally Generated Game-Like Gym Environments
python-mode
Vim python-mode. PyLint, Rope, Pydoc, and breakpoints out of the box.
resume
LaTeX source for a personal Chinese-language résumé https://hijiangtao.github.io/
rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Seq2seqChatbots
A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.
tetris_mcts
MCTS project for Tetris
trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"