shibei00

shibei00

Geek Repo

Company:The Chinese University of Hong Kong

Location:Hong Kong

Home Page:shibei00.github.io

Github PK Tool:Github PK Tool

shibei00's repositories

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

apex

A continuous deep reinforcement learning framework for robotics

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

awd-lstm-lm

LSTM and QRNN Language Model Toolkit for PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

awesome-deep-rl

For deep RL and the future of AI.

Language:HTMLLicense:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

License:Apache-2.0Stargazers:0Issues:0Issues:0

DeepLearning-500-questions

深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06

License:GPL-3.0Stargazers:0Issues:0Issues:0

DeepRL_Algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

emacs-document

translate emacs documents to Chinese for convenient reference

Stargazers:0Issues:0Issues:0

football

Check out the new game server:

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

Learn-Vim

A book for learning the Vim editor the smart way.

License:NOASSERTIONStargazers:0Issues:1Issues:0

llama

Inference code for Llama models

License:NOASSERTIONStargazers:0Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

License:MITStargazers:0Issues:0Issues:0

oi-slides

我的信息学竞赛讲课课件

Stargazers:0Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

poker-cfrm

A NLTH Poker Agent using Counterfactual Regret Minimization

Language:C++Stargazers:0Issues:1Issues:0

procgen

Procgen Benchmark: Procedurally Generated Game-Like Gym Environments

License:MITStargazers:0Issues:0Issues:0

python-mode

Vim python-mode. PyLint, Rope, Pydoc, breakpoints from box.

License:LGPL-3.0Stargazers:0Issues:0Issues:0

resume

个人中文简历 Latex 源码 https://hijiangtao.github.io/

License:MITStargazers:0Issues:0Issues:0

rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

License:MITStargazers:0Issues:0Issues:0

seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Seq2seqChatbots

A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

tetris_mcts

MCTS project for Tetris

Language:PythonStargazers:0Issues:1Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

trfl

TensorFlow Reinforcement Learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0