hijkzzz's repositories
alpha-zero-gomoku
A Multi-threaded Implementation of AlphaZero
cuda-neural-network
Convolutional Neural Network with CUDA (MNIST 99.23%)
deep-reinforcement-learning-notes
Deep Reinforcement Learning Notes
mini-os-kernel
A mini Unix-Like OS kernel
reinforcement-learning-wechat-jump
Reinforcement Learning for WeChat Jump
mini-interpreter
A Simple Scripting Language
web-server
A Web Server designed with Reactor I/O Model
dht-crawler
A DHT Crawler based on Goroutine
deep-learning-notes
Deep Learning Notes
noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
reinforcement-learning-trading-robot
Trading Robot based on LSTM-PPO
hijkzzz.github.io
Homepage
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
mame-street-fighter-3-ai
Reinforcement Learning for Street Fighter III: 3rd Strike
NTU-Thesis-LaTeX-Template
🎓 Unofficial LaTeX templates for your graduate thesis (both master's theses and doctoral dissertations) at National Taiwan University. 國立臺灣大學碩博士學位論文 LaTeX 模板
reinforcement-learning.pytorch
Reinforcement Learning Library
termux-jupyter
Termux init script