hijkzzz's repositories

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language: Python | License: Apache-2.0 | Stargazers: 583 | Issues: 16 | Issues: 40

alpha-zero-gomoku

A Multi-threaded Implementation of AlphaZero

cuda-neural-network

Convolutional Neural Network with CUDA (MNIST 99.23%)

mini-os-kernel

A mini Unix-Like OS kernel

Language: C | Stargazers: 93 | Issues: 4 | Issues: 0

reinforcement-learning-wechat-jump

Reinforcement Learning for WeChat Jump

mini-interpreter

A Simple Scripting Language

Language: Go | Stargazers: 78 | Issues: 3 | Issues: 0

prisma

Prisma

Language: Python | Stargazers: 71 | Issues: 4 | Issues: 0

web-server

A Web Server designed with Reactor I/O Model

Language: C++ | Stargazers: 64 | Issues: 3 | Issues: 0

dht-crawler

A DHT Crawler based on Goroutine

Language: Go | Stargazers: 63 | Issues: 3 | Issues: 0

deep-learning-notes

Deep Learning Notes

noisy-mappo

Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

Language: Python | License: MIT | Stargazers: 44 | Issues: 3 | Issues: 2

reinforcement-learning-trading-robot

Trading Robot based on LSTM-PPO

dotfiles

Configuration files

Language: Shell | Stargazers: 3 | Issues: 3 | Issues: 0
Language: HTML | Stargazers: 3 | Issues: 3 | Issues: 0

leetcode

LeetCode & LintCode

Language: C++ | Stargazers: 2 | Issues: 3 | Issues: 0
Language: HTML | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Awesome-LLM-Inference

📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 3 | Issues: 0

mame-street-fighter-3-ai

Reinforcement Learning for Street Fighter III: 3rd Strike

Language: Python | Stargazers: 0 | Issues: 3 | Issues: 0

NTU-Thesis-LaTeX-Template

🎓 Unofficial LaTeX templates for your graduate thesis (both master's theses and doctoral dissertations) at National Taiwan University.

Language: TeX | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

reinforcement-learning.pytorch

Reinforcement Learning Library

Language: Python | Stargazers: 0 | Issues: 3 | Issues: 0

staging

iclr-blogposts.github.io/staging

Language: HTML | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

termux-jupyter

Termux init script

Language: Shell | Stargazers: 0 | Issues: 3 | Issues: 0