Guoliang He's repositories
Language:CBSD-3-Clause000
CuAssembler
An unofficial cuda assembler, for all generations of SASS, hopefully :)
Language:PythonMIT000
cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++NOASSERTION000
dagbo
Bayesian optimisation with semi-parametric DAG models
Language:PythonMIT000
dot_config
dot file for configuration
Language:Vim Script000
gpu-arch-microbenchmark
Dissecting NVIDIA GPU Architecture
000
huggingnft
Generate NFT or train new model in just few clicks! Train as much as you can, others will resume from checkpoint!
Language:Jupyter NotebookApache-2.0000
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:PythonMIT000
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:PythonNOASSERTION000
triton_fork
Development repository for the Triton language and compiler
Language:C++MIT000
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:PythonApache-2.0000