hlu1's repositories
AITemplate_public
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
Language:C++Apache-2.0000
cpuinfo
CPU INFOrmation library (x86/ARM, Linux/Mach/NaCl)
Language:Objective-CBSD-2-Clause000
cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++NOASSERTION000
dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
Language:C++NOASSERTION000
KeepingYouAwake
Prevents your Mac from going to sleep.
Language:Objective-CMIT000
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
MIT000
models
A repository for storing pre-trained Caffe2 models.
Language:PureBasic000
TASO
A Tensor Algebra SuperOptimizer for Deep Learning
Language:C++Apache-2.0000
Language:Python000