188zzoon's starred repositories
Triton-Puzzles
Puzzles for learning Triton
flash-attention
Fast and memory-efficient exact attention
TritonTransformer
Transformer Implementation in Triton
SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
coding-interview-university
A complete computer science study plan to become a software engineer.
aphrodite-engine
PygmalionAI's large-scale inference engine
Awesome-LLM-Inference
đź“–A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
optimum-quanto
A pytorch quantization backend for optimum
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
GPTQ-triton
GPTQ inference Triton kernel
cuda_tensorflow_opencv
DockerFile with GPU support for TensorFlow and OpenCV
LibTorch-ResNet-CIFAR
ResNet Implementation, Training, and Inference Using LibTorch C++ API
awesome-deep-text-detection-recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
retinanet-tensorflow2.x
TensorFlow2.x implementation of RetinaNet
TensorFlow2.0_ResNet
A ResNet(ResNet18, ResNet34, ResNet50, ResNet101, ResNet152) implementation using TensorFlow-2.0.
carrier-of-tricks-for-classification-pytorch
carrier of tricks for image classification tutorials using pytorch.