188zzoon's starred repositories
coding-interview-university
A complete computer science study plan to become a software engineer.
TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
flash-attention
Fast and memory-efficient exact attention
awesome-deep-text-detection-recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
aphrodite-engine
PygmalionAI's large-scale inference engine
optimum-quanto
A pytorch quantization backend for optimum
Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
TensorFlow2.0_ResNet
A ResNet(ResNet18, ResNet34, ResNet50, ResNet101, ResNet152) implementation using TensorFlow-2.0.
GPTQ-triton
GPTQ inference Triton kernel
cuda_tensorflow_opencv
DockerFile with GPU support for TensorFlow and OpenCV
carrier-of-tricks-for-classification-pytorch
carrier of tricks for image classification tutorials using pytorch.
ConvNets-TensorFlow2
⛵️ Implementation a variety of popular Image Classification Models using TensorFlow2. [ResNet, GoogLeNet, VGG, Inception-v3, Inception-v4, MobileNet, MobileNet-v2, ShuffleNet, ShuffleNet-v2, etc...]
retinanet-tensorflow2.x
TensorFlow2.x implementation of RetinaNet
LibTorch-ResNet-CIFAR
ResNet Implementation, Training, and Inference Using LibTorch C++ API
TritonTransformer
Transformer Implementation in Triton