188zzoon's starred repositories

coding-interview-university

A complete computer science study plan to become a software engineer.

TensorFlow-Examples

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 43312 | Issues: 2051 | Issues: 233

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 22088 | Issues: 218 | Issues: 122

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 12796 | Issues: 89 | Issues: 595

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 12459 | Issues: 116 | Issues: 900

triton

Development repository for the Triton language and compiler

gluon-cv

Gluon CV Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 5785 | Issues: 153 | Issues: 828

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python | License: MIT | Stargazers: 4131 | Issues: 33 | Issues: 433

awesome-deep-text-detection-recognition

A curated list of resources for text detection/recognition (optical character recognition) with deep learning methods.

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language: Python | License: MIT | Stargazers: 2144 | Issues: 24 | Issues: 159

Awesome-LLM-Inference

📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Language: Python | License: MIT | Stargazers: 1457 | Issues: 12 | Issues: 338

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language: Python | License: MIT | Stargazers: 1109 | Issues: 20 | Issues: 83

aphrodite-engine

PygmalionAI's large-scale inference engine

Language: Python | License: AGPL-3.0 | Stargazers: 798 | Issues: 12 | Issues: 136

optimum-quanto

A PyTorch quantization backend for optimum

Language: Python | License: Apache-2.0 | Stargazers: 662 | Issues: 8 | Issues: 86

Awesome-Quantization-Papers

List of papers related to neural network quantization in recent AI conferences and journals.

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language: Cuda | License: MIT | Stargazers: 341 | Issues: 3 | Issues: 8

TensorFlow2.0_ResNet

A ResNet (ResNet18, ResNet34, ResNet50, ResNet101, ResNet152) implementation using TensorFlow 2.0.

Language: Python | License: MIT | Stargazers: 314 | Issues: 5 | Issues: 9

GPTQ-triton

GPTQ inference Triton kernel

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 270 | Issues: 12 | Issues: 19
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 170 | Issues: 4 | Issues: 4

BitMat

An efficient implementation of the method proposed in "The Era of 1-bit LLMs"

Language: Python | License: Apache-2.0 | Stargazers: 147 | Issues: 6 | Issues: 10

cuda_tensorflow_opencv

Dockerfile with GPU support for TensorFlow and OpenCV

Language: Dockerfile | License: Apache-2.0 | Stargazers: 117 | Issues: 10 | Issues: 18

carrier-of-tricks-for-classification-pytorch

Carrier of tricks for image classification tutorials using PyTorch.

Language: Python | License: MIT | Stargazers: 100 | Issues: 7 | Issues: 0

ConvNets-TensorFlow2

⛵️ Implementation of a variety of popular image classification models using TensorFlow 2. [ResNet, GoogLeNet, VGG, Inception-v3, Inception-v4, MobileNet, MobileNet-v2, ShuffleNet, ShuffleNet-v2, etc.]

Language: Python | License: Apache-2.0 | Stargazers: 100 | Issues: 6 | Issues: 3

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | Stargazers: 80 | Issues: 8 | Issues: 81

gans-2.0

Generative Adversarial Networks in TensorFlow 2.0

Language: Python | License: MIT | Stargazers: 76 | Issues: 4 | Issues: 10

retinanet-tensorflow2.x

TensorFlow 2.x implementation of RetinaNet

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 40 | Issues: 4 | Issues: 3

LibTorch-ResNet-CIFAR

ResNet Implementation, Training, and Inference Using LibTorch C++ API

Language: C++ | License: MIT | Stargazers: 32 | Issues: 3 | Issues: 1

TritonTransformer

Transformer Implementation in Triton

Language: Python | License: MIT | Stargazers: 5 | Issues: 1 | Issues: 0