188zzoon's starred repositories

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:781Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:20960Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:11648Issues:0Issues:0

TritonTransformer

Transformer Implementation in Triton

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:11748Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:170Issues:0Issues:0

BitMat

An efficent implementation of the method proposed in "The Era of 1-bit LLMs"

Language:PythonLicense:Apache-2.0Stargazers:143Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:11757Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:78Issues:0Issues:0

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language:CudaLicense:MITStargazers:314Issues:0Issues:0

coding-interview-university

A complete computer science study plan to become a software engineer.

License:CC-BY-SA-4.0Stargazers:296331Issues:0Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:4006Issues:0Issues:0

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Language:PythonLicense:MITStargazers:1363Issues:0Issues:0

aphrodite-engine

PygmalionAI's large-scale inference engine

Language:PythonLicense:AGPL-3.0Stargazers:728Issues:0Issues:0

Awesome-LLM-Inference

đź“–A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:1774Issues:0Issues:0

optimum-quanto

A pytorch quantization backend for optimum

Language:PythonLicense:Apache-2.0Stargazers:630Issues:0Issues:0

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:2031Issues:0Issues:0

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonLicense:MITStargazers:1087Issues:0Issues:0

Awesome-Quantization-Papers

List of papers related to neural network quantization in recent AI conferences and journals.

License:MITStargazers:340Issues:0Issues:0

GPTQ-triton

GPTQ inference Triton kernel

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:266Issues:0Issues:0
Language:CudaLicense:MITStargazers:19Issues:0Issues:0

cuda_tensorflow_opencv

DockerFile with GPU support for TensorFlow and OpenCV

Language:DockerfileLicense:Apache-2.0Stargazers:118Issues:0Issues:0

LibTorch-ResNet-CIFAR

ResNet Implementation, Training, and Inference Using LibTorch C++ API

Language:C++License:MITStargazers:31Issues:0Issues:0

awesome-deep-text-detection-recognition

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

License:Apache-2.0Stargazers:2494Issues:0Issues:0

TensorFlow-Examples

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:43281Issues:0Issues:0

gans-2.0

Generative Adversarial Networks in TensorFlow 2.0

Language:PythonLicense:MITStargazers:76Issues:0Issues:0

retinanet-tensorflow2.x

TensorFlow2.x implementation of RetinaNet

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:40Issues:0Issues:0

TensorFlow2.0_ResNet

A ResNet(ResNet18, ResNet34, ResNet50, ResNet101, ResNet152) implementation using TensorFlow-2.0.

Language:PythonLicense:MITStargazers:314Issues:0Issues:0

gluon-cv

Gluon CV Toolkit

Language:PythonLicense:Apache-2.0Stargazers:5771Issues:0Issues:0

carrier-of-tricks-for-classification-pytorch

carrier of tricks for image classification tutorials using pytorch.

Language:PythonLicense:MITStargazers:101Issues:0Issues:0