Edge-LLM

[DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting

Language: Python | 1700

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License: MIT | 326900

ShiftAddLLM

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Language: Python | License: Apache-2.0 | 6100

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language: Python | License: Apache-2.0 | 514600

TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks such as TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

Language: Python | License: NOASSERTION | 39000

licj15

Chaojian Li's starred repositories

Fov-3DGS

ServerlessLLM

ThunderKittens

grayskull-attention

QuaRot

unsloth

ac_math

LLM4HWDesign_Starting_Toolkit

ACT

Grendel-GS

cs249r_book

3D-Carbon

LogarithmicPosit

Edge-LLM

DeepSeek-V2

ShiftAddLLM

arxiv-latex-cleaner

TensorRT-Model-Optimizer

mg-verilog

Awesome-LLM-Inference

tiny-gpu

llama3

gpt-fast

owl

SA-GS

research-career-tools

ray-tracing-in-cuda

cutlass

warp

KIVI