B. Shen's repositories
sbwww.github.io
personal homepage
LLM-Shearing
Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
bert-squeeze
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
ChineseNLPCorpus
Chinese natural language processing datasets, useful as material for everyday experiments. Additions and pull requests are welcome.
cs-self-learning
A guide to self-studying computer science
Daxuexi
Automatically completes Beijing's Qingnian Daxuexi (Youth Study) sessions using GitHub Actions
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Diffusion-BERT
Implementation of DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Diffusion-LM
Diffusion-LM
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
IoT-For-Beginners
12 Weeks, 24 Lessons, IoT for All!
lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally, together with the recently released LLM data processing library datatrove and LLM training library nanotron.
LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
mlx
MLX: An array framework for Apple silicon
RWKV-Android
Run RWKV V4 ONNX on an Android CPU
smoothquant
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
UCAS_exam_review
Course resources for cybersecurity and computer science programs at the University of Chinese Academy of Sciences (UCAS): Advanced Artificial Intelligence, Deep Learning, Applied Cryptography, Machine Learning, Information Hiding, Information Theory and Coding, Multimedia Coding, and more
wanda
A simple and effective LLM pruning approach.