Mengzhao Chen (ChenMnZ)

ChenMnZ

Geek Repo

Company:HKU-MMLab & ShangHai AI Lab

Location:ShangHai

Home Page:chenmnz.github.io

Github PK Tool:Github PK Tool

Mengzhao Chen's starred repositories

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:29131Issues:190Issues:4587

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25570Issues:211Issues:228

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11128Issues:161Issues:252

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonLicense:Apache-2.0Stargazers:9107Issues:110Issues:81

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4297Issues:43Issues:184

kimi-free-api

🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。

Language:TypeScriptLicense:GPL-3.0Stargazers:3558Issues:30Issues:111

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

executorch

On-device AI across mobile, embedded and edge for PyTorch

Language:C++License:NOASSERTIONStargazers:1632Issues:55Issues:344

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language:PythonLicense:MITStargazers:1499Issues:38Issues:36

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:882Issues:19Issues:68

optimum-quanto

A pytorch quantization backend for optimum

Language:PythonLicense:Apache-2.0Stargazers:724Issues:8Issues:103

sparsegpt

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Language:PythonLicense:Apache-2.0Stargazers:676Issues:16Issues:32

MobiLlama

MobiLlama : Small Language Model tailored for edge devices

Language:PythonLicense:Apache-2.0Stargazers:574Issues:13Issues:14

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language:PythonLicense:Apache-2.0Stargazers:375Issues:9Issues:25

BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Language:PythonLicense:MITStargazers:265Issues:12Issues:42

Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

QuaRot

Code for QuaRot, an end-to-end 4-bit inference of large language models.

Language:PythonLicense:Apache-2.0Stargazers:236Issues:11Issues:34

Incremental-Network-Quantization

Caffe Implementation for Incremental network quantization

Language:C++License:NOASSERTIONStargazers:190Issues:16Issues:40

KIVI

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Language:PythonLicense:MITStargazers:189Issues:5Issues:21

BitMat

An efficent implementation of the method proposed in "The Era of 1-bit LLMs"

Language:PythonLicense:Apache-2.0Stargazers:152Issues:6Issues:10

QUICK

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

Language:PythonLicense:MITStargazers:106Issues:1Issues:6
Language:PythonLicense:Apache-2.0Stargazers:92Issues:1Issues:13

MMT-Bench

ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

fast-hadamard-transform

Fast Hadamard transform in CUDA, with a PyTorch interface

Language:CLicense:BSD-3-ClauseStargazers:79Issues:3Issues:5

bllama

1.58-bit LLaMa model

Language:PythonLicense:MITStargazers:77Issues:11Issues:0

LLaVA-PruMerge

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Language:PythonLicense:Apache-2.0Stargazers:74Issues:2Issues:13

svit

Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"

Language:PythonLicense:Apache-2.0Stargazers:23Issues:9Issues:10
Language:ShellLicense:BSD-3-Clause-ClearStargazers:17Issues:0Issues:1