Mengzhao Chen (ChenMnZ)

Company: HKU-MMLab & Shanghai AI Lab

Location: Shanghai

Home Page: chenmnz.github.io

Mengzhao Chen's starred repositories

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 29130 | Issues: 190 | Issues: 4586

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 25567 | Issues: 211 | Issues: 228

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 11128 | Issues: 161 | Issues: 252

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9107 | Issues: 110 | Issues: 81

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4297 | Issues: 43 | Issues: 184

kimi-free-api

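🚀 KIMI AI long-context LLM reverse-engineered API for free trial testing [specialty: long-text interpretation and summarization]. Supports high-speed streaming output, agent conversations, web search, long-document interpretation, image OCR, and multi-turn dialogue, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces.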

Language: TypeScript | License: GPL-3.0 | Stargazers: 3558 | Issues: 30 | Issues: 111

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python | License: MIT | Stargazers: 1985 | Issues: 31 | Issues: 83

executorch

On-device AI across mobile, embedded and edge for PyTorch

Language: C++ | License: NOASSERTION | Stargazers: 1632 | Issues: 55 | Issues: 344

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

Language: Python | License: MIT | Stargazers: 1499 | Issues: 38 | Issues: 36

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | License: Apache-2.0 | Stargazers: 1313 | Issues: 17 | Issues: 49

quanto

A PyTorch quantization toolkit

Language: Python | License: Apache-2.0 | Stargazers: 613 | Issues: 8 | Issues: 65

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language: Python | License: Apache-2.0 | Stargazers: 375 | Issues: 9 | Issues: 25

BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Language: Python | License: MIT | Stargazers: 265 | Issues: 12 | Issues: 42

Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

QuaRot

Code for QuaRot, an end-to-end 4-bit inference of large language models.

Language: Python | License: Apache-2.0 | Stargazers: 236 | Issues: 11 | Issues: 34

llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Language: Python | License: Apache-2.0 | Stargazers: 196 | Issues: 9 | Issues: 8

FastV

[ECCV 2024] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

LLaMA3-Quantization

A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods.

Language: Python | License: Apache-2.0 | Stargazers: 92 | Issues: 1 | Issues: 13

decoupleQ

A quantization algorithm for LLM

Language: Cuda | License: Apache-2.0 | Stargazers: 89 | Issues: 2 | Issues: 11

MMT-Bench

ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

fast-hadamard-transform

Fast Hadamard transform in CUDA, with a PyTorch interface

Language: C | License: BSD-3-Clause | Stargazers: 79 | Issues: 3 | Issues: 5

bllama

1.58-bit LLaMa model

Language: Python | License: MIT | Stargazers: 77 | Issues: 11 | Issues: 0

LLaVA-PruMerge

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Language: Python | License: Apache-2.0 | Stargazers: 74 | Issues: 2 | Issues: 13

BitDistiller

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Language: Python | License: MIT | Stargazers: 56 | Issues: 3 | Issues: 6

svit

Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"

Language: Python | License: Apache-2.0 | Stargazers: 23 | Issues: 9 | Issues: 10

Language: Python | Stargazers: 18 | Issues: 0 | Issues: 0

Language: Shell | License: BSD-3-Clause-Clear | Stargazers: 17 | Issues: 0 | Issues: 1