cmhungsteve

Min-Hung (Steve) Chen's starred repositories

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonApache-2.023669 158 3688

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookNOASSERTION9970 82 282

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonGPL-3.08341 53 412

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language:PythonApache-2.07166 75 650

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookApache-2.02435 29 372

Vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:Python2429 31 84

LyCORIS

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Language:PythonApache-2.02014 19 125

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language:PythonApache-2.01694 29 208

fsdp_qlora

Training LLMs with QLoRA + FSDP

Language:Jupyter NotebookApache-2.01263 20 34

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonApache-2.0795 18 60

Mamba_State_Space_Model_Paper_List

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications

MIT456 11 5

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonNOASSERTION442 20 15

Awesome-Parameter-Efficient-Transfer-Learning

Collection of awesome parameter-efficient fine-tuning resources.

379 7 2

Awesome-Diffusion-Model-Based-Image-Editing-Methods

Diffusion Model-Based Image Editing: A Survey (arXiv)

MIT313 12 4

DoRA

[ICML2024] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Language:PythonNOASSERTION253 10 8

TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

Language:PythonNOASSERTION229 8 19

cmhungsteve

Min-Hung (Steve) Chen's starred repositories

LLaMA-Factory

Awesome-LLM

llama-recipes

yolov9

litgpt

adapters

Vim

LyCORIS

lorax

fsdp_qlora

VILA

Mamba_State_Space_Model_Paper_List

RADIO

Awesome-Parameter-Efficient-Transfer-Learning

Awesome-Diffusion-Model-Based-Image-Editing-Methods

DoRA

TensorRT-Model-Optimizer

VILA

LITA

DoRA

mvtorch

merlin

LeftRefill

paper-template

Sports-QA

JORA

VQscore

Image-Text-Co-Decomposition

PartDistill

DoRA-project-page