Min-Hung (Steve) Chen's starred repositories
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
fsdp_qlora
Training LLMs with QLoRA + FSDP
Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications
Awesome-Parameter-Efficient-Transfer-Learning
Collection of awesome parameter-efficient fine-tuning resources.
Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
LeftRefill
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model (CVPR2024)
paper-template
ECCV 2024 paper template
DoRA-project-page
This is the project webpage of: DoRA: Weight-Decomposed Low-Rank Adaptation