Min-Hung (Steve) Chen (cmhungsteve)

cmhungsteve

Geek Repo

Company:@NVIDIA

Location:Taipei City, Taiwan

Home Page:https://minhungchen.netlify.app/

Twitter:@CMHungSteven

Github PK Tool:Github PK Tool


Organizations
MediaTek-NeuroPilot

Min-Hung (Steve) Chen's starred repositories

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:23669Issues:158Issues:3688

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:9970Issues:82Issues:282

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonLicense:GPL-3.0Stargazers:8341Issues:53Issues:412

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language:PythonLicense:Apache-2.0Stargazers:7166Issues:75Issues:650

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2435Issues:29Issues:372

Vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

LyCORIS

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Language:PythonLicense:Apache-2.0Stargazers:2014Issues:19Issues:125

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language:PythonLicense:Apache-2.0Stargazers:1694Issues:29Issues:208

fsdp_qlora

Training LLMs with QLoRA + FSDP

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1263Issues:20Issues:34

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:795Issues:18Issues:60

Mamba_State_Space_Model_Paper_List

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonLicense:NOASSERTIONStargazers:442Issues:20Issues:15

Awesome-Parameter-Efficient-Transfer-Learning

Collection of awesome parameter-efficient fine-tuning resources.

Awesome-Diffusion-Model-Based-Image-Editing-Methods

Diffusion Model-Based Image Editing: A Survey (arXiv)

DoRA

[ICML2024] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Language:PythonLicense:NOASSERTIONStargazers:253Issues:10Issues:8

TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

Language:PythonLicense:NOASSERTIONStargazers:229Issues:8Issues:19

VILA

VILA - A multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:136Issues:9Issues:13
Language:PythonLicense:NOASSERTIONStargazers:111Issues:8Issues:3

DoRA

Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"

mvtorch

a Pytorch library for multi-view 3D understanding and generation

Language:PythonLicense:MITStargazers:80Issues:5Issues:11

merlin

Merlin: Empowering Multimodal LLMs with Foresight Minds

Language:PythonLicense:NOASSERTIONStargazers:69Issues:6Issues:3

LeftRefill

LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model (CVPR2024)

Language:PythonLicense:Apache-2.0Stargazers:47Issues:9Issues:6

paper-template

ECCV 2024 paper template

Language:TeXLicense:MITStargazers:45Issues:6Issues:6

Sports-QA

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

JORA

JORA: JAX Tensor-Parallel LoRA Library

Language:PythonLicense:NOASSERTIONStargazers:20Issues:2Issues:1
Language:PythonStargazers:17Issues:0Issues:0
Stargazers:5Issues:0Issues:0

DoRA-project-page

This is the project webpage of: DoRA: Weight-Decomposed Low-Rank Adaptation

Language:Jupyter NotebookStargazers:1Issues:0Issues:0