Yuechen's starred repositories
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
flash-attention
Fast and memory-efficient exact attention
PnPInversion
[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
visualblocks
Visual Blocks for ML is a Google visual programming framework that lets you create ML pipelines in a no-code graph editor. You – and your users – can quickly prototype workflows by connecting drag-and-drop ML components, including models, user inputs, processors, and visualizations.
StyleCrafter
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
Prompt-Highlighter
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
DynamiCrafter
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
ComfyUI_UltimateSDUpscale
ComfyUI nodes for the Ultimate Stable Diffusion Upscale script by Coyote-A.