Xiao Yu's repositories

awsome-distributed-training

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

License: MIT-0 | Stargazers: 0 | Issues: 0

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ComfyUI

A powerful and modular stable diffusion GUI with a graph/nodes interface.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0

ComfyUI-IF_AI_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.

Stargazers: 0 | Issues: 0

ComfyUI-Whisper

Transcribe audio and add subtitles to videos using Whisper in ComfyUI, licensed under CC BY-NC-SA 4.0

License: NOASSERTION | Stargazers: 0 | Issues: 0

ComfyUI_StoryDiffusion

You can use StoryDiffusion in ComfyUI

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

CosyVoice

Multi-lingual large voice generation model, providing full-stack inference, training and deployment capabilities.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

License: NOASSERTION | Stargazers: 0 | Issues: 0

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

License: Apache-2.0 | Stargazers: 0 | Issues: 0

edit-one-for-all

āœļø Edit One for All: Interactive Batch Image Editing (CVPR 2024)

Stargazers: 0 | Issues: 0

how-to-optim-algorithm-in-cuda

How to optimize some algorithms in CUDA.

Stargazers: 0 | Issues: 0

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language: Python | Stargazers: 0 | Issues: 0

IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

License: Apache-2.0 | Stargazers: 0 | Issues: 0

InstanceDiffusion

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Intelli-Agent

Chatbot Portal with Agent: Streamlined Workflow for Building Agent-Based Applications

License: Apache-2.0 | Stargazers: 0 | Issues: 0

lectures

Material for cuda-mode lectures

License: Apache-2.0 | Stargazers: 0 | Issues: 0

LivePortrait

Bring portraits to life!

License: MIT | Stargazers: 0 | Issues: 0

mamba

Mamba SSM architecture

License: Apache-2.0 | Stargazers: 0 | Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

License: Apache-2.0 | Stargazers: 0 | Issues: 0

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

License: NOASSERTION | Stargazers: 0 | Issues: 0

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

License: Apache-2.0 | Stargazers: 0 | Issues: 0

MotionClone

Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Stargazers: 0 | Issues: 0

PuLID

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

License: Apache-2.0 | Stargazers: 0 | Issues: 0

StoryDiffusion

Create Magic Story!

License: Apache-2.0 | Stargazers: 0 | Issues: 0

StoryGen

[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

License: MIT | Stargazers: 0 | Issues: 0

TPD

This is the official repository for the paper "Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On". CVPR 2024

Stargazers: 0 | Issues: 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

License: MIT | Stargazers: 0 | Issues: 0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

License: NOASSERTION | Stargazers: 0 | Issues: 0

zest_code

This is the official implementation of ZeST

Stargazers: 0 | Issues: 0