Beast code in Giters

Xiao Yu's repositories

awsome-distributed-training

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

MIT-0000

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Apache-2.0000

ComfyUI

A powerful and modular stable diffusion GUI with a graph/nodes interface.

Language:PythonGPL-3.0000

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.

000

ComfyUI-Whisper

Transcribe audio and add subtitles to videos using Whisper in ComfyUI, licensed under CC BY-NC-SA 4.0

NOASSERTION000

ComfyUI_StoryDiffusion

You can using StoryDiffusion in ComfyUI

Language:PythonApache-2.0000

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Apache-2.0000

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

NOASSERTION000

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Apache-2.0000

edit-one-for-all

✏️ Edit One for All: Interactive Batch Image Editing (CVPR 2024)

000

how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

000

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language:Python000

IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Apache-2.0000

InstanceDiffusion

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Apache-2.0000

Intelli-Agent

Chatbot Portal with Agent: Streamlined Workflow for Building Agent-Based Applications

Apache-2.0000

lectures

Material for cuda-mode lectures

Apache-2.0000

LivePortrait

Bring portraits to life!

MIT000

mamba

Mamba SSM architecture

Apache-2.0000

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Apache-2.0000

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

NOASSERTION000

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Apache-2.0000

MotionClone

Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation

000

PuLID

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Apache-2.0000

seed-tts-eval

000

StoryDiffusion

Create Magic Story!

Apache-2.0000

StoryGen

[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

MIT000

TPD

This is the official repository for the paper "Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On". CVPR 2024

000

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

MIT000

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

NOASSERTION000

zest_code

This is the official implementation of ZeST

000