Chenxi's repositories
insanely-fast-whisper
Incredibly fast Whisper-large-v3
Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
DiffMorpher
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
chenxwh.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Smooth-Diffusion
[CVPR 2024] Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Faster-Diffusion
Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
seacrowd-datahub
A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
StoryDiffusion
Create Magic Story!