Ameer Azam's starred repositories
DiffSynth-Studio
Enjoy the magic of Diffusion models!
stable-audio-tools
Generative models for conditional audio generation
InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
MetaPortrait
[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Phased-Consistency-Model
[NeurIPS 2024] Boosting the performance of consistency models with PCM!
TalkingGaussian
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
swap-anything
Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"
StableAudioWebUI
A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0
Noise-free-Optimization-in-Early-Training-Steps-for-Image-Super-Resolution
[AAAI2024] Official Repository for Noise-free Optimization in Early Training Steps for Image Super-Resolution
mindiffusion
Repository of lessons exploring image diffusion models, focused on understanding and education.
CharacterGen
[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
SPEAK-hack
Using Claude Sonnet to reverse engineer paper Listen, Disentangle, and Control: Controllable Speech-Driven Talking Head Generation
Upscale-A-Video
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution