Ameer Azam's repositories
res-adapter
Official implementation of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".
ConsistentID
Customized ID Consistent for human
DragAnything
Official code for 'DragAnything: Motion Control for Anything using Entity Representation'
sdxs
Official repo of paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
3DitScene
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
bark
🔊 Text-Prompted Generative Audio Model
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
FLAME-Universe
Summary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model
Generative_Deep_Learning_2nd_Edition
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
ImagenHub
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
OpenVoice
Instant voice cloning by MyShell.
Parts2Whole
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Phased-Consistency-Model
Boosting the performance of consistency models with PCM!
PhysDreamer
Code for PhysDreamer
ProFusion
Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
stable-audio-tools
Generative models for conditional audio generation
VASA-1-hack
Using Claude Opus to reverse engineer code from VASA white paper - WIP - (this is for La Raza 🎷)