Ameer Azam (AMEERAZAM08)

AMEERAZAM08

User data from Github https://github.com/AMEERAZAM08

Company:Pixis AI

Location:bangalore

GitHub:@AMEERAZAM08

Twitter:@Ameerazam18

Ameer Azam's starred repositories

IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Language:PythonLicense:Apache-2.0Stargazers:20085Issues:144Issues:473

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonLicense:MITStargazers:9663Issues:654Issues:157

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:6699Issues:60Issues:167

SwinIR

SwinIR: Image Restoration Using Swin Transformer (official repository)

Language:PythonLicense:Apache-2.0Stargazers:4561Issues:53Issues:154

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:2813Issues:42Issues:103

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language:PythonLicense:NOASSERTIONStargazers:2364Issues:42Issues:69

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter NotebookStargazers:1559Issues:21Issues:36

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:1434Issues:21Issues:71

CatVTON

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).

Language:PythonLicense:NOASSERTIONStargazers:1054Issues:12Issues:88

MetaPortrait

[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation

Language:PythonLicense:MITStargazers:539Issues:63Issues:33

FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

Language:PythonLicense:Apache-2.0Stargazers:499Issues:15Issues:22

Phased-Consistency-Model

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Language:PythonLicense:Apache-2.0Stargazers:405Issues:19Issues:21

TalkingGaussian

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

swap-anything

Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"

Language:PythonLicense:MITStargazers:236Issues:28Issues:7

Diffree

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:232Issues:6Issues:13

3DitScene

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

MultiTalk

[INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset"

StableAudioWebUI

A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0

Language:PythonLicense:Apache-2.0Stargazers:47Issues:4Issues:1

Noise-free-Optimization-in-Early-Training-Steps-for-Image-Super-Resolution

[AAAI2024] Official Repository for Noise-free Optimization in Early Training Steps for Image Super-Resolution

mindiffusion

Repository of lessons exploring image diffusion models, focused on understanding and education.

Language:PythonLicense:MITStargazers:36Issues:0Issues:0
Language:Jupyter NotebookStargazers:32Issues:0Issues:0

CharacterGen

[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

Language:JavaScriptLicense:AGPL-3.0Stargazers:9Issues:1Issues:0

SPEAK-hack

Using Claude Sonnet to reverse engineer paper Listen, Disentangle, and Control: Controllable Speech-Driven Talking Head Generation

Language:PythonLicense:MITStargazers:6Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

Upscale-A-Video

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Stargazers:1Issues:0Issues:0

e4s

(CVPR 2023) E4S: Fine-grained Face Swapping via Regional GAN Inversion

License:MITStargazers:1Issues:0Issues:0