AMEERAZAM08

Pixis AI

Bangalore

Ameer Azam's repositories

sam-sdxl-inpainting

Language: Python | 11 2 1

anywhere-multi-agent

Language: Jupyter Notebook | 1 0 0

ConsistentID

Customized ID Consistent for human

Language: Python | 1 0 0

InstantStyle

Language: Jupyter Notebook | 1 0 0

3DitScene

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

0 0 0

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

License: Apache-2.0 | 0 0 0

APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

License: GPL-3.0 | 0 0 0

CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

License: MIT | 0 0 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | 0 0 0

Generative_Deep_Learning_2nd_Edition

The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.

Language: Jupyter Notebook | License: Apache-2.0 | 0 0 0

GPAvatar

[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)

License: MIT | 0 0 0

HiDiffusion

Language: Jupyter Notebook | License: Apache-2.0 | 0 0 0

lightning_track

[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)

0 0 0

LipSync3D

Language: Jupyter Notebook | 0 0 0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | 0 0 0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | 0 0 0

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python | License: MIT | 0 0 0

MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

License: NOASSERTION | 0 0 0

mindiffusion

Repository of lessons exploring image diffusion models, focused on understanding and education.

0 0 0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Language: Python | License: NOASSERTION | 0 0 0

Parts2Whole

[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation

License: MIT | 0 0 0

Phased-Consistency-Model

Boosting the performance of consistency models with PCM!

Language: Python | License: Apache-2.0 | 0 0 0

QA-Transformer

Language: Python | 0 1 0

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | 0 0 0

StableAudioWebUI

Language: Python | License: Apache-2.0 | 0 0 0

SwinIR

SwinIR: Image Restoration Using Swin Transformer (official repository)

License: Apache-2.0 | 0 0 0

Unet-Denoise

Language: Python | 0 1 0

VASA-1-hack

Using Claude Opus to reverse engineer code from VASA white paper - WIP - (this is for La Raza 🎷)

License: MIT | 0 0 0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

License: NOASSERTION | 0 0 0

VOODOO3D-official

Official implementation for the paper "VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment"

License: MIT | 0 0 0