mudassirkhan19

Mudassir Khan's starred repositories

EmoTalk_release

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Language:PythonNOASSERTION32700

IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Language:PythonNOASSERTION173300

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonMIT822700

hyperIQA

Source code for the CVPR'20 paper "Blindly Assess Image Quality in the Wild Guided by A Self-Adaptive Hyper Network"

Language:PythonMIT35400

content-debiased-fvd

[CVPR 2024] On the Content Bias in Fréchet Video Distance

Language:PythonMIT6400

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonMIT162800

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Language:PythonApache-2.0110600

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02126300

PGDiff

[NeurIPS 2023] PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

Language:PythonNOASSERTION12900

Emote-hack

Emote Portrait Alive - using ai to reverse engineer code from white paper. (abandoned)

Language:Python17100

VideoMamba

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Language:PythonApache-2.074500

SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

Language:Python51600

Sora

Implementation of the premier Text to Video model from OpenAI

Language:PythonMIT5600

lumiere-pytorch

Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch

Language:PythonMIT23800

ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

Language:Python195300

sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

Language:PythonApache-2.0121500

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language:PythonMIT95500

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonApache-2.0272200

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.0250200

GSM

Gaussian Shell Maps for Efficient 3D Human Generation (CVPR 2024)

Language:Jupyter Notebook18700

CoDeF

[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language:PythonNOASSERTION481400

Upscale-A-Video

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

89700

ghostbuster

Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)

Language:PythonNOASSERTION12600

TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

Language:PythonMIT153600

nendo

The Nendo AI Audio Tool Suite

Language:PythonMIT20500

ml-engineering

Machine Learning Engineering Open Book

Language:PythonCC-BY-SA-4.01047700

ganhacks

starter from "How to Train a GAN?" at NIPS2016

1141100

czkawka

Multi functional app to find duplicates, empty folders, similar images etc.

Language:RustNOASSERTION1890100

E4S2024

Official Implementation of 'E4S: Fine-grained Face Swapping via Editing With Regional GAN Inversion'

Language:PythonMIT12200

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonMIT248300