Mudassir Khan (mudassirkhan19)

Company: Invideo

Location: Mumbai

Mudassir Khan's starred repositories

EmoTalk_release

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Language: Python | License: NOASSERTION | Stargazers: 327 | Issues: 0

IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Language: Python | License: NOASSERTION | Stargazers: 1733 | Issues: 0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 8227 | Issues: 0

hyperIQA

Source code for the CVPR'20 paper "Blindly Assess Image Quality in the Wild Guided by A Self-Adaptive Hyper Network"

Language: Python | License: MIT | Stargazers: 354 | Issues: 0

content-debiased-fvd

[CVPR 2024] On the Content Bias in Fréchet Video Distance

Language: Python | License: MIT | Stargazers: 64 | Issues: 0

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language: Python | License: MIT | Stargazers: 1628 | Issues: 0

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.

Language: Python | License: Apache-2.0 | Stargazers: 1106 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 21263 | Issues: 0

PGDiff

[NeurIPS 2023] PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

Language: Python | License: NOASSERTION | Stargazers: 129 | Issues: 0

Emote-hack

Emote Portrait Alive: using AI to reverse-engineer code from the white paper (abandoned).

Language: Python | Stargazers: 171 | Issues: 0

VideoMamba

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Language: Python | License: Apache-2.0 | Stargazers: 745 | Issues: 0

SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

Language: Python | Stargazers: 516 | Issues: 0

Sora

Implementation of the premier Text to Video model from OpenAI

Language: Python | License: MIT | Stargazers: 56 | Issues: 0

lumiere-pytorch

Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch

Language: Python | License: MIT | Stargazers: 238 | Issues: 0

ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

Language: Python | Stargazers: 1953 | Issues: 0

sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

Language: Python | License: Apache-2.0 | Stargazers: 1215 | Issues: 0

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language: Python | License: MIT | Stargazers: 955 | Issues: 0

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language: Python | License: Apache-2.0 | Stargazers: 2722 | Issues: 0

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 2502 | Issues: 0

GSM

Gaussian Shell Maps for Efficient 3D Human Generation (CVPR 2024)

Language: Jupyter Notebook | Stargazers: 187 | Issues: 0

CoDeF

[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language: Python | License: NOASSERTION | Stargazers: 4814 | Issues: 0

Upscale-A-Video

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Stargazers: 897 | Issues: 0

ghostbuster

Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)

Language: Python | License: NOASSERTION | Stargazers: 126 | Issues: 0

TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

Language: Python | License: MIT | Stargazers: 1536 | Issues: 0

nendo

The Nendo AI Audio Tool Suite

Language: Python | License: MIT | Stargazers: 205 | Issues: 0

ml-engineering

Machine Learning Engineering Open Book

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 10477 | Issues: 0

ganhacks

starter from "How to Train a GAN?" at NIPS2016

Stargazers: 11411 | Issues: 0

czkawka

Multi-functional app to find duplicates, empty folders, similar images, etc.

Language: Rust | License: NOASSERTION | Stargazers: 18901 | Issues: 0

E4S2024

Official Implementation of 'E4S: Fine-grained Face Swapping via Editing With Regional GAN Inversion'

Language: Python | License: MIT | Stargazers: 122 | Issues: 0

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language: Python | License: MIT | Stargazers: 2483 | Issues: 0