sahi11's repositories
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
monocular-RGB-neural-head-avatars
Official PyTorch implementation of "Neural Head Avatars from Monocular RGB Videos"
av_hubert
A self-supervised learning framework for audio-visual speech, lip-reading
emoca
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild
stylegan3
Official PyTorch implementation of StyleGAN3 - Nvidia
instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
nerfies
This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.
video2colmap
Convert a video to a COLMAP project
VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
insightface
State-of-the-art 2D and 3D Face Analysis Project
EverybodyDanceNow
Motion Retargeting Video Subjects
URST
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
face-alignment
:fire: 2D and 3D Face alignment library build using pytorch
first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
fsgan
FSGAN - Official PyTorch Implementation
espnet
End-to-End Speech Processing Toolkit
NeuralVoicePuppetry
This github contains the network architectures of NeuralVoicePuppetry.
syncnet_python
Out of time: automated lip sync in the wild
Face-Super-Resolution
Face super resolution based on ESRGAN
EA-SVC
An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"
Realistic-Neural-Talking-Head-Models
My implementation of Few-Shot Adversarial Learning of Realistic Neural Talking Head Models (Egor Zakharov et al.).
Fast-AgingGAN
A deep learning model to age faces in the wild, currently runs at 60+ fps on GPUs
LipGAN
This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".
awesome-ai-in-finance
🔬 A collection for those AI (RL / DL / SL / Evoluation / Genetic Algorithm) used in financial market. otherwise, we add Technology Analysis / Alpha Research / Arbitrage and other useful strategies tools & docs in quantitative finance market.
deep-learning-v2-pytorch
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101