Beast code in Giters

sahi11's repositories

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

MIT

monocular-RGB-neural-head-avatars

Official PyTorch implementation of "Neural Head Avatars from Monocular RGB Videos"

av_hubert

A self-supervised learning framework for audio-visual speech, lip-reading

NOASSERTION

Official repository accompanying the CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture and Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild.

NOASSERTION

stylegan3

Official PyTorch implementation of StyleGAN3 - Nvidia

NOASSERTION

instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

NOASSERTION

nerfies

This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.

Apache-2.0

ADOP

MIT

video2colmap

Convert a video to a COLMAP project

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

NOASSERTION

insightface

State-of-the-art 2D and 3D Face Analysis Project

MIT

sahi11.github.io

Language: HTML

content_choral_separation

EverybodyDanceNow

Motion Retargeting Video Subjects

NOASSERTION

URST

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

Apache-2.0

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

face-alignment

2D and 3D face alignment library built using PyTorch

BSD-3-Clause

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

NOASSERTION

fsgan

FSGAN - Official PyTorch Implementation

CC0-1.0

espnet

End-to-End Speech Processing Toolkit

Apache-2.0

NeuralVoicePuppetry

This GitHub repository contains the network architectures of NeuralVoicePuppetry.

NOASSERTION

syncnet_python

Out of time: automated lip sync in the wild

MIT

Face-Super-Resolution

Face super resolution based on ESRGAN

EA-SVC

An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

MIT

Realistic-Neural-Talking-Head-Models

My implementation of Few-Shot Adversarial Learning of Realistic Neural Talking Head Models (Egor Zakharov et al.).

GPL-3.0

Fast-AgingGAN

A deep learning model to age faces in the wild, currently runs at 60+ fps on GPUs

rllib-tune-atari

LipGAN

This repository contains the code for LipGAN. LipGAN was published as part of the paper titled "Towards Automatic Face-to-Face Translation".

MIT

awesome-ai-in-finance

🔬 A collection of AI methods (RL / DL / SL / Evolution / Genetic Algorithm) used in financial markets, plus Technology Analysis / Alpha Research / Arbitrage and other useful strategy tools and docs for quantitative finance.

deep-learning-v2-pytorch

Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101

MIT