Herman's repositories
3DDFA_V2
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
AAAI22_ONE-SHOT_TALKING_FACE_GEN
Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)
apollo
An open autonomous driving platform
AudioStyleNet
This repository contains the code for my master's thesis on Emotion-Aware Facial Animation
BAT_video
Anonymous repo to reproduce the visualization results of BAT.
deepspeech.pytorch
Speech Recognition using DeepSpeech2.
EVP
Code for the paper 'Audio-Driven Emotional Video Portraits'.
FACIAL
FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.
first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
HDTF
The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
imaginaire
NVIDIA's Deep Imagination Team's PyTorch Library
lightning-flash
Collection of tasks for fast prototyping, baselining, finetuning and solving problems with deep learning.
LiveSpeechPortraits
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
SAT
SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)
SegLoss
A collection of loss functions for medical image segmentation
SNP
Official code for View Synthesis with Sculpted Neural Points
style-based-gan-pytorch
Implementation of A Style-Based Generator Architecture for Generative Adversarial Networks in PyTorch
StyleSpace
StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation
StyleSpace-pytorch
Implementation of StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation in PyTorch
SyncNetCN
Optimized SyncNet with a Chinese-enhanced version; EN and CN checkpoints released
SynergyNet
3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry
take-off-eyeglasses
Official PyTorch implementation of the paper "Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data" (CVPR 2022).
TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.