zhanghm1995

Herman's repositories

Cheat-Sheet-For-FFmpeg

Language:Python100

3DDFA_V2

The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.

MIT000

AAAI22_ONE-SHOT_TALKING_FACE_GEN

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

000

apollo

An open autonomous driving platform

Language:C++Apache-2.0000

AudioStyleNet

This repository contains the code for my master thesis on Emotion-Aware Facial Animation

000

BAT_video

Anonymous repo to reproduce the visualization results of BAT.

Language:Python000

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

MIT000

EVP

Code for paper 'Audio-Driven Emotional Video Portraits'.

Language:Jupyter Notebook000

FACIAL

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

Language:PythonAGPL-3.0000

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

NOASSERTION000

HDTF

the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

GPL-3.0000

imaginaire

NVIDIA's Deep Imagination Team's PyTorch Library

NOASSERTION000

lightning-flash

Collection of tasks for fast prototyping, baselining, finetuning and solving problems with deep learning.

Apache-2.0000

LiveSpeechPortraits

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)

MIT000

MakeItTalk

NOASSERTION000

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language:C++Apache-2.0000

One-Shot_Free-View_Neural_Talking_Head_Synthesis

000

SAT

SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)

Language:PythonMIT000

SegLoss

A collection of loss functions for medical image segmentation

000

SNP

Official code for View Synthesis with Sculpted Neural Points

MIT000

style-based-gan-pytorch

Implementation A Style-Based Generator Architecture for Generative Adversarial Networks in PyTorch

NOASSERTION000

StyleSpace

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

000

StyleSpace-pytorch

Implementation of StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation in PyTorch

000

SyncNetCN

Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released

MIT000

SynergyNet

3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry

Language:PythonMIT000

take-off-eyeglasses

Official pytorch implementation of paper "Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data" (CVPR 2022).

000

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

MIT000

vico_challenge_baseline

000

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Language:Python000

Wav2LipHD

Language:Python000