Jing Li's repositories
Audio2Gestures
Official implementation for Audio2Motion: Generating Diverse Gestures from Speech with Conditional Variational Autoencoders.
DanceRevolution
Code for the paper Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning
BlenderToolbox
Some simple Blender scripts for rendering paper figures
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
dlib
A toolkit for making real world machine learning and data analysis applications in C++
PerceptualSimilarity
LPIPS metric. pip install lpips
PyMO
A library for machine learning research on motion capture data
smplx
SMPL-X