z.q.mao's repositories
contentvec
speech self-supervised representations
DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
DJtransGAN
"Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022
DocProduct
Medical Q&A with Deep Language Models
FaceFormer
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
hardware_introduction
What scienfitic programmers must know about CPUs and RAM to write fast code.
LiveSpeechPortraits
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
Neural-Style-Transfer-Audio
This is PyTorch Implementation Of Naural Style Transfer Algorithm which is modified for Audios.
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
prosody
Helsinki Prosody Corpus and System for Predicting Prosodic Prominence from Text
Real_Time_Image_Animation
The Project is real time application in opencv using first order model
state-spaces
Sequence Modeling with Structured State Spaces
The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"