Youqiang Zheng's repositories
Exercise_pytorchvideo
Running a pre-trained PyTorchVideo classification model using Torch Hub
codec2
Open source speech codec designed for communications quality speech between 450 and 3200 bit/s. The main application is low bandwidth HF/VHF digital radio.
descript-audio-vae
VAE GAN modified from Descript Audio Codec, which replaces the RVQ with VAE
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
LPCNet-1
Experimental Neural Net speech coding for FreeDV
Mel-to-LPC
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
OpenACELP
Free ACELP vocoder
python-pinyin
汉字转拼音(pypinyin)
rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
Video-to-Retail-Platform
An intelligent multimodal-learning based system for video, product and ads analysis. Based on the system, people can build a lot of downstream applications such as product recommendation, video retrieval, etc.
WaveRNN
WaveRNN Vocoder + TTS
WHU-Thesis-LaTeX
武汉大学毕业论文模板。