powei-C's repositories
Adversarial-Many-to-Many-VC
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna
albert-chinese-ner
Chinese NER using the pre-trained language model ALBERT
albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale pre-trained Chinese ALBERT models
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0
This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-parallel training data".
Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
espnet
End-to-End Speech Processing Toolkit
fast-transformers
PyTorch library for fast transformer implementations
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
MaskCycleGAN-VC
Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.
mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
MQTTS
RVQ-based TTS
nnsvs
Neural network-based singing voice synthesis library for research
ParallelWaveGAN-VC
Unofficial Parallel WaveGAN VC with PyTorch
randomCNN-voice-transfer
Audio style transfer with a shallow, random-parameter CNN. Result: https://soundcloud.com/mazzzystar/sets/speech-conversion-sample
roberta_zh
Pre-trained Chinese RoBERTa models: RoBERTa for Chinese
Singing-Voice-Vocoder
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
This is the implementation of the paper "Converting anyone's emotion: towards speaker-independent emotional voice conversion".
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
WGANSing
Multi-voice singing voice synthesis