shengzhang0222's starred repositories
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
contentvec
speech self-supervised representations
so-vits-svc
SoftVC VITS Singing Voice Conversion
voice-activity-detection
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
WeTextProcessing
Text Normalization & Inverse Text Normalization
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive benchmarking platform for Automatic Speech Recognition.
One-Shot-Voice-Cloning
One Shot Voice Cloning based on Unet-TTS
ChineseTtsTflite
Offline Android Chinese TTS engine based on TensorflowTTS, used for testing TfLite models.
chinese_text_normalization
Chinese text normalization for speech processing
TensorFlowTTS_chinese
Chinese TTS
Speech-Transformer-tf2.0
Transformer for an ASR system (via TensorFlow 2.0)
score-ensembles-based-SVM
Combine many organs from a plant to predict its species
Speaker_Verification_Tencent
Deep Discriminative Embeddings for Duration Robust Speaker Verification
antispoofing-features
Code for the paper "Bag of features for voice anti-spoofing"
AM-MobileNet1D
The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 architecture and the Additive Margin Softmax (AM-Softmax) loss function.
speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
VoiceprintRecognition-Tensorflow
Voiceprint recognition implemented with TensorFlow