Queen_Wcy's repositories
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
BVAE-TTS
Official implementation of BVAE-TTS
chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
clean-text
🧹 Python package for text cleaning
deep-learning-model-convertor
The convertor/conversion of deep learning models for different deep learning frameworks/softwares.
free-programming-books
:books: Freely available programming books
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
interesting-python
有趣的Python爬虫和Python数据分析小项目(Some interesting Python crawlers and data analysis projects)
INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
knn-vc
Voice Conversion With Just Nearest Neighbors
LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
mfa-models
Collection of pretrained models for the Montreal Forced Aligner
PSST
Prosodic Speech Segmentation with Transformers
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
SpeechAlgorithms
Speech Algorithms
survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
TTS_TFLite
This repository is a collection of TTS Models in TFLite
VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
visqol
Perceptual Quality Estimator for speech and audio
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
voice-filter
A unofficial Pytorch implementation of Google's VoiceFilter
WavJourney
WavJourney: Compositional Audio Creation with LLMs
zhtts
A demo of zh/Chinese Text to Speech system run on CPU in real time.