Queen_Wcy's repositories

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

audio-SNR

Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)

Language:PythonStargazers:0Issues:0Issues:0

auorange

Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AutoSpeech

[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

License:MITStargazers:0Issues:0Issues:0

BVAE-TTS

Official implementation of BVAE-TTS

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

chatbot-list

行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍

Stargazers:0Issues:0Issues:0

clean-text

🧹 Python package for text cleaning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

deep-learning-model-convertor

The convertor/conversion of deep learning models for different deep learning frameworks/softwares.

Stargazers:0Issues:0Issues:0

free-programming-books

:books: Freely available programming books

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

interesting-python

有趣的Python爬虫和Python数据分析小项目(Some interesting Python crawlers and data analysis projects)

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mfa-models

Collection of pretrained models for the Montreal Forced Aligner

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

PSST

Prosodic Speech Segmentation with Transformers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

SpeechAlgorithms

Speech Algorithms

Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

survey

A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf

Stargazers:0Issues:0Issues:0

TTS_TFLite

This repository is a collection of TTS Models in TFLite

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

VAENAR-TTS

The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

visqol

Perceptual Quality Estimator for speech and audio

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

voice-filter

A unofficial Pytorch implementation of Google's VoiceFilter

Language:PythonStargazers:0Issues:0Issues:0

WavJourney

WavJourney: Compositional Audio Creation with LLMs

Stargazers:0Issues:0Issues:0

zhtts

A demo of zh/Chinese Text to Speech system run on CPU in real time.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0