There are 41 repositories under speech topic.
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Code examples for new APIs of iOS 10.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Lingvo
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
WaveNet vocoder
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Free, easy, portable audio engine for games
🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Videos, notes and experiments to understand deep learning
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Community list of startups working with AI in audio and music technology
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
:speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword detection
A neural network for end-to-end speech denoising
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
End-to-end ASR/LM implementation with PyTorch