lbqin's repositories
speech-vad-demo
Integrates the WebRTC VAD to split audio files into segments
SpeechSynthesis
A survey of speech synthesis
aichallenge
Xunfei dialect challenge baseline
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System https://arxiv.org/pdf/1705.02304.pdf
GCommandsPytorch
ConvNets for Audio Recognition using Google Commands Dataset
kaldi-enhan
Tools for speech enhancement based on kaldi
ML-KWS-for-MCU
Keyword spotting on Arm Cortex-M Microcontrollers
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
parallel_wavenet_vocoder
Parallel WaveNet Vocoder Based on ClariNet
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
Sinsy-Remix
The HMM-Based Singing Voice Synthesis System Remix "Sinsy-r"
tacotron-1
A TensorFlow implementation of Google's Tacotron speech synthesis with a pre-trained model
web-speech-api
A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.