wzy's repositories
Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
audio_cut
Speech audio splitting, Python, WebRTC
CMUdict
CMUdict maintenance, and tools
deepsegment
A sentence segmenter that actually works!
docker-python3-opencv4-ffmpeg4
Docker image including Python 3.7+, OpenCV 4.1+, FFmpeg 4.0+, based on Ubuntu
espnet-tts
ESPNet TTS standalone execution
g2p-seq2seq
G2P with Tensorflow
ipa-dict
Monolingual wordlists with pronunciation information in IPA
keyword_spotting
Chinese keyword spotting model using LSTM RNN
mica-gender-from-audio
Gender prediction in movie audio
NoiseMix
Mixing Noise and Audio files
punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
SexRecognizer
University project: a Python script designed to recognize a speaker's sex from a recording
speaker-recognition-py3
Based on MFCC and GMM (speech recognition based on MFCC and Gaussian mixture models)
speaker_recognition_GMM_UBM
A GMM-UBM speaker recognition system for use in an Android application that helps monitor patients suffering from schizophrenia.
SpecAugment
An implementation of SpecAugment, introduced by Google Brain, with TensorFlow & PyTorch
speech-denoising-wavenet
A neural network for end-to-end speech denoising
Tacotron-2
Deepmind's Tacotron-2 Tensorflow implementation
WebRTC-3A1V
AEC, AGC, ANS, VAD in WebRTC
youtube-transcriber
Automatically transcribes YouTube videos