v-yunbin

followers

following

stars

wzy's repositories

Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Language:PythonApache-2.01 10

add-noise

add noise of a certain SNR to audio files

Language:C++010

audio_cut

语音切割，python ，webrtc

Language:Python000

BigCiDian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Language:Python010

ChineseSegmentation

中文分词

Language:Python010

CMUdict

CMUdict maintenance, and tools

BSD-3-Clause000

DaCiDian

DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)

Language:Python010

deepsegment

A sentence segmenter that actually works!

Language:PythonGPL-3.0010

docker-py-kaldi-asr

LGPL-3.0000

docker-python3-opencv4-ffmpeg4

Docker image including Python 3.7+, OpenCV 4.1+, FFmpeg 4.0+, based on Ubuntu

000

espnet-tts

ESPNet TTS standalone execution

Language:PythonMIT010

g2p

g2p: English Grapheme To Phoneme Conversion

Language:PythonApache-2.0010

g2p-seq2seq

G2P with Tensorflow

NOASSERTION000

ipa-dict

Monolingual wordlists with pronunciation information in IPA

MIT000

keyword_spotting

Chinese keyword spotting model using LSTM RNN

Language:Python010

masr

中文语音识别，提供预训练模型，高识别率 Chinese Speech Recognition; Mandarin Automatic Speech Recognition;

Language:PythonNOASSERTION010

mica-gender-from-audio

Gender prediction in movie audio

Language:TypeScript010

MTTS

A Demo of Mandarin/Chinese TTS frontend

Language:PythonMIT010

NoiseMix

Mixing Noise and Audio files

Language:Python000

punctuator2

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text

MIT000

pycorrector

pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.

Apache-2.0000

rnnoise

Recurrent neural network for audio noise reduction

Language:CBSD-3-Clause010

SexRecognizer

University project, python script designed to recognize speaker's sex from a recording

000

speaker-recognition-py3

Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)

Language:PythonApache-2.0010

speaker_recognition_GMM_UBM

A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schizophrenia.

Language:Jupyter Notebook000

SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Language:PythonApache-2.0010

speech-denoising-wavenet

A neural network for end-to-end speech denoising

Language:PythonMIT010

Tacotron-2

Deepmind's Tacotron-2 Tensorflow implementation

Language:PythonMIT000

WebRTC-3A1V

AEC, AGC, ANS, VAD in WebRTC

Language:C010

youtube-transcriber

Automatically transcribes YouTube videos

Language:Jupyter NotebookMIT010