wzy's repositories

Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

add-noise

add noise of a certain SNR to audio files

Language:C++Stargazers:0Issues:1Issues:0

audio_cut

语音切割,python ,webrtc

Language:PythonStargazers:0Issues:0Issues:0

BigCiDian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

CMUdict

CMUdict maintenance, and tools

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

DaCiDian

DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)

Language:PythonStargazers:0Issues:1Issues:0

deepsegment

A sentence segmenter that actually works!

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0
License:LGPL-3.0Stargazers:0Issues:0Issues:0

docker-python3-opencv4-ffmpeg4

Docker image including Python 3.7+, OpenCV 4.1+, FFmpeg 4.0+, based on Ubuntu

Stargazers:0Issues:0Issues:0

espnet-tts

ESPNet TTS standalone execution

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

g2p

g2p: English Grapheme To Phoneme Conversion

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

g2p-seq2seq

G2P with Tensorflow

License:NOASSERTIONStargazers:0Issues:0Issues:0

ipa-dict

Monolingual wordlists with pronunciation information in IPA

License:MITStargazers:0Issues:0Issues:0

keyword_spotting

Chinese keyword spotting model using LSTM RNN

Language:PythonStargazers:0Issues:1Issues:0

masr

中文语音识别,提供预训练模型,高识别率 Chinese Speech Recognition; Mandarin Automatic Speech Recognition;

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

mica-gender-from-audio

Gender prediction in movie audio

Language:TypeScriptStargazers:0Issues:1Issues:0

MTTS

A Demo of Mandarin/Chinese TTS frontend

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

NoiseMix

Mixing Noise and Audio files

Language:PythonStargazers:0Issues:0Issues:0

punctuator2

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text

License:MITStargazers:0Issues:0Issues:0

pycorrector

pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

rnnoise

Recurrent neural network for audio noise reduction

Language:CLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

SexRecognizer

University project, python script designed to recognize speaker's sex from a recording

Stargazers:0Issues:0Issues:0

speaker-recognition-py3

Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

speaker_recognition_GMM_UBM

A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schizophrenia.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

speech-denoising-wavenet

A neural network for end-to-end speech denoising

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Tacotron-2

Deepmind's Tacotron-2 Tensorflow implementation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

WebRTC-3A1V

AEC, AGC, ANS, VAD in WebRTC

Language:CStargazers:0Issues:1Issues:0

youtube-transcriber

Automatically transcribes YouTube videos

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0