iwaterxt

iwaterxt's repositories

voiceprint

text-independent speaker identification

Language: C++ | 1200

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling

MIT | 000

baidu-allreduce

Language: Cuda | Apache-2.0 | 000

CAT

A CRF-based ASR Toolkit

Language: Shell | 000

cmake-demo

Source code for the book 《CMake入门实战》 (a hands-on introduction to CMake)

Language: CMake | 000

compound-loss-pytorch

Compound loss for PyTorch

Apache-2.0 | 000

DeepSpeech

A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.

Language: Python | Apache-2.0 | 000

E2E-ASR

PyTorch Implementations for End-to-End Automatic Speech Recognition

Language: Python | 000

espnet

End-to-End Speech Processing Toolkit

Language: Shell | Apache-2.0 | 000

gdrive.sh

Download a file or a folder easily. curl gdrive.sh | bash -s $fileid

MIT | 000

iwaterxt.github.io

Template for a blog hosted on GitHub Pages

NOASSERTION | 000

kaldi

This is now the official location of the Kaldi project.

Language: Shell | NOASSERTION | 000

kaldi-aslp

000

kaldi-dnn-ali-gop

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.

NOASSERTION | 000

kaldi-gop

Computes the Goodness of Pronunciation (GOP). Based on Kaldi.

NOASSERTION | 000

kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Language: Python | BSD-2-Clause | 000

kaldi-io-for-python

Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.

Language: Python | 000

Multi-band-WaveRNN

000

neural_sp

End-to-end ASR/LM implementation with PyTorch.

Language: Python | 000

nn-vad

Simple DNN-based voice activity detection (VAD)

000

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

MIT | 000

OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Apache-2.0 | 000

polysody

000

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

MIT | 000

python_kaldi_features

Python code to extract MFCC and FBANK speech features for Kaldi

Language: Python | MIT | 000

Socket-Programming-Python

Working client-server code, explained with comments.

000

sparrowhawk

Apache-2.0 | 000

sparse_image_warp_pytorch

PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition (https://arxiv.org/abs/1904.08779)

Language: Python | MIT | 000

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Language: Python | Apache-2.0 | 000

xdecoder

Fast, portable, enhanced ASR decoder

Language: C++ | 000