iwaterxt's repositories

voiceprint

Text-independent speaker identification

Language: C++, Stargazers: 12, Issues: 0

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling

License: MIT, Stargazers: 0, Issues: 0
Language: Cuda, License: Apache-2.0, Stargazers: 0, Issues: 0

CAT

A CRF-based ASR Toolkit

Language: Shell, Stargazers: 0, Issues: 0

cmake-demo

Source code for "CMake入门实战" (CMake: Getting Started in Practice)

Language: CMake, Stargazers: 0, Issues: 0

compound-loss-pytorch

Compound loss for PyTorch

License: Apache-2.0, Stargazers: 0, Issues: 0

DeepSpeech

A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

E2E-ASR

PyTorch Implementations for End-to-End Automatic Speech Recognition

Language: Python, Stargazers: 0, Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Shell, License: Apache-2.0, Stargazers: 0, Issues: 0

gdrive.sh

Download a file or a folder easily. curl gdrive.sh | bash -s $fileid

License: MIT, Stargazers: 0, Issues: 0

iwaterxt.github.io

Template for a blog hosted on GitHub Pages

License: NOASSERTION, Stargazers: 0, Issues: 0

kaldi

This is now the official location of the Kaldi project.

Language: Shell, License: NOASSERTION, Stargazers: 0, Issues: 0

kaldi-dnn-ali-gop

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.

License: NOASSERTION, Stargazers: 0, Issues: 0

kaldi-gop

Computes the Goodness of Pronunciation (GOP). Based on Kaldi.

License: NOASSERTION, Stargazers: 0, Issues: 0

kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Language: Python, License: BSD-2-Clause, Stargazers: 0, Issues: 0

kaldi-io-for-python

Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.

Language: Python, Stargazers: 0, Issues: 0

neural_sp

End-to-end ASR/LM implementation with PyTorch.

Language: Python, Stargazers: 0, Issues: 0

nn-vad

Simple DNN-based voice activity detection (VAD)

Stargazers: 0, Issues: 0

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

License: MIT, Stargazers: 0, Issues: 0

OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

License: Apache-2.0, Stargazers: 0, Issues: 0

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

License: MIT, Stargazers: 0, Issues: 0

python_kaldi_features

Python code to extract MFCC and FBANK speech features for Kaldi

Language: Python, License: MIT, Stargazers: 0, Issues: 0

Socket-Programming-Python

Working client-server code, explained with comments.

Stargazers: 0, Issues: 0
License: Apache-2.0, Stargazers: 0, Issues: 0

sparse_image_warp_pytorch

PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition (https://arxiv.org/abs/1904.08779)

Language: Python, License: MIT, Stargazers: 0, Issues: 0

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

xdecoder

Fast, portable, enhanced ASR decoder

Language: C++, Stargazers: 0, Issues: 0