aky15

Company: Tsinghua University

Location: Beijing

aky15's starred repositories

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8182 | Issues: 178 | Issues: 2337

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python | License: NOASSERTION | Stargazers: 5257 | Issues: 55 | Issues: 982

Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Language: Jupyter Notebook | License: MIT | Stargazers: 4033 | Issues: 127 | Issues: 28

wer_are_we

Attempt at tracking the state of the art and recent results (bibliography) on speech recognition.

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION | Stargazers: 1847 | Issues: 27 | Issues: 67

fast-transformers

Pytorch library for fast transformer implementations

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language: Python | License: MIT | Stargazers: 1388 | Issues: 44 | Issues: 220
Language: Python | License: Apache-2.0 | Stargazers: 852 | Issues: 49 | Issues: 617

ctcdecode

PyTorch CTC Decoder bindings

Language: C++ | License: MIT | Stargazers: 817 | Issues: 22 | Issues: 157

speech-recognition-papers

Towards hot directions in industrial end to end speech recognition

CAT

A CRF-based ASR Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 318 | Issues: 21 | Issues: 48

Wave-U-Net-for-Speech-Enhancement

A PyTorch implementation of Wave-U-Net, adapted to speech enhancement.

Language: Python | License: MIT | Stargazers: 316 | Issues: 11 | Issues: 13

pychain

PyTorch implementation of LF-MMI for End-to-end ASR

Language: C++ | Stargazers: 216 | Issues: 28 | Issues: 0

pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

Language: Python | License: MIT | Stargazers: 173 | Issues: 13 | Issues: 14

fast_rnnt

A torch implementation of a recursion which turns out to be useful for RNN-T.

Language: Python | License: NOASSERTION | Stargazers: 136 | Issues: 9 | Issues: 20

CTC-OptimizedLoss

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.

ASR-Benchmarks

An effort to track benchmarking results over widely-used datasets for ASR.

ST-NAS

Efficient Neural Architecture Search via Straight-Through Gradients

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 3 | Issues: 0