Lucky Wong's repositories

CTC-OptimizedLoss

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.

Conformer-Athena

Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

Language:PythonLicense:Apache-2.0Stargazers:43Issues:1Issues:1

speexdsp-ns-python

Python bindings of speexdsp noise suppression library

Language:C++License:Apache-2.0Stargazers:35Issues:4Issues:1

CE-OptimizedLoss

Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.

warp-ctc-crf

An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.

PLCPA-ASYM-Loss

The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss

Language:PythonLicense:Apache-2.0Stargazers:9Issues:1Issues:0

cat_tensorflow

Crf-based Asr Toolkit with TensorFlow implement

Language:PythonStargazers:8Issues:1Issues:0

AIF-PyTorch

(NOT Official) Implementation Auto-regressive Integrate-and-Fire (AIF)

Language:PythonStargazers:4Issues:2Issues:0

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

PercepNet

(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

Language:C++License:BSD-3-ClauseStargazers:1Issues:0Issues:0

SV-GMM

Speaker Verification using GMMs

Language:PythonStargazers:1Issues:1Issues:0

warp-rnnt

CUDA-Warp RNN-Transducer with TensorFlow and PyTorch binding.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

asr_frontend

PyTorch implementation of frontend, like PCEN (per-channel energy normalization) and Mel-Filterbank (mel-filterbank).

Language:PythonStargazers:0Issues:1Issues:0

athena

an open-source implementation of sequence-to-sequence based speech processing engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

conv-tasnet

A PyTorch implementation of "Improving noise robust automatic speech recognition with single-channel time-domain enhancement network"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NKF-AEC

Acoustic Echo Cancellation with Nerual Kalman Filtering

Language:HTMLStargazers:0Issues:0Issues:0

NSD-MS2S

CHIME-7 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

rir-configuration-generator

Generation of virtual rooms configurations.

Language:PythonStargazers:0Issues:1Issues:0

self_attention_alignment

Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SpectrumAugmenter

Performs data augmentation as according to the SpecAugment paper. Modified from Lingvo (TensorFlow > 1.10.0).

Language:PythonStargazers:0Issues:1Issues:2

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

torchdistance

Edit-distance PyTorch extension with Cpu and CUDA kernels

Language:PythonStargazers:0Issues:0Issues:0

unified2021

A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION

Language:MATLABStargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

you-get

:arrow_double_down: Dumb downloader that scrapes the web

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0