aky15's starred repositories
Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
SenseVoice
Multilingual Voice Understanding Model
fast-transformers
Pytorch library for fast transformer implementations
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
speech-recognition-papers
Towards hot directions in industrial end to end speech recognition
Wave-U-Net-for-Speech-Enhancement
Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.
CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
ASR-Benchmarks
An effort to track benchmarking results over widely-used datasets for ASR.