double22a's repositories
speech_dataset
The dataset of Speech Recognition
asr_nlp_paper_code
Papers of ASR, Tools of ASR
chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。 🤪 😜 阿里招p6/p7 Python Golang | gaojunqi@outlook.com | 上海张江
Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2020)。
awesome-knowledge-distillation-1
Awesome Knowledge Distillation
awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
CAT
A CRF-based ASR Toolkit
code-switching-papers
A curated list of research papers and resources on code-switching
Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
e2e_lfmmi
This is the implementation of paper CONSISTENT TRAINING AND DECODING FOR END-TO-END SPEECH RECOGNITIONUSING LATTICE-FREE MMI submitted to ICASSP2022
espnet
End-to-End Speech Processing Toolkit
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
GigaSpeech
Large, modern dataset for speech recognition
kaldifeat
Kaldi-compatible feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd
kaldiio
A pure python module for reading and writing kaldi ark files
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
open-speech-corpora
A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
sam
SAM: Sharpness-Aware Minimization (PyTorch)
speech-recognition-papers
Towards hot directions in industrial end to end speech recognition
SpeechAlgorithms
Speech Algorithms Collections
speechbrain
A PyTorch-based Speech Toolkit
TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.