double22a's repositories

speech_dataset

The dataset of Speech Recognition

asr_nlp_paper_code

Papers of ASR, Tools of ASR

chinese-poetry

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。 🤪 😜 阿里招p6/p7 Python Golang | gaojunqi@outlook.com | 上海张江

Language:JavaScriptLicense:MITStargazers:1Issues:0Issues:0

Awesome-Knowledge-Distillation

Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2020)。

Stargazers:0Issues:0Issues:0

awesome-knowledge-distillation-1

Awesome Knowledge Distillation

Stargazers:0Issues:0Issues:0

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling

License:MITStargazers:0Issues:0Issues:0

CAT

A CRF-based ASR Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

code-switching-papers

A curated list of research papers and resources on code-switching

License:Apache-2.0Stargazers:0Issues:0Issues:0

Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

Stargazers:0Issues:0Issues:0

double22a

Config files for my GitHub profile.

Stargazers:0Issues:1Issues:0

e2e_lfmmi

This is the implementation of paper CONSISTENT TRAINING AND DECODING FOR END-TO-END SPEECH RECOGNITIONUSING LATTICE-FREE MMI submitted to ICASSP2022

Stargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License:MITStargazers:0Issues:0Issues:0

GigaSpeech

Large, modern dataset for speech recognition

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kaldifeat

Kaldi-compatible feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd

License:NOASSERTIONStargazers:0Issues:0Issues:0

kaldiio

A pure python module for reading and writing kaldi ark files

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

License:Apache-2.0Stargazers:0Issues:0Issues:0

open-speech-corpora

A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

sam

SAM: Sharpness-Aware Minimization (PyTorch)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

speech-recognition-papers

Towards hot directions in industrial end to end speech recognition

License:MITStargazers:0Issues:0Issues:0

SpeechAlgorithms

Speech Algorithms Collections

License:Apache-2.0Stargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

License:MITStargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

wer_are_we

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

Stargazers:0Issues:0Issues:0