hjjjackie's starred repositories
Chinese-text-correction-papers
text correction papers
Distill-BERT-Textgen
Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".
GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
Leveraging-Self-Supervised-Learning-for-AVSR
Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL 2022)
google-10000-english
This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of the Google's Trillion Word Corpus.
mwo2kg-and-echidna
Source code for MWO2KG and Echidna: Constructing and Exploring Knowledge Graphs from Maintenance Data
context-agnostic-engagement
This repository contains the VLEngagement dataset and the helper functions/ tools required to work with the dataset.
noisy-student-training-asr
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
CIF-ColDec
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
google-research
Google Research
awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
las-pytorch
Listen, Attend and spell model for E2E ASR. Implementation in Pytorch