Meng Wei's repositories
WeTextProcessing
Text Normalization & Inverse Text Normalization
dotfiles
:rainbow: @weimeng23 does dotfiles and preferences
speech-recognition-learning-resources
:white_check_mark: A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
openfst
:clipboard: Unofficial mirror of OpenFst Library
Poetry
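A very comprehensive dataset of classical Chinese poetry, containing more than 850,000 poems from the pre-Qin period to modern times.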
minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
chinese-poetry
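The most comprehensive database of Chinese poetry 🧶: nearly 14,000 poets of the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci poems by 1,564 ci poets of the Song period. Alibaba is hiring Python P6/P7 in Zhangjiang, Shanghai: gaojunqi@outlook.com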
audio-speech-datasets
:scroll: A list of various Audio/Speech datasets about Speech Recognition, Speech Synthesis, Noise, Audio Tagging/Sound Event Detection, Speaker Diarization, Speaker Recognition, (Inverse) Text Normalization, Speech Translation, Multilingual, etc. (continuously updated)
Beanfun
繽放: a third-party client for Beanfun (樂豆)
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
clash-verge
A Clash GUI based on tauri. Supports Windows, macOS and Linux.
mit-deep-learning-book-pdf
MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville
cmudict
CMU US English Dictionary
free-programming-books
:books: Freely available programming books
kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
leetcode
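😏 LeetCode solutions in any programming language | Solutions in multiple programming languages to LeetCode, Coding Interviews (剑指 Offer, 2nd edition), and Cracking the Coding Interview (程序员面试金典, 6th edition)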
noisy-student-training-asr
PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection
python-audio
Some Jupyter notebooks about audio signal processing with Python
nvim
The Ultimate NeoVim Config for Colemak Users
AEC-Challenge
AEC Challenge
wenet-backup
Personal backup for wenet
wenet-old-branch
Production First and Production Ready End-to-End Speech Recognition Toolkit
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
reinforcement-learning-an-introduction
Python implementation of Reinforcement Learning: An Introduction
fq
:earth_americas: :statue_of_liberty: An incomplete roundup of censorship-circumvention software