fanlu's repositories
Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting
automata_ml
An Introduction to Weighted Automata in Machine Learning
bloaty
Bloaty McBloatface: a size profiler for binaries
chinese_text_normalization
Chinese text normalization for speech processing
Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
FastASR
An efficient C++ inference implementation of the Conformer model used by PaddleSpeech; it also runs smoothly on ARM platforms such as the Raspberry Pi 4B.
genshin_auto_fish
An automatic fishing AI for Genshin Impact based on deep reinforcement learning
Genshin_login_tool
A code-grabbing tool for Genshin Impact
k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
leaderboard
The largest-ever Automatic Speech Recognition leaderboard, which periodically benchmarks SOTA commercial ASR APIs from Alibaba, Baidu, Google, iFLYTEK, Microsoft, and others.
localatt_emorecog
A PyTorch implementation of "Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention"
Mys_Goods_Tool
Miyoushe (米游社) merchandise redemption tool | SMS verification login | terminal graphical interface
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Realtime-Voice-Clone-Chinese
Clone your voice and generate arbitrary speech content: clone a voice in 5 seconds to generate arbitrary speech in real time
speechbrain
A PyTorch-based Speech Toolkit
voxceleb_trainer
In defence of metric learning for speaker recognition
wav2VAD
A voice activity detection system based on wav2vec 2.0
wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit