fanlu's repositories

wenet

Transformer based ASR Engine.

Language:C++License:Apache-2.0Stargazers:12Issues:2Issues:0

kaldi

This is now the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:1Issues:2Issues:0

Audiomer-PyTorch

A Convolutional Transformer for Keyword Spotting

Language:PythonStargazers:0Issues:0Issues:0

automata_ml

An Introduction to Weighted Automata in Machine Learning

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

bloaty

Bloaty McBloatface: a size profiler for binaries

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

chinese_text_normalization

Chinese text normalization for speech processing

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

FastASR

基于PaddleSpeech所使用的conformer模型,使用C++的高效实现模型推理,在树莓派4B等ARM平台运行也可流畅运行。

License:Apache-2.0Stargazers:0Issues:0Issues:0

genshin_auto_fish

基于深度强化学习的原神自动钓鱼AI

Stargazers:0Issues:0Issues:0

Genshin_login_tool

原神抢码科技

Language:PythonStargazers:0Issues:0Issues:0

grpc

The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

k2

FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar

Language:CudaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

KWS_Max-pooling_RHE

Mining effective negative training samples for keyword spotting (PyTorch)

Language:PythonStargazers:0Issues:0Issues:0

leaderboard

largest-ever Automatic Speech Recognition leaderboard, periodically benchmarks SOTA commercial ASR APIs from Alibaba, Baidu, Google, IFlytek, Microsoft and so on.

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

localatt_emorecog

A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'

Language:PythonStargazers:0Issues:2Issues:0

Mys_Goods_Tool

米游社商品兑换工具 | 短信验证登录 | 终端图形界面

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NeMo

NeMo: a toolkit for conversational AI

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

protobuf

Protocol Buffers - Google's data interchange format

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Realtime-Voice-Clone-Chinese

克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

voxceleb_trainer

In defence of metric learning for speaker recognition

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

wav2VAD

A voice activity detection system based on wav2vec 2.0

License:Apache-2.0Stargazers:0Issues:0Issues:0

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0