fanlu

Organizations

wenet-e2e

fanlu's repositories

wenet

Transformer based ASR Engine.

Language: C++ | License: Apache-2.0 | 12 20

kaldi

This is now the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | 1 20

Audiomer-PyTorch

A Convolutional Transformer for Keyword Spotting

Language: Python | 000

automata_ml

An Introduction to Weighted Automata in Machine Learning

Language: Jupyter Notebook | 000

bloaty

Bloaty McBloatface: a size profiler for binaries

Language: C++ | License: Apache-2.0 | 000

chinese_text_normalization

Chinese text normalization for speech processing

Language: Python | License: MIT | 010

DTLN-aec

Language: Python | 000

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | 020

Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Language: Python | License: MIT | 010

FastASR

An efficient C++ implementation of inference for the conformer model used by PaddleSpeech; it runs smoothly even on ARM platforms such as the Raspberry Pi 4B.

License: Apache-2.0 | 000

genshin_auto_fish

An automatic fishing AI for Genshin Impact based on deep reinforcement learning

000

Genshin_login_tool

Genshin Impact code-grabbing tool

Language: Python | 000

grpc

The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

Language: C++ | License: Apache-2.0 | 010

icefall

License: NOASSERTION | 000

k2

FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar

Language: Cuda | License: NOASSERTION | 000

KWS_Max-pooling_RHE

Mining effective negative training samples for keyword spotting (PyTorch)

Language: Python | 000

leaderboard

The largest-ever Automatic Speech Recognition leaderboard, which periodically benchmarks SOTA commercial ASR APIs from Alibaba, Baidu, Google, IFlytek, Microsoft, and others.

Language: Python | 010

lhotse

Language: Python | License: Apache-2.0 | 000

localatt_emorecog

A PyTorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'

Language: Python | 020

Mys_Goods_Tool

Miyoushe (米游社) merchandise redemption tool | SMS verification login | terminal graphical interface

Language: Python | License: MIT | 000

NeMo

NeMo: a toolkit for conversational AI

Language: Jupyter Notebook | License: Apache-2.0 | 010

protobuf

Protocol Buffers - Google's data interchange format

Language: C++ | License: NOASSERTION | 010

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language: Python | License: BSD-3-Clause | 020

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Language: Python | License: Apache-2.0 | 010

Realtime-Voice-Clone-Chinese

Clone your voice and generate arbitrary speech content. Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | 000

snowfall

Language: Python | License: Apache-2.0 | 010

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | 010

voxceleb_trainer

In defence of metric learning for speaker recognition

Language: Python | License: MIT | 000

wav2VAD

A voice activity detection system based on wav2vec 2.0

License: Apache-2.0 | 000

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

License: Apache-2.0 | 000