There are 13 repositories under deepspeech topic.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Examples of how to use or integrate DeepSpeech
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。
speech to text benchmark framework
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
A testing server for a speech to text service based on coqui.ai
Golang bindings for Mozilla's DeepSpeech speech-to-text library
ASR with PyTorch
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech
A MXNet implementation of Baidu's DeepSpeech architecture
Install Mozilla DeepSpeech on a Raspberry Pi 4
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
Blender add-on to implement VOCA neural network.
Automatic Speech Recognition in Unity using Vosk library
Raspberry Pi impersonates Nintendo Switch controller
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
Mozilla DeepSpeech in flutter using Dart FFI
Training scripts for Speech-To-Text models for Ukrainian language
A crash course for training speech recognition models using DeepSpeech.
This extension helps to get a real-time transcription of audio playing in the browser using Deep Speech.
Lua Library for Speech Recognition
DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.