There are 19 repositories under deepspeech topic.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Examples of how to use or integrate DeepSpeech
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。
speech to text benchmark framework
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
A testing server for a speech to text service based on coqui.ai
Golang bindings for Mozilla's DeepSpeech speech-to-text library
ASR with PyTorch
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Install Mozilla DeepSpeech on a Raspberry Pi 4
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
A MXNet implementation of Baidu's DeepSpeech architecture
Automatic Speech Recognition in Unity using Vosk library
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
Raspberry Pi impersonates Nintendo Switch controller
Blender add-on to implement VOCA neural network.
A PyTorch implementation of DeepSpeech and DeepSpeech2.
📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
🌊 A crazy simple library for reading/writing WAV files in V. Zero dependencies, 100% cross-platform.
Mozilla DeepSpeech in flutter using Dart FFI
Training scripts for Speech-To-Text models for Ukrainian language
A crash course for training speech recognition models using DeepSpeech.
Lua Library for Speech Recognition
This extension helps to get a real-time transcription of audio playing in the browser using Deep Speech.
DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.