mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Geek Repo

Github PK Tool

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Project DeepSpeech

DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier.

Documentation for installation, usage, and training models are available on deepspeech.readthedocs.io.

For the latest release, including pre-trained models and checkpoints, see the latest release on GitHub.

For contribution guidelines, see CONTRIBUTING.rst.

For contact and support information, see SUPPORT.rst.

About

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Mozilla Public License 2.0

Languages

Language:C++ 46.9%Language:Python 21.4%Language:C 11.2%Language:Shell 10.8%Language:C# 2.8%Language:Swift 1.8%Language:Java 1.3%Language:Makefile 0.9%Language:CMake 0.9%Language:TypeScript 0.6%Language:SWIG 0.5%Language:JavaScript 0.4%Language:Starlark 0.4%Language:Awk 0.1%Language:Ruby 0.0%Language:Objective-C 0.0%