AigizK / deepspeech-notes

DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.

Home Page:https://deepspeech-notes.haupt.digital

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DeepSpeechNotes Screen Shot

Contributors Forks Stargazers Issues GitHub top language,

DeepSpeechNotes

Table of Contents

About the project

DeepSpeechNotes is a note taking app that uses Mozilla's DeepSpeech, Web Audio API and Node Voice Activity Detection to transcribe speech into text on the go.

It is my graduation project that was coded from scratch in 4 weeks. My main goal was to showcase current Open Source Text-To-Speech technology.

I wanted to learn something new in terms of technology, so I picked the topics Machine Learning and Speech-To-Text recognition and apply them practically. The result is DeepSpeechNotes, a note taking app that transcribes voice in near real-time.

Go to Demo

πŸš€ Built With

πŸ’… Front End

  • React
  • Web Audio API
  • @picovoice/web-voice-processor
  • Socket.io-client
  • @emotion/core and styled
  • storybook

🧱 Back End

  • Express
  • MongoDB
  • DeepSpeech
  • Node Voice Activity Detection
  • Socket.io

Get Started

To use DeepSpeechNotes, you must meet the following requirements:

  1. node.js
  2. npm
  3. MongoDB

After you moved the repository content to your webspace, you need to run these preconfigured scripts from the repository root directory:

  1. npm prod-prebuild
  2. npm prod-build
  3. Rename .env.example into .env and change content to desired port and connect to your MongoDB

The Express server will handle the following requests:

  1. https://your-url.com/storybook will route to the storybook build
  2. All other requests (including https://your-url.com) will route to the React application build (= "client/build")

You need a pretrained model for DeepSpeech to work. Please look at this readme to find out how to download the model.

Roadmap and contributing

Please have a look at open issues and maybe add your own πŸ’‘.

Contributions are greatly appreciated:

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Marc Haupt - Twitter: @Marc_Haupt - GitHub: hauptdigital - marc@haupt.digital

Twitter Follow GitHub followers

Project Link: https://github.com/hauptdigital/deepspeech-notes

Acknowledgements

About

DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.

https://deepspeech-notes.haupt.digital

License:MIT License


Languages

Language:JavaScript 97.4%Language:HTML 2.6%