CAprogs / Automatic_Speech_Recognition-Recorder

๐Ÿ—ฃ๏ธ Record and translate audio into text using 'Google Speech Recognition' (Speech-To-Text). 3 languages supported (French, English, Arabic [Morocco]).


Software for Audio Transcription - Speech To Text

Translations (README)

Technologies 📲

See the requirements file for all the packages used in this project.

Features ☄️

  • Import audio files
  • Record audio (in .wav)
  • Transcribe audio files
  • Play / Replay audio files

Get Started 🧞‍♂️

Download the latest release and launch the app

  1. Choose a language
  2. Choose either to record audio or import a file

🎉 The transcription appears on your screen after a few seconds!

  3. (Optional) Play or replay the imported or recorded audio within the app
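
The steps above map onto a small amount of code. The sketch below assumes the app relies on the third-party `SpeechRecognition` package (`pip install SpeechRecognition`), which this README does not confirm. The `LANGUAGE_CODES` table and the function names are hypothetical, and the locale codes are the usual tags for the three supported languages.

```python
# Minimal sketch, not the app's actual code. Assumes the third-party
# SpeechRecognition package; every name below is illustrative.

# Hypothetical mapping from the language menu to locale codes.
LANGUAGE_CODES = {
    "French": "fr-FR",
    "English": "en-US",
    "Arabic (Morocco)": "ar-MA",
}


def language_code(name):
    """Return the locale code for a supported language name."""
    try:
        return LANGUAGE_CODES[name]
    except KeyError:
        raise ValueError(f"Unsupported language: {name!r}") from None


def transcribe_file(path, language="English"):
    """Transcribe a .wav file with Google Speech Recognition (needs internet)."""
    import speech_recognition as sr  # lazy import: third-party dependency

    recognizer = sr.Recognizer()
    with sr.AudioFile(path) as source:
        audio = recognizer.record(source)  # read the whole file
    return recognizer.recognize_google(audio, language=language_code(language))


def record_and_transcribe(language="English", seconds=10):
    """Record from the default microphone, then transcribe (needs PyAudio)."""
    import speech_recognition as sr  # lazy import: third-party dependency

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.record(source, duration=seconds)
    return recognizer.recognize_google(audio, language=language_code(language))
```

In that package, `sr.AudioFile` reads WAV, AIFF, and FLAC, so an .mp3 would have to be converted first. `recognize_google` raises `sr.UnknownValueError` when the speech cannot be understood and `sr.RequestError` when the service cannot be reached.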

App View 👁️

Screenshot 2023-07-28 at 16 07 49

Releases 📤

Name                Supported OS   Version
ASR_REC_Windows     Windows        v1.0.0
ASR_REC_Mac-arm64   Mac            v1.0.1

Roadmap 🚧

  • Add models to the software
  • Add languages
  • Create the config.json file (choose the model to use; control the deletion of files)
  • Create documentation (doc file + demo + GIF + app view)
  • Update README
  • Deployment (Docker, update .EXE releases)
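
The planned config.json could be loaded along the lines of the sketch below. The file does not exist yet, so every key name and default value here is hypothetical. The sketch only illustrates the two settings the roadmap mentions: model choice and file deletion.

```python
import json
from pathlib import Path

# Hypothetical defaults; the real config.json is still a roadmap item.
DEFAULT_CONFIG = {
    "model": "google",             # which recognition model to use
    "delete_files_on_exit": True,  # whether temporary audio files are removed
}


def load_config(path="config.json"):
    """Load settings, falling back to defaults for a missing file or key."""
    config = dict(DEFAULT_CONFIG)
    file = Path(path)
    if file.exists():
        # Values found in the file override the defaults key by key.
        config.update(json.loads(file.read_text(encoding="utf-8")))
    return config
```

Merging the file over a dictionary of defaults keeps the app usable when the file is absent or only partly filled in.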

Limitations & Support ⚡️

  • Supports 3 default languages: French, English, and Arabic (Morocco).
  • Requires internet access.
  • Uses the "Google Speech Recognition" neural network.
  • The Import File button lets you transcribe audio files shorter than about 1 minute.
  • The Record button lets you record audio from your PC's microphone. The transcription then appears in the TEXT area.
  • Recording starts IMMEDIATELY when you press the Record button.
  • The recording duration is limited to 10 seconds by default.
  • The "Listen" button lets you play back your audio, whether it was imported or recorded within the application.

Audio type   Supported
.wav         ✅
.mp3         ✅
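
The limits listed above can be checked before any audio is sent for transcription. The sketch below uses only the Python standard library. The constant and function names are hypothetical, and the 60-second threshold is an assumption standing in for the approximate 1-minute limit.

```python
import wave
from pathlib import Path

SUPPORTED_EXTENSIONS = {".wav", ".mp3"}  # audio types listed above
MAX_IMPORT_SECONDS = 60                  # assumed value for "about 1 minute"
DEFAULT_RECORD_SECONDS = 10              # default recording limit


def is_supported(path):
    """True if the file extension is one of the supported audio types."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def wav_duration_seconds(path):
    """Duration of a .wav file, from its frame count and sample rate."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def can_import(path):
    """True if a .wav file is supported and short enough to transcribe."""
    return is_supported(path) and wav_duration_seconds(path) < MAX_IMPORT_SECONDS
```

The `wave` module only reads .wav files, so measuring the length of an .mp3 would need a separate decoder.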

Author ✍️

Contributions 📝

Contributions are always welcome!

See contributing.md for ways to get started.

License 📝

This project is licensed under the GNU General Public License v3.0.

Donation & Support ❤️

If you like this project, feel free to leave a 🌟!

About

๐Ÿ—ฃ๏ธ Record and translate audio into text using 'Google Speech Recognition' (Speech-To-Text). 3 languages supported (French, English, Arabic [Morocco]).

License: GNU General Public License v3.0


Languages

Language: Python 100.0%