mirawara / transcriber

Transcription of any large audio/video files.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Transcriber

project-image

Transcriber is a python program that allows you to transcribe any large audio and video into any language using the SpeechRecognition library and the Google API. It splits the file into chunks during silent moments because the library doesn't accept files that exceed a certain size.

🧐 Features

Here're some of the project's best features:

  • Noise reduction
  • Volume increase

🛠️ Installation Steps:

Install the requirements:

pip -r requirements.txt

🖥️ Usage:

Manual:

transcriber.py [-h] -f FILE [-nr NOISE] -o OUT [-iv IV] [-l LANG]

options:
  -h, --help            show this help message and exit
  -f FILE, --file FILE  Path to audio file
  -nr NOISE, --noise-reduction NOISE
                        Noise reduction: there are two levels: level 1 - Basic noise reduction (recommended) level 2 - Massive noise reduction
  -o OUT, --output OUT  Path to output file
  -iv IV, --increase-volume IV
                        Increase volume: you have to provide a float from 0 to 3 in the form int.dec
  -l LANG, --language LANG
                        Language (Default: en-EN)

Example:

python3 transcriber.py -f audio_example/Subconscious_Learning.mp3  -o result.txt -nr 1

😉 Tips:

  1. Use ChatGPT to write the transcript better, punctuation included. It can also be useful for summaries and maps.
  2. If the transcription fails even after trying the various levels of noise reduction and volume increase, try changing the parameters of the 'split_on_silence' function to better suit your requirements.

💖Like my work?

Contact me if you have any corrections or additional features to offer me.

About

Transcription of any large audio/video files.

License:MIT License


Languages

Language:Python 100.0%