pavankalyan066 / streamlit_whisper_transcription

Streamlit Audio Transcription with OPENAI's Whisper Ai: An interactive Streamlit app demonstrating real-time audio transcription using OPENAI's Whisper Ai.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Whisper Audio Transcription Streamlit App

This Streamlit application demonstrates the capabilities of OpenAI's Whisper ASR (Automatic Speech Recognition) system. Users can record or upload audio files in various formats and receive transcriptions generated by Whisper in real-time. Transcriptions can be saved as text files and downloaded for further use.

Features

  • Record audio directly from the app's interface
  • Upload audio files in .mp3, .mp4, .wav, or .m4a format
  • Display transcriptions in real-time
  • Save and download transcriptions as text files

Installation

Follow these steps to install and run the application:

  1. Clone the repository

     git clone https://github.com/your-repo-url/streamlit-whisper-transcription.git
     cd streamlit-whisper-transcription
    
  2. Create a virtual environment and install dependencies

     python -m venv venv
     source venv/bin/activate # For Windows, use 'venv\Scripts\activate'
     pip install -r requirements.txt
    
  3. Create a .env file in the project's root directory and add your OpenAI API key:

      OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
    

Usage

Local Development

Run the Streamlit app with the following command:

streamlit run app.py

Open your web browser and navigate to http://localhost:8501 to access the application.

Docker Deployment

Build the Docker image:

docker build -t streamlit-whisper-transcription .

Run the Docker image:

docker run -p 8501:8501 streamlit-whisper-transcription

Open your web browser and navigate to http://localhost:8501 to access the application.

Limitations and Disclaimer

This app is a demonstration of the potential of OpenAI's Whisper ASR system for audio transcription. The accuracy of the transcriptions depends on various factors such as the quality of the audio file, the language spoken, and background noise. The app is not intended for use in production environments and should be used for demonstration purposes only.

Contributing

If you'd like to contribute to this project, please feel free to submit a pull request or create an issue for any bugs or feature requests. Your contributions are welcome and appreciated!

About

Streamlit Audio Transcription with OPENAI's Whisper Ai: An interactive Streamlit app demonstrating real-time audio transcription using OPENAI's Whisper Ai.

License:MIT License


Languages

Language:Python 90.0%Language:Dockerfile 10.0%