StanGirard / speechdigest

Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit

Home Page:https://speechdigest.streamlit.app

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Speech Digest

This Streamlit app allows users to upload an audio file, transcribe the audio using OpenAI's Whisper ASR, and then summarize the transcription using OpenAI's GPT-3.5-turbo language model.

Features

  • Upload audio files in supported formats (m4a, mp3, webm, mp4, mpga, wav, and mpeg)
  • Transcribe audio using OpenAI's Whisper ASR
  • Summarize transcribed audio using OpenAI's GPT-3.5-turbo
  • Provide custom API key and select the desired model

Installation

Prerequisites

  • Python 3.6 or higher
  • Streamlit
  • OpenAI Python library (v0.27.0 or higher)

Steps

  1. Clone the repository:
git clone https://github.com/StanGirard/speechdigest
  1. Change into the repository's directory:
cd speechdigest
  1. Install the required packages:
pip install -r requirements.txt
  1. Set your OpenAI API key as an environment variable:
export OPENAI_API_KEY="your-api-key"
  1. Run the Streamlit app:
streamlit run app.py

The app will now be accessible at http://localhost:8501 in your browser.

Usage

  1. Open the app in your browser.
  2. Enter your OpenAI API key and select the desired model (GPT-4 or GPT-3.5-turbo) using the input fields.
  3. Upload an audio file in a supported format using the file uploader.
  4. The app will transcribe the audio using Whisper ASR and display the transcription.
  5. The app will then summarize the transcription using the selected language model and display the summary.

Contributing

Feel free to submit issues, feature requests, or pull requests. We appreciate any contribution to improve the app.

About

Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit

https://speechdigest.streamlit.app


Languages

Language:Python 100.0%