audio-processing openai recording summarization gpt gpt-4 whisper gpt-3 llm

Speech Digest

This Streamlit app allows users to upload an audio file, transcribe the audio using OpenAI's Whisper ASR, and then summarize the transcription using OpenAI's GPT-3.5-turbo language model.

Features

Upload audio files in supported formats (m4a, mp3, webm, mp4, mpga, wav, and mpeg)
Transcribe audio using OpenAI's Whisper ASR
Summarize transcribed audio using OpenAI's GPT-3.5-turbo
Provide custom API key and select the desired model

Installation

Prerequisites

Python 3.6 or higher
Streamlit
OpenAI Python library (v0.27.0 or higher)

Steps

Clone the repository:

git clone https://github.com/StanGirard/speechdigest

Change into the repository's directory:

cd speechdigest

Install the required packages:

pip install -r requirements.txt

Set your OpenAI API key as an environment variable:

export OPENAI_API_KEY="your-api-key"

Run the Streamlit app:

streamlit run app.py

The app will now be accessible at http://localhost:8501 in your browser.

Usage

Open the app in your browser.
Enter your OpenAI API key and select the desired model (GPT-4 or GPT-3.5-turbo) using the input fields.
Upload an audio file in a supported format using the file uploader.
The app will transcribe the audio using Whisper ASR and display the transcription.
The app will then summarize the transcription using the selected language model and display the summary.

Contributing

Feel free to submit issues, feature requests, or pull requests. We appreciate any contribution to improve the app.

About

Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit

https://speechdigest.streamlit.app

audio-processing openai recording summarization gpt gpt-4 whisper gpt-3 llm

Languages

Language:Python 100.0%