natelalor / AI_report_generator

A tool that converts long audio files into a thorough, summarized report. It uses the OpenAI API (ChatGPT backend), LangChain for text processing, and Pinecone as the vector database.

Version 2 is here, ladies and gents!

Mere hours after version 1 was released, here's version 2: shallow_langchain_report_with_audio!!

Version 1 file: shallow_langchain_report.py (kept in the repo for text-only use)

Version 2 file: shallow_langchain_report_with_audio.py (Newest and bestest!)

shallow_langchain_report_with_audio combines the first version with new code that supports audio files. You can now run version 2 and submit an audio file (.m4a, etc.), and it will create a .txt file named "audio_generated_text.txt" in the same directory. From there, it will read that .txt file and produce a summarized report on the content! Is this not the coolest thing ever?

It is still named "shallow" because the backend is not yet implemented, which often results in your output being cut off (a long transcript can exceed the model's context window). This will be solved in Version 3, with a Pinecone backend to help support longer sessions. Stay tuned for Version 3!
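Until that backend exists, one common workaround for cut-off output is to split the transcript into smaller pieces, summarize each piece, and then summarize the summaries. The sketch below shows that idea only; it is not the planned Pinecone approach and not code from this repo. The names split_into_chunks and summarize_long_text are hypothetical, the summarize parameter is a stand-in for the real LLM call, and the word count is a crude assumption in place of real token counting.

```python
from typing import Callable


def split_into_chunks(text: str, max_words: int = 500) -> list[str]:
    """Split text into chunks of at most `max_words` whitespace-separated words."""
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    return [
        " ".join(words[i : i + max_words])
        for i in range(0, len(words), max_words)
    ]


def summarize_long_text(
    text: str,
    summarize: Callable[[str], str],
    max_words: int = 500,
) -> str:
    """Summarize each chunk, then summarize the combined partial summaries.

    `summarize` is a hypothetical callable standing in for the LLM call.
    """
    partial = [summarize(chunk) for chunk in split_into_chunks(text, max_words)]
    if not partial:
        return ""
    if len(partial) == 1:
        return partial[0]
    # Note: the joined summaries could still be too long for very large
    # inputs; a fuller version would repeat this step until they fit.
    return summarize("\n".join(partial))
```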

Usage Instructions

Download the .py files by cloning the repo, or by any other method you prefer.

Then, find the llm_initialization function (its call is the first line in main), and paste your OpenAI API key in place of the placeholder string "sk-...."
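If you would rather not keep the key in the source file, one option is to read it from an environment variable and pass the result in wherever the "sk-...." placeholder appears. This is a sketch under assumptions: load_api_key is a hypothetical helper that is not part of this repo, and OPENAI_API_KEY is simply the conventional variable name for OpenAI keys.

```python
import os


def load_api_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Return the API key stored in `env_var`, or raise if it is missing.

    Hypothetical helper: the repo itself expects the key to be pasted
    directly into the llm_initialization function.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your OpenAI API key."
        )
    return key
```

This keeps the key out of version control if you later commit your copy of the script.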

  • To use version 1 (text-to-report), run shallow_langchain_report.py somewhere with a console window (such as an IDE).
  • To use version 2 (audio-to-text-to-report), run shallow_langchain_report_with_audio.py somewhere with a console window (such as an IDE).

The only other requirement for version 2 is an audio file, which you provide as user input. We have tested it on .m4a files, the format produced by the "Voice Memos" application on iPhones. Testing and support for other file types will come later.


This tool is a part of Blueprint's AI tool development initiative. Reach out to them: LinkedIn

Project Timeline: June 15th, 2023 - July 27th, 2023

Languages

Python 100.0%