Transcript Generation Script

This Python script is designed to generate realistic and engaging transcripts for simultaneous interpreters, simulating the flow of conversation and interaction between participants in various sessions of an upcoming event.

Prerequisites

The following files should be available in the same directory as the script:

gemini_service_account.json: Google service account credentials file.
agenda.md: A Markdown file containing the overall agenda of the event and details about specific sessions.
data_list.pkl: A pickle file containing a list of session details to generate transcripts for.

Usage

Install the required Python packages:

vertexai
pandas

Update the google_sev_acc_path variable with the path to your Google service account credentials file.
Run the script

The script will generate transcripts for the specified sessions, based on the provided context and session details. The generated transcripts will be saved in a pickle file named gemini_final_result_{timestamp}.pkl, where {timestamp} is the current date and time.

Output

The script generates the following output files:

gemini_final_result_{timestamp}.pkl: A pickle file containing the generated transcripts for each session.
failed_files_{timestamp}.txt: A text file listing any sessions for which the transcript generation failed.
df_result_{timestamp}.pkl: A pickle file containing intermediate results during the script execution.
labeling_biocon_sentences_{timestamp}.log: A log file containing information about the script's execution.

Script Overview

The script performs the following tasks:

Configures logging for the script's execution.
Reads the context (agenda and session details) from the agenda.md file.
Loads the list of sessions from the data_list.pkl file.
Initializes the VertexAI GenerativeModel for transcript generation.
Processes each session in parallel using a ThreadPoolExecutor.
For each session, generates a prompt based on the provided context and session details.
Generates the transcript using the VertexAI GenerativeModel.
Saves the generated transcripts and handles any failures during the generation process.
Writes the final results to the output files.

Note: The script uses the VertexAI GenerativeModel to generate the transcripts, which may incur costs based on your usage and pricing plan.

akbargherbal / gemini_pro_helper

Transcript Generation Script

Prerequisites

Usage

Output

Script Overview

About

Languages