toshvelaga / gemini-cup-demo-gpt4

Recreating the gemini cup demo with GPT-4V

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Recreating the recent Google Gemini Demo using OpenAI's GPT-4V model.

Setup

  • First step is to split your video up into images using the script splitVideoIntoImages.js
  • Second step is to run index.js. This will convert all the images stored in /images into base64 format and send them to the OpenAI GPT-4V model.
  • Be sure to add your open ai api key in an .env file.

Results

  • The results are frequently inconsistent, however I've noticed that if you give it how many steps there are and to list what happens in each step the accuracy is much higher.

Improvements

  • better prompt engineering

About

Recreating the gemini cup demo with GPT-4V


Languages

Language:JavaScript 100.0%