PicRoast

This repository contains code for PicRoast app.

PicRoast lets you capture a photo and turn into a comedy roast. It uses OpenAI GPT-4 with vision and ElevenLabs text-to-speech APIs.

How It Was Built

PicRoast was built an experiment to explore capabilites of GPT-4, and especially the new vision support, to generate code.

Most of the code was generate using GPT, with combo of text as well as visual instructions created with Whimsical Wireframes.

For visual intructions the flow was pretty straight forward:

Create wireframes in Whimsical
Add some annotations and flows for logic
Select and copy as image (Cmd-Shift-C)
Paste it in ChatGPT and ask it to update code (assuming it was provided to ChatGPT earlier) based on diagram
Iterate and repeat as needed.

Here's some example Whimsical snapshots that were used in the process:

All the bitmap images were also generate using new DALL·E 3 support in GPT-4.

export OPENAI_API_KEY=YOUR_API_KEY
export ELEVEN_API_KEY=YOUR_API_KEY

Update voice IDs in speech.ts (these need to be added to your VoiceLab in ElevenLabs)

Run the development server:

npm run dev

Open http://localhost:3000 with your browser to see the result.

Language:TypeScript 95.2%Language:CSS 3.9%Language:JavaScript 0.9%