Chat Out Loud with GPT

Hands-free companionship on demand. Reasonably non-laggy thanks to async + stream=true.

Usage

brew install portaudio
export OPENAI_API_KEY={your key}
export CHARACTR_CLIENT_KEY={your key}
export CHARACTR_API_KEY={your key}
pip install -r requirements.txt
python chat_out_loud.py

Mods

By default it pretends to be Snoop Dogg. Feel free to change the system message to your liking. Or to change the GPT model from 3.5 to 4.
You may need to tweak the silence_threshold in the record function depending on the sensitivity of your mic and the environment you're in. The higher it is, the louder you need to talk, but if it's too low the background noise may keep it from ever turning off.
You can choose different voices from charactr (throw a breakpoint and call charactr_api.get_voices() to see available, or swap in a different TTS system entirely if you want to clone your own voice or Snoop's voice or whatever.

How does it work?

Pyaudio listens until it hears a sound, then listens until it stops hearing a sound, then writes to a file.
The file is sent to Whisper for transcription.
The transcription is appended to the system message + any previous messages in the convo and sent to ChatGPT.
ChatGPT's response is streamed back; when it finishes a sentence, the sentence is sent to Charactr for TTS
The resulting TTS plays out loud while 3 and 4 continue in the background
Repeat!

Credits

Written by me and my buddy Alec.

vicktor / chat_out_loud_gpt

Chat Out Loud with GPT

Usage

Mods

How does it work?

Credits

About

Languages