mjaskolski/AIVoiceChat

Seamless and real-time voice interaction with AI.

Uses faster_whisper and elevenlabs input streaming for low latency responses to spoken input.

Note: The demo is conducted on a 10Mbit/s connection, so actual performance might be more impressive on faster connections.

voice_talk_vad.py - automatically detects speech

voice_talk.py - toggle recording on/off with the spacebar

🛠 Setup:

Replace your_openai_key and your_elevenlabs_key with your OpenAI and ElevenLabs API key values in the code.

Install the required Python libraries:

pip install openai elevenlabs pyaudio wave keyboard faster_whisper numpy torch

Execute the main script based on your mode preference:

python voice_talk_vad.py

python voice_talk.py

Talk into your microphone.
Listen to the reply.

Feel free to fork, improve, and submit pull requests. If you're considering significant changes or additions, please start by opening an issue.

Huge shoutout to: