Gavin Blair's starred repositories
RealTimeSpeechRecognition
Various approaches for speech recognition and speaker diarization.
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
ai-clone-whatsapp
Create an AI clone of yourself from your WhatsApp chats (using Llama 3)
sagittarius
A GPT-4/Gemini Voice/Video Exploration Tool
LLMDataHub
A quick guide to LLM datasets, especially trending instruction fine-tuning datasets
Reddit-DailyDigest-Bot
Reddit Daily Digest Bot is a Python-based bot that automates the compilation of top posts from specified subreddits into a daily digest. Users can customize sorting methods, time ranges, and the target subreddit for automated posting. The bot simplifies the process of staying updated on favorite subreddits by delivering a formatted summary.
python-oauth2-cli-auth
Authenticate against an OAuth2 provider in Python CLIs
superduper
Superduper: Bring AI to your database! Integrate AI models and workflows with your database to implement custom AI applications, without moving your data. Includes streaming inference, scalable model hosting, training, and vector search.
RealtimeTTS
Converts text to speech in real time
self-operating-computer
A framework to enable multimodal models to operate a computer.
documentation
Official documentation for Pieces for Developers
Google-Colab-Selenium
The best way to use Selenium in Google Colab Notebooks!
OpenAI_Agent_Swarm
HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"
LocalAIVoiceChat
Local AI voice chat with a custom voice, based on the Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
openai-cookbook
Examples and guides for using the OpenAI API