paulhshort's repositories
TrueAudioVIdeoGemini
This is a repo demonstrating Gemini 1.5 pros ability to ingest audio and not just transcribed text it can listen to qualities of voice guest regional accents and other things. Out! Use your own vertex api key enter that funny
chatpad
Not just another ChatGPT user-interface!
cookbook
A collection of guides and examples for the Gemini API.
transcribe
Transcribe is OpenAI's chatGPT based real time transcription, conversation, Language learning platform. It provides live transcripts from microphone and speaker. It generates a suggested conversation response using OpenAI's GPT API. It will read out the responses, simulating a real live conversation in English or another language.
speak-gpt
Your personal voice assistant based on OpenAI ChatGPT.
faster-whisper
Faster Whisper transcription with CTranslate2
AlwaysReddy
AlwaysReddy is a LLM voice assistant that is always just a hotkey away.
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
thepipe
Multimodal file/web extraction for GPT-4o in one line of code ⚡
LocalAI
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
lobe-chat
🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
PentestGPT
A GPT-empowered penetration testing tool
gemini-ai-processaudio-js
Process Audio Files With Gemini Api In Javascript