paulhshort's starred repositories
human-reader-chrome-extension
Let the web speak like a human. A simple Chrome extension that uses the elevenlabs.io API to convert any text to speech.
django-connectwise
Django app for working with ConnectWise REST API. Defines models (tickets, members, companies, etc.) and callbacks. Used in https://www.topleft.team/
pyconnectwise
A library for simplifying interactions with the ConnectWise Manage API in Python
claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
Cheatsheets
A collection of all my personal cheat sheets and guides as I progress through my career in offensive security.
PowerShdll
Run PowerShell with rundll32. Bypass software restrictions.
SessionGopher
SessionGopher is a PowerShell tool that uses WMI to extract saved session information for remote access tools such as WinSCP, PuTTY, SuperPuTTY, FileZilla, and Microsoft Remote Desktop. It can be run remotely or locally.
InsightCraft
A versatile AI, powered by Google's Gemini, that can understand text, images, and audio, and even generate code.
AudioProcessingApplication
An audio analysis app powered by Python, Streamlit, and the Gemini 1.5 model.
Audio-Summarization-App-with-Gemini-1.5
An AI app built with the Gemini model that summarizes audio
voice-chat-gemini
A simple chat assistant that takes spoken audio as input and tries to return a Gemini audio response.
MultiModal-Model
This project is a multi-modal model that works with multiple models combined and accepts audio, images, and text as inputs, generating corresponding audio, images, and text outputs.
gemini-ai-processaudio-js
Process audio files with the Gemini API in JavaScript
transcriber
Transcriber & translator for audio files. Like Otter.ai but free and built with Gemini 1.5 Pro.
Gemini1.5-mp3-Summerization
Summarize an audio file with Gemini 1.5 Pro
gemini-ai-toolkit
Unlock the potential of Google's Gemini AI models with this versatile toolkit. It offers chat, text generation, and multimodal interactions, and supports various file types, including PDFs, images, videos, audio, text, and more. Enjoy real-time responses, customizable parameters, and easy integration for diverse AI tasks.
TrueAudioVIdeoGemini
A repo demonstrating Gemini 1.5 Pro's ability to ingest audio directly rather than just transcribed text: it can listen to qualities of voice, guess regional accents, and other things. Use your own Vertex API key.
ai_multimodal_webapp
Production-ready AI assistant web app
superpilot
LLM-based multi-model framework for building AI apps.
Hands-on-MLLM
Multimodal LLM gadgets implementation
vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs
openplugin
👐🔌 LLM Tool Runner - Chat with your APIs - Abstraction layer over Function Calling - https://openplugin.com/
big-AGI
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.
gemini-1-5-multimodal-chat-next
A simple Web UI for using the Gemini 1.5 model with Next.js