paulhshort's starred repositories
Cheatsheets
A collection of all my personal cheat sheets and guides as I progress through my career in offensive security.
PowerShdll
Run PowerShell with rundll32. Bypass software restrictions.
SessionGopher
SessionGopher is a PowerShell tool that uses WMI to extract saved session information for remote access tools such as WinSCP, PuTTY, SuperPuTTY, FileZilla, and Microsoft Remote Desktop. It can be run remotely or locally.
InsightCraft
A powerful and versatile AI that can understand text, images, audio, and even generate code powered by Google's Gemini.
AudioProcessingApplication
Powered by Python, Streamlit, and the Gemini 1.5 model, this app is a game-changer in audio analysis.
Audio-Summarization-App-with-Gemini-1.5
An AI app built with Gemini model that summaries audio
voice-chat-gemini
A simple assistant chat that tries to take audio speech for an Gemini audio response.
MultiModal-Model
This project is a multi-modal model that works with multiple models combined and accepts audio, images, and text as inputs, generating corresponding audio, images, and text outputs.
gemini-ai-processaudio-js
Process Audio Files With Gemini Api In Javascript
transcriber
Transcriber & translator for audio files. Like Otter.ai but free and build with Gemini 1.5 Pro.
Gemini1.5-mp3-Summerization
Summarize an audio file with Gemini 1.5 Pro
gemini-ai-toolkit
A versatile CLI and Python wrapper for Google's Gemini Pro large language models. Streamline the creation of chatbots, generate dynamic text, analyze images and transcribe audio with ease.
TrueAudioVIdeoGemini
This is a repo demonstrating Gemini 1.5 pros ability to ingest audio and not just transcribed text it can listen to qualities of voice guest regional accents and other things. Out! Use your own vertex api key enter that funny
ai_multimodal_webapp
Production Ready AI Assistant Web App
superpilot
LLMs based multi-model framework for building AI apps.
Hands-on-MLLM
Multimodal LLM gadgets implementation
vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs
openplugin
๐๐ LLM Tool Runner - Chat with your APIs - Abstraction layer over Function Calling - https://openplugin.com/
big-AGI
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
gemini-1-5-multimodal-chat-next
A simple Web UI for using the Gemini 1.5 model with Next.js
ui-components
React component library for crafting user-friendly and engaging conversational experiences
Multimodal-RAG-Gemini-MongoDB
This is a sample code implementation of Multimodal RAG using Google Gemini & MongoDB Altas Vector Search