paulhshort's starred repositories

human-reader-chrome-extension

Let the web speak like a human. A simple Chrome extension that uses the elevenlabs.io API to convert any text to speech.

Language: JavaScript | License: MIT | Stargazers: 12 | Issues: 0

django-connectwise

Django app for working with the ConnectWise REST API. Defines models (tickets, members, companies, etc.) and callbacks. Used in https://www.topleft.team/

Language: Python | License: MIT | Stargazers: 12 | Issues: 0

pyconnectwise

A library for simplifying interactions with the ConnectWise Manage API in Python

Language: Python | License: GPL-3.0 | Stargazers: 44 | Issues: 0

claude-engineer

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.

Language: Python | Stargazers: 6983 | Issues: 0

Cheatsheets

A collection of all my personal cheat sheets and guides as I progress through my career in offensive security.

Stargazers: 61 | Issues: 0

PowerShdll

Run PowerShell with rundll32. Bypass software restrictions.

Language: C# | License: MIT | Stargazers: 1734 | Issues: 0

Chimera

Chimera is a PowerShell obfuscation script designed to bypass AMSI and commercial antivirus solutions.

Language: PowerShell | Stargazers: 1379 | Issues: 0

SessionGopher

SessionGopher is a PowerShell tool that uses WMI to extract saved session information for remote access tools such as WinSCP, PuTTY, SuperPuTTY, FileZilla, and Microsoft Remote Desktop. It can be run remotely or locally.

Language: PowerShell | Stargazers: 1184 | Issues: 0

InsightCraft

A powerful and versatile AI, powered by Google's Gemini, that can understand text, images, and audio, and even generate code.

Language: Kotlin | Stargazers: 5 | Issues: 0

AudioProcessingApplication

Powered by Python, Streamlit, and the Gemini 1.5 model, this app is a game-changer in audio analysis.

Language: Jupyter Notebook | Stargazers: 3 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0

Audio-Summarization-App-with-Gemini-1.5

An AI app built with the Gemini model that summarizes audio.

Language: Jupyter Notebook | Stargazers: 2 | Issues: 0

voice-chat-gemini

A simple chat assistant that takes spoken audio as input and returns a Gemini audio response.

Language: Python | Stargazers: 2 | Issues: 0
Language: JavaScript | Stargazers: 3 | Issues: 0

MultiModal-Model

This project is a multi-modal model that works with multiple models combined and accepts audio, images, and text as inputs, generating corresponding audio, images, and text outputs.

Language: Python | Stargazers: 7 | Issues: 0
Stargazers: 2 | Issues: 0

gemini-ai-processaudio-js

Process audio files with the Gemini API in JavaScript

Language: JavaScript | Stargazers: 1 | Issues: 0

transcriber

Transcriber & translator for audio files. Like Otter.ai but free and built with Gemini 1.5 Pro.

Language: Python | License: MIT | Stargazers: 5 | Issues: 0

Gemini1.5-mp3-Summerization

Summarize an audio file with Gemini 1.5 Pro

Language: Jupyter Notebook | Stargazers: 3 | Issues: 0

gemini-ai-toolkit

Unlock the potential of Google's Gemini AI models with this versatile toolkit. It offers seamless chat, text generation, and multimodal interactions, and supports various file types, including PDFs, images, videos, audio, text, and more. Enjoy real-time responses, customizable parameters, and easy integration for diverse AI tasks.

Language: Python | License: MIT | Stargazers: 36 | Issues: 0

TrueAudioVIdeoGemini

This is a repo demonstrating Gemini 1.5 Pro's ability to ingest audio itself, not just transcribed text: it can listen to qualities of voice, guess regional accents, and other things. Use your own Vertex API key.

Language: TypeScript | License: MIT | Stargazers: 2 | Issues: 0

ai_multimodal_webapp

Production-ready AI assistant web app

Language: TypeScript | Stargazers: 1 | Issues: 0

thepipe

Extract markdown and images from PDFs, URLs, docs, slides, and more, ready for multimodal LLMs. ⚡

Language: Python | License: MIT | Stargazers: 872 | Issues: 0

superpilot

LLM-based multi-model framework for building AI apps.

Language: Python | License: MIT | Stargazers: 5 | Issues: 0

Hands-on-MLLM

Implementations of multimodal LLM gadgets

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 534 | Issues: 0

openplugin

👐🔌 LLM Tool Runner - Chat with your APIs - Abstraction layer over Function Calling - https://openplugin.com/

Language: Python | License: NOASSERTION | Stargazers: 7 | Issues: 0

big-AGI

Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.

Language: TypeScript | License: MIT | Stargazers: 5016 | Issues: 0

gemini-1-5-multimodal-chat-next

A simple Web UI for using the Gemini 1.5 model with Next.js

Language: TypeScript | License: MIT | Stargazers: 3 | Issues: 0