Utsav's repositories
AICommand
ChatGPT integration with Unity Editor
autotab-starter
Build browser agents for real world tasks
bananalyzer
Open source AI Agent evaluation framework for web tasks ๐๐
core-medium-parser
Core of Medium.com parser
dalle3-x-post
Automatically generate and post images (dalle3) and text (gpt4) to Twitter
deepmind-concordia
A library for generative social simulation
gen_ai_utils
A place for my generative AI concepts and ideas
gpt_chatwithPDF
GPT chat with your docs!
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
latentverse
Portal hopping with Stable Diffusion ๐พ
Leaked-GPTs
Leaked GPTs Prompts Bypass the 25 message limit or to try out GPTs without a Plus subscription.
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
ollama-voice-mac
Mac compatible Ollama Voice
open-interpreter-website
Website for the Open Interpreter project
perplexity-ai-app
The Unofficial Perplexity AI Desktop App, powered by Electron which brings the magic of AI language processing to your desktop.
pinokio
AI Browser
pinokiod
Backend for https://github.com/pinokiocomputer/pinokio
portkey-ai-gateway
A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API.
resemble-enhance
AI powered speech denoising and enhancement
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Shush
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
tarsier
Vision utilities for web interaction agents ๐
twenty
Building a modern alternative to Salesforce, powered by the community.
vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
web-freedium-medium-parser
Web application for Freedium.cfd