ai-vision anthropic-claude ollama openai claude gpt-4o-mini homelab image-recognition llama mistral self-hosted gpt4o llama3-1 ai-chatbot-framework ai-image-recognition

AI Screen Analyzer

AI Screen Analyzer is a powerful web application that allows users to capture screenshots, analyze them using various AI providers and models, and engage in conversations about the captured images.

🚀 Features

Modern UI: Sleek, responsive interface with dark/light mode and glass morphism effects
Screen Capture: Easily capture screenshots of your desktop or specific windows
Multi-Model AI Analysis: Analyze images using multiple AI models:
- OpenAI's GPT-4o Vision
- Anthropic's Claude 3
- Ollama's local models (including LLaVA)
Intelligent Chat: Engage in conversations about the analyzed images or any topic
Model Switching: Seamlessly switch between different AI models without losing context
Dark/Light Mode: Toggle between dark and light themes based on preference or system settings
Capture History: View and manage your recent screen captures
Responsive Design: Works great on desktop and mobile devices
Local Setup: Run the application locally for enhanced privacy and customization
Docker Support: Run in Docker for easy deployment

✨ Use Cases

UI/UX Research: Capture and analyze interfaces for design inspiration
Code Generation: Capture a website you like and ask the AI to provide the code
Technical Support: Take screenshots of errors and get AI assistance
Content Analysis: Analyze charts, graphs, or visual data
Learning Tool: Ask questions about anything you see on your screen

🚀 Quick Start

Docker (Recommended)

Add your API keys in .env file:

docker-compose up -d --build

Visit http://localhost:3000

Prerequisites:

Docker
Node.js (v22 or higher)
npm

Local Setup

Clone the repository:

git clone https://github.com/bigsk1/ai-screen-analyzer.git
cd ai-screen-analyzer

Install dependencies:
```
npm install
```

Create a .env file in the root directory and add your API keys:

REACT_APP_OPENAI_API_KEY=your_openai_api_key
ANTHROPIC_API_KEY=your_anthropic_api_key
ANTHROPIC_MODEL=claude-3-5-sonnet-20241022

OLLAMA_API_URL=http://localhost:11434

Start the development server:
```
npm run dev
```
Open your browser and navigate to http://localhost:3000.

AI Screen Analyzer in action

🧰 Technologies Used

Frontend: React, Tailwind CSS
Backend: Node.js, Express
AI Services: OpenAI API, Anthropic API, Ollama
Containerization: Docker

🔧 Configuration

Add your API keys in .env file
To change the default Anthropic model, update the ANTHROPIC_MODEL variable
For Ollama, the default URL is http://localhost:11434 when running natively, and host.docker.internal:11434 in Docker

About

AI Screen Analyzer allows users to capture screenshots, analyze them using various AI providers and models, and engage in conversations about the images.

https://bigsk1.com/posts/AI-Screen-Analyzer/

ai-vision anthropic-claude ollama openai claude gpt-4o-mini homelab image-recognition llama mistral self-hosted gpt4o llama3-1 ai-chatbot-framework ai-image-recognition

MIT License

Languages

Language:JavaScript 94.2%Language:CSS 3.9%Language:HTML 1.5%Language:Dockerfile 0.4%