📸 SnapAnsAI Bot

📌 Project Description

SnapAnsAI Bot is a tool for instant answers based on screenshots. The user presses a hotkey, and the screenshot is automatically sent to a Telegram bot, which analyzes the image using OpenAI API and returns a text response.

⚙️ Key Features

✅ Hotkeys — automatic screenshot capture (multiple combinations can be set in config.json).
✅ Automatic image sending to the Telegram bot.
✅ AI Selection — support for OpenAI for image processing.
✅ Flexible settings in config.json (change API, hotkeys, OS, etc.).
✅ Cross-platform: Windows, Linux, macOS (auto-detection of OS).
✅ Logging in bot.log for easy debugging (all errors and events are recorded in the log).
✅ Multilingual support — the bot supports language switching via config.json and adding new languages in lang.json.

🌍 Where It Can Be Used

📚 Education: Screenshots of tasks and instant explanations.
🌝 Work: Capturing important on-screen data and quick analysis.
🧫 Research: Extracting text from charts and documents.
💬 Communication: Processing text and images in real time.
🏃 Daily use: Quickly obtaining text from any screen.
🔍 Proctoring: ?

🛠️ Technology Stack

🐍 Python — main programming language.
🤖 Telegram Bot API — interaction with the bot.
🧠 OpenAI API — image and text processing.
🖥️ PyAutoGUI — screenshot capture (Windows).
🐧 gnome-screenshot — screenshot capture (Ubuntu).
🍏 screencapture — screenshot capture (macOS).
⌨️ pynput — hotkey handler.
🔗 Requests — interaction with APIs.
📃 Logging — logging system.

🔐 Integrated Modules:

flameshot — a screenshot tool

🧷 Architecture

SnapAnsAI Bot consists of several components:

📌 System Components:

1️⃣ Telegram Bot — receives AI-processed messages from the bot.
2️⃣ AI Processing Module — sends images to OpenAI or DeepSeek and retrieves a response.
3️⃣ Screenshot Capture Module — operates via PyAutoGUI, gnome-screenshot, flameshot, or screencapture.
4️⃣ Logging Module — records all events and errors in bot.log.
5️⃣ Configuration Module — loads and manages config.json.

🗂️ Project Structure

snapansai-bot/
│── config/                   # Configuration files
│   ├── config.json           # Configuration file
│   ├── lang.json             # Translation file for multilingual support
│── logs/                     # Logs
│   │── session_temp/         # Temporary files (screenshots and cache)
│   ├── bot.log               # Bot log file
│── methods/                  # Screenshot processing modules
│   ├── flameshot.py          # Screenshot handler via flameshot
│── src/                      # Project source code
│   ├── ai_api.py             # AI API handler
│   ├── bot.py                # Main Telegram bot
│   ├── config_handler.py     # Configuration management (loading, saving)
│   ├── screenshot_sender.py  # Screenshot sending script
│── test/                     # AI testing scripts
│   ├── test_ai.py            # Project launch script
│── venv/                     # Virtual environment
│── README.md                 # Project documentation
│── requirements.txt          # Dependencies file
│── start.sh                  # Project launch script

⚙️ How It Works?

🔹 System Startup:

The bot starts with start.sh.
Initializes the Telegram Bot API token.
Initializes the Chat ID.
Initializes the AI API Key.
Detects the user's OS (Windows/Linux/macOS).
Loads configuration from config.json.
Launches the hotkey handler.
Waits for a hotkey press (alt+s, ctrl+m, etc.).

🔹 Screenshot Processing:

When a hotkey is pressed, a screenshot is taken and saved in the temporary folder logs/session_temp/.
The image is sent to the AI server for analysis.
The received text response is sent to the Telegram bot.
The screenshot is deleted from the temporary folder.

🔹 Using flameshot:

If flameshot is used, the save path can be set in the configuration.
To disable pop-up notifications about saving, run:

flameshot config

and uncheck Show desktop notifications in the General tab.

🛠️ Installation & Launch

🔄 Launching the Project

bash start.sh

During installation, you will need to provide the API key for the Telegram bot and AI. After installation, you can press the hotkeys, and screenshots will be processed by AI and sent to the Telegram bot.

🔄 Configuration Setup

In config.json, you can edit:

"bot_token": "" - Telegram Bot API token
"chat_id": 0 - Telegram Chat ID (auto-detection)
"language": "" - Language (available: en, ru, de)
"api_key": "" - AI API key (available: OpenAI, DeepSeek)
"model": "" - AI model (OpenAI: gpt-4o-mini, gpt-4o, o3-mini, gpt-4.5-preview, etc.)
"hotkey": "" - Hotkeys (you can specify one or more, e.g., "hotkey": "alt+s, ctrl+m, p, /")
"os": "" - OS (auto-detection)
"method": - Default is "default" — built-in OS screenshot managers are used. Available: "flameshot".

In lang.json, you can edit:

"prompt": "" - Prompt
Add other languages

💀 Roadmap

✅ Improved hotkey processing
✅ Integration with OpenAI
✅ Tested on Ubuntu 24.04 (Wayland)
✅ Flexible configuration via config.json
✅ Added flameshot as an alternative method
🔜 Testing on Windows, MacOS
🔜 Adding Docker
🔜 Integration with other AI models (DeepSeek, Claude, Gemini, etc.)

💰 Donations

USDT (TRC20)/USDC (TRC20): TUQj3sguQjmKFJEMotyb3kERVgnfvhzG7o
SOL (Solana): 6VA9oJbkszteTZJbH6mmLioKTSq4r4E3N1bsoPaxQgr4
XRP (XRP): rDkEZehHFqSjiGdBHsseR64fCcRXuJbgfr

✨ Designed for convenience and quick access to answers.

beesyst / snapansai-bot