LLocalSearch

What it is

This is a completly locally running meta search engine using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

Here is a video of it in action, running completely locally (don't worry I'm working on a dark mode):

2024-03-29.13-32-38.mp4

signal-2024-03-29-195736_002.mp4

Status

This is a proof of concept, the code is horrible. I didn't intend to make this public yet, but I wanted to share it with a few people. Please open issues and PRs if you have any suggestions.

Features

Completely local (no need for API keys)
Runs on "low end" LLM Hardware (demo video uses a 7b model)
User can see the progress of the agents and understand how the answer was found

Roadmap

Separating "agent updates" / debug information from the final result (something like the langsmith interface?)
Implement a stateful agent chain (so the user can ask follow up questions)
Code refactoring to provide a more solid base for future development and collaboration

How it works

The user query is sent to server
The server starts an agent chain ("imagine LLMs taking with each other")
The agent (in our case starling-lm running on Ollama) will process the query and select one of its tools
- Websearch (using SearXNG) will scrape the top n results from the web (the agent will choose the search query). Remove all unnecessary information (like html tags) and store the result into a vector db
- SearchVectorDB will search the vector db for the most similar results to the query
Step 3 will happen multiple times, with the agent choosing different tools and combining the results
The final result is sent back to the user and displayed in the frontend

While this is happening, the user can see the progress of the agents steps in the frontend.

Self-hosting / Development

Currently, both options are the same. I plan to package this into a single docker image for easier deployment.

Requirements

A running Ollama server somewhere in your network
- an LLM on Ollama
  - (I recommend hermes-2-pro-mistral or starling-lm-7b-beta)
  - Change the model name (used by the search) in the docker-compose file environment variables
- the all-minilm model for embeddings
Docker Compose

Included in the compose file are

search backend (based on the Go Langchain library)
search frontend (svelte & tailwind)
Chroma DB (for storing search results in vector space)
SearXNG (meta search engine used by the agent chain)
Redis (for caching search results)

git clone https://github.com/nilsherzig/LLocalsearch.git
# make sure to check the env vars inside the compose file
# build the containers and start the services
make dev 
# make dev will start the frontend on port 3000. Both front and backend will hot reload on code changes.

About

Apache License 2.0

Languages

Language:Go 55.3%Language:Svelte 34.8%Language:JavaScript 3.2%Language:TypeScript 3.0%Language:Dockerfile 2.2%Language:Makefile 0.8%Language:HTML 0.7%Language:CSS 0.1%