athletedecoded / llm-rag

LLM w/ RAG from scratch using Rust

Rusty RAG

LLM w/ RAG from scratch using Rust + Ollama + PoloDB

Setup

# Install ollama
$ make ollama

# Get model and tokenizer files
$ make models

Seed RAG Database

$ cargo run --bin seed

Scan RAG Database

$ cargo run --bin scan

Launch

$ cargo run

Then open http://127.0.0.1:8000/ in a browser.
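
If the page does not load, it can help to confirm that something is actually listening on the port first. The sketch below is not part of this repo: `is_listening` is a hypothetical helper that uses only the Rust standard library, and it only checks that a TCP connection can be opened, not that the RAG service itself is healthy.

```rust
use std::net::{SocketAddr, TcpStream};
use std::time::Duration;

/// Hypothetical helper (not from this repo): returns true if a TCP
/// connection to `addr` (e.g. "127.0.0.1:8000") succeeds within `timeout`.
/// An unparsable address is treated as "not listening".
fn is_listening(addr: &str, timeout: Duration) -> bool {
    match addr.parse::<SocketAddr>() {
        Ok(sock) => TcpStream::connect_timeout(&sock, timeout).is_ok(),
        Err(_) => false,
    }
}
```

For example, `is_listening("127.0.0.1:8000", Duration::from_secs(1))` should return true once `cargo run` has started the service, assuming it binds to the address given above.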

Evaluation

$ cargo run --bin eval

Run Binaries

$ make release
$ cd target/release
$ ./seed # seed DB
$ ./scan # scan DB
$ ./llm-rag # run RAG service @ http://127.0.0.1:8000/
$ ./eval # evaluate

ToDos:

  • Debug RAG retrieval
  • Refactor for binary build
  • Error handling and refactoring
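
For the retrieval item above, one possible debugging aid is a small brute-force reference to compare results against. The sketch below is an assumption about how such a check could look, not the code used in this repo: it ranks in-memory embeddings by cosine similarity using only the standard library, and the names `cosine_similarity` and `top_k` are hypothetical. It does not touch Ollama or PoloDB.

```rust
/// Cosine similarity between two vectors.
/// Returns 0.0 if the lengths differ or either vector has zero norm.
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    if a.len() != b.len() {
        return 0.0;
    }
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return 0.0;
    }
    dot / (norm_a * norm_b)
}

/// Brute-force retrieval: score every (text, embedding) pair against the
/// query and return the `k` highest-scoring texts, best first.
fn top_k<'a>(
    query: &[f32],
    docs: &'a [(String, Vec<f32>)],
    k: usize,
) -> Vec<(&'a str, f32)> {
    let mut scored: Vec<(&str, f32)> = docs
        .iter()
        .map(|(text, emb)| (text.as_str(), cosine_similarity(query, emb)))
        .collect();
    // Sort by descending similarity; total_cmp gives a total order on f32.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(k);
    scored
}
```

Feeding the same query embedding to this function and to the real retrieval path, then comparing the top results, is one way to narrow down whether a problem lies in the embeddings or in the lookup.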

About

License: MIT License


Languages

Rust 73.4%, HTML 21.6%, Makefile 3.2%, Dockerfile 1.8%