LLM-RAG-with-MLDE
LLM RAG System with MLDE
Architecture
Index Generation
Retrieval Index
Showcase
Prerequisite
docker network create rag-system
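If the setup is re-run, creating a network that already exists makes Docker return an error. A minimal sketch of an idempotent variant follows; the helper name `ensure_network` is hypothetical and not part of this repository. It relies only on the `docker network inspect` and `docker network create` subcommands.

```shell
# Hypothetical helper (not part of this repo): create a Docker network
# only if it does not exist yet.
ensure_network() {
    name="$1"
    # `docker network inspect` exits non-zero when the network is missing,
    # in which case the network is created.
    docker network inspect "$name" >/dev/null 2>&1 || docker network create "$name"
}
```

Calling `ensure_network rag-system` should then be safe to repeat.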
How to Run
Build container images
make build
Start LLM
make up-fastchat-controller
make up-fastchat-model-worker
make up-fastchat-api-server
Start Determined cluster
make up-determined
Create Embedding DB
make run-inference
Start RAG System
make up-rag
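The steps above can be chained in one small script. This is only a sketch: the function name `run_all` and the `DRY_RUN` variable are assumptions introduced here, while the make targets and their order are taken from the steps above. It does not wait for services to become ready between steps, so a pause between targets may be needed in practice.

```shell
# Make targets from the "How to Run" steps, in the same order.
TARGETS="build up-fastchat-controller up-fastchat-model-worker up-fastchat-api-server up-determined run-inference up-rag"

# Hypothetical wrapper: runs each target in turn and stops at the first failure.
# With DRY_RUN=1 it only prints the commands it would run.
run_all() {
    for target in $TARGETS; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            echo "make $target"
        else
            make "$target" || return 1
        fi
    done
}
```

Run `DRY_RUN=1 run_all` to preview the sequence, then `run_all` to execute it.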