sprintup / rag

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Build a retrieval augmented generated (RAG) model

Figuring out how to build a RAG model to submit to the below contest (submissions due June 17, 2024). The submission just has to use NVIDIA technologies and Langchain. They'd prefer you to use their endpoints but from what I gather, if you can prove how you use NVIDIA technologies, they'll consider your submission.

source code

alt alt

alt The above diagram is an explainer from databricks about how RAG databases work from the edx course. The include LLM files will only run in databricks.

Misc Resources

Install

  • python -m venv venv_rag
  • source vevn_rag/bin/activate
  • pip install -r requirements.txt

or

pip install streamlit
pip install langchain_nvidia_ai_endpoints
pip install faiss-cpu
pip install langchain
pip install -U langchain-community
pip install unstructured

Start

  • source .env
  • source venv_rag/bin/activate
  • streamlit run main.py

About


Languages

Language:Python 99.8%Language:Dockerfile 0.2%