Build a retrieval augmented generated (RAG) model

Figuring out how to build a RAG model to submit to the below contest (submissions due June 17, 2024). The submission just has to use NVIDIA technologies and Langchain. They'd prefer you to use their endpoints but from what I gather, if you can prove how you use NVIDIA technologies, they'll consider your submission.

The above diagram is an explainer from databricks about how RAG databases work from the edx course. The include LLM files will only run in databricks.

2.7.1 Notebook Demo Part 1

the source code for the notebook is found here

Misc Resources

Install

python -m venv venv_rag
source vevn_rag/bin/activate
pip install -r requirements.txt

or

pip install streamlit
pip install langchain_nvidia_ai_endpoints
pip install faiss-cpu
pip install langchain
pip install -U langchain-community
pip install unstructured

sprintup / rag

Build a retrieval augmented generated (RAG) model

Build a Retrieval-Augmented Generation Chatbot in 5 Minutes

Large Language Models: Application through Production

2.7.1 Notebook Demo Part 1

Misc Resources

Install

Start

About

Languages