suyogdahal / chat-with-multiple-pdfs

A simple application to chat with multiple pdfs at once.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Chat With Multiple Pdfs

Introduction


A simple application to upload multiple pdfs at once and ask questions across them. It will return you the most closest answer along with pinpointing the source of information too.

High-level overview


  1. PyPDF2 for pdf reading/parsing.

  2. Langchain for most of the LLM related tasks like splitting, embedding and semantic search.

  3. FAISS for vectorstore.

  4. Streamlit for putting it all together.

Setup and Installation

  1. Start by cloning the repository to your local machine:
git clone https://github.com/suyogdahal/chat-with-multiple-pdfs.git
cd chat-with-multiple-pdfs
  1. Poetry is being used as the dependency manager in this project. If you don't have it installed, install it from the official Poetry documentation.

  2. Once you have Poetry installed, simply run the following command to install the application dependencies:

poetry install
  1. Activate the Poetry shell to handle dependencies in a virtual environment:
poetry shell
  1. Use the command below to run the app:
streamlit run app.py

Once the application is running, navigate to the localhost URL (usually http://localhost:8501) displayed in your terminal. Now, you are all set to upload your PDFs and chat with them!

About

A simple application to chat with multiple pdfs at once.


Languages

Language:Python 97.2%Language:Dockerfile 2.8%