The aim of the code is to present a solution for retrieving specific passages or paragraphs from documents along with the document names based on user queries.
Table of Contents
The retrieval of specific information from documents can be a challenging task, particularly when dealing with complex documents such as policy documents. In these types of documents, there are often multiple sections that contain important information, and it can be time-consuming to search through them manually. This is where the need for an automated solution arises. The aim of our work is to present a solution for retrieving specific passages or paragraphs from documents based on user queries. The solution will involve extracting the text information from each document and then searching for the relevant sections based on user queries
This is an example of how you may give instructions on setting up your project locally. To get a local copy up and running follow these simple example steps.
Before running the code , you need to install the following python libraries:
- nltk
pip install nltk
- re
pip install regex
- glob
pip install glob2
- chardet
pip install chardet
- Clone the repo
git clone https://github.com/github_username/repo_name.git
- Install Python packages (Checkout the Prerequisites)
pip install <Package-name>
- Run the main file using the command
python Main.py
After installing all the packages successfuly and runing the python file, you will have to choose if you want to exit or enter text or a phrase or a wildcard query.
- Stopword Removal
- Stemming
- Return all the paragraphs which contain the phrase or text along with the file name
- Wildcard Query Handling
- Spelling Correction
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
) - Commit your Changes (
git commit -m 'Add some AmazingFeature'
) - Push to the Branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Your Name - Utkarsh Tiwari - utkarsh.samar@gmail.com
Project Link: https://github.com/UtkarshTiwari123/Information-Retrieval-System