Needed modules: pip install pdfminer pip install nltk pip install numpy pip install sklearn
Compares PDF documents and visualizes similarity using graph. Documents are represented as TF-IDF vector and their similarity is based on cosinus similarity. Visualization is done using Python's library Dash.
Needed modules: pip install pdfminer pip install nltk pip install numpy pip install sklearn
Compares PDF documents and visualizes similarity using graph. Documents are represented as TF-IDF vector and their similarity is based on cosinus similarity. Visualization is done using Python's library Dash.
MIT License