marcusdelang / SimpleSearchEngine

Simple TF-IDF Search Engine

A search engine that, given a list of documents, supports single-term searches and returns the matching documents sorted by TF-IDF score. Each result also includes the document's TF-IDF score.

Test

cd SimpleSearchEngine
mvn test

Example

The following documents are indexed:
Document 1: “The brown fox jumped over the brown dog.”
Document 2: “The lazy brown dog, sat in the other corner”
Document 3: “The Red Fox bit the lazy dog!”

A search for “brown” should now return the list: [document 1, document 2]
A search for “fox” should return the list: [document 3, document 1]
A search for “dog” should return the list: [document 3, document 1, document 2]
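
The rankings above can be reproduced with a minimal sketch like the one below. It is not the repository's implementation. The class name `TfIdfSketch`, the tokenizer, and the exact formula are assumptions made for illustration: term frequency is taken as occurrences divided by document length, and the inverse document frequency is smoothed as log(1 + N / df) so that a term found in every document (such as “dog”) still gets a non-zero score.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch, not the repository's code.
class TfIdfSketch {

    // Assumed tokenizer: lowercase, then split on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Returns matching document names mapped to their scores, highest score first.
    static Map<String, Double> search(Map<String, String> docs, String term) {
        String needle = term.toLowerCase(Locale.ROOT);

        // Term frequency per matching document: occurrences / number of tokens.
        Map<String, Double> scores = new LinkedHashMap<>();
        for (Map.Entry<String, String> doc : docs.entrySet()) {
            List<String> tokens = tokenize(doc.getValue());
            long count = tokens.stream().filter(needle::equals).count();
            if (count > 0) {
                scores.put(doc.getKey(), (double) count / tokens.size());
            }
        }
        if (scores.isEmpty()) {
            return scores;
        }

        // Assumed smoothed IDF: log(1 + N / df), always positive.
        double idf = Math.log(1.0 + (double) docs.size() / scores.size());
        scores.replaceAll((name, tf) -> tf * idf);

        // Sort by score, descending, and keep that order in the result map.
        List<Map.Entry<String, Double>> entries = new ArrayList<>(scores.entrySet());
        entries.sort(Map.Entry.comparingByValue(Comparator.reverseOrder()));
        Map<String, Double> ranked = new LinkedHashMap<>();
        for (Map.Entry<String, Double> e : entries) {
            ranked.put(e.getKey(), e.getValue());
        }
        return ranked;
    }

    public static void main(String[] args) {
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("document 1", "The brown fox jumped over the brown dog.");
        docs.put("document 2", "The lazy brown dog, sat in the other corner");
        docs.put("document 3", "The Red Fox bit the lazy dog!");

        // Prints each query followed by its ranked documents and scores.
        for (String term : Arrays.asList("brown", "fox", "dog")) {
            System.out.println(term + " -> " + search(docs, term));
        }
    }
}
```

With these assumptions, “brown” ranks document 1 first because the term occurs twice in its eight words, while “fox” and “dog” rank document 3 first because it is the shortest document containing them. With the unsmoothed log(N / df), every score for “dog” would be zero and the order of that result would depend only on tie-breaking.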

Languages

Language:Java 100.0%