shibin-george / elgoog

elgoog is a search & retrieval engine that uses Inverted Index for lookup and Bayesian Inference Network for retrieval

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

elgoog

elgoog is a (basic) document-search and query-retrieval engine that I developed for the Applied Information Retrieval (CS546, Fall-2019) class at UMass, Amherst.

elgoog uses inverted-index for fast query lookup.

It also supports phrase operators (like ordered and unordered windows or exact phrase matches (which is same as ordered window operator with no separation between the terms)) by using a Bayesian inference network model for query retrieval.

elgoog currently uses 4 scoring mechanisms:

  1. raw-count
  2. BM-25
  3. Query likelihood using Jelenik Mercer smoothing
  4. Query likelihood using Dirichlet smoothing

About

elgoog is a search & retrieval engine that uses Inverted Index for lookup and Bayesian Inference Network for retrieval

License:GNU General Public License v3.0


Languages

Language:Java 100.0%