JavadZandiyeh / AUT-IR-Projects

AUT Information Retrieval Course Projects

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

AUT-IR-Projects

AUT Information Retrieval Course Projects

  • Preprocessing -- normalizing, tokenizing, stemming, removing stop words
  • Creating the positional index
  • Query processing -- processing "and", "or", and "not" parts
  • Zipf and Heaps laws
  • TF-IDF -- term frequency, inverse document frequency
  • Similarity metrics -- cosine similarity
  • Index elimination -- Champion list
  • Elasticsearch

About

AUT Information Retrieval Course Projects


Languages

Language:Jupyter Notebook 98.4%Language:Python 1.6%