PanwarJayant / Wikuery

A wikipedia search engine

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Wikuery (Phase-1) by Jayant Panwar-2019114013

A wikipedia search engine

Extra Requirements

  • pyStemmer: fast stemming module
  • xml.sax: scalable XML parser
  • nltk.corpus: for using english stopwords

Contents

  • index.sh: main file for executing the indexing code
  • indexer.py: main python file that handles the flow for indexing
  • handler.py: file handler file that fetches given file paths and writes indexing files
  • wikiProcessor.py: python file for processing a Wikipedia page
  • textProcessor.py: python file for simple text processing

About

A wikipedia search engine


Languages

Language:Python 99.8%Language:Shell 0.2%