sampathsl / IR_ASSIGNMENT_1_UOM_CS5615

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Information Retrieval Basic Techniques

This Sample project demonstrated the basic IR techniques by using python. (tokenization, isolated word correction, context sensitive word correction, Stemming, and Lemmatization)

Data Files Used

  1. Twitter Feed
  2. Student Course Feedback Data
  3. Research Paper

Libraries & modules are required as prerequisites

  1. Installing pyspellchecker as the isolated word corrector pip install pyspellchecker

  2. Installing the symspellpy module pip install -U symspellpy

  3. Download wordnet lemmarizer by using the python console.

        >> import nltk
        >> nltk.download()
    

About


Languages

Language:Jupyter Notebook 100.0%