erikapaceep / NLP

Basic algorithms for NLP

NLP & IR

This repository covers some NLP & IR concepts from basic text indexing to advanced text analysis.

  • Jaccard_coefficient.py computes the Jaccard coefficient between two sentences using unigrams, bigrams, and trigrams.
  • Gamma_code.py computes the gamma code for a posting list.
  • Naive Bayes with Laplace smoothing: given a collection of sentences with binary labels, returns the label of a test sentence computed using Naive Bayes with Laplace smoothing.
  • Naive Bayes: given a collection of sentences with binary labels, returns the label of a test sentence computed using Naive Bayes.
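
The techniques listed above can be illustrated with a minimal sketch. This is not the code from the repository: every function name here (`ngrams`, `jaccard`, `gamma_code`, `gamma_encode_postings`, `naive_bayes_predict`) is hypothetical. It also assumes whitespace tokenization, a multinomial Naive Bayes model, and gamma encoding applied to the gaps between document IDs rather than to the IDs themselves; the repository may make different choices.

```python
import math
from collections import Counter


def ngrams(sentence, n):
    """Return the set of word n-grams in a sentence (lowercased, whitespace-tokenized)."""
    tokens = sentence.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard(a, b, n=1):
    """Jaccard coefficient |A & B| / |A | B| over the word n-gram sets of two sentences."""
    set_a, set_b = ngrams(a, n), ngrams(b, n)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def gamma_code(k):
    """Elias gamma code of a positive integer: offset length in unary, then the offset."""
    if k < 1:
        raise ValueError("gamma codes are defined for positive integers only")
    offset = bin(k)[3:]  # binary representation with the leading 1 removed
    return "1" * len(offset) + "0" + offset


def gamma_encode_postings(doc_ids):
    """Gamma-encode the gaps of a strictly increasing list of positive document IDs."""
    if not doc_ids:
        return ""
    gaps = [doc_ids[0]] + [b - a for a, b in zip(doc_ids, doc_ids[1:])]
    return "".join(gamma_code(g) for g in gaps)


def naive_bayes_predict(train, test_sentence, alpha=1.0):
    """Multinomial Naive Bayes over (sentence, label) pairs.

    alpha=1 gives Laplace (add-one) smoothing; alpha=0 disables smoothing.
    """
    class_docs = Counter(label for _, label in train)
    word_counts = {label: Counter() for label in class_docs}
    for sentence, label in train:
        word_counts[label].update(sentence.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}

    scores = {}
    for label, n_docs in class_docs.items():
        total = sum(word_counts[label].values())
        denom = total + alpha * len(vocab)
        score = math.log(n_docs / len(train))  # log prior
        for word in test_sentence.lower().split():
            if word not in vocab:
                continue  # assumption: words unseen in training are ignored
            p = (word_counts[label][word] + alpha) / denom if denom else 0.0
            score += math.log(p) if p > 0 else float("-inf")
        scores[label] = score
    return max(scores, key=scores.get)
```

For example, `jaccard("the cat sat", "the cat ran", n=2)` compares the bigram sets of the two sentences, and `naive_bayes_predict(train, "great film", alpha=0)` shows what happens without smoothing: a single word unseen in a class drives that class's score to negative infinity.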

About

License: MIT License


Languages

Jupyter Notebook 99.5%, Python 0.5%