NLP & IR
This repository covers some NLP & IR concepts from basic text indexing to advanced text analysis.
- Jaccard_coefficient.py allows to compute Jaccard coefficient between two sentences using unigram, bigram and trigram
- Gamma_code.py compute the gamma code fro a posting list
- Naive Bayes with laplace smoothing given a collection of sentences with lables (binary) return the label of one test sentence computed using Naive Bayes with Laplace smoothing
- Naive Bayes given a collection of sentences with lables (binary) return the label of one test sentence computed using Naive Bayes