rajpratim / nlp

:memo: This repository records my NLP journey.

Home Page: https://www.linkedin.com/in/edwardma1026/

NLP - Tutorial

This repository shows how NLP can tackle real problems. It includes source code, datasets, and state-of-the-art NLP techniques.

Text Preprocessing

Section Sub-Section Description Link
Word Tokenization Tokenize word Medium Github
Part of Speech Medium Github
Lemmatization Medium Github
Stemming Medium Github
Stop Words Medium Github
Sentence Tokenization Medium Github
Phrase Word Recognition
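
The preprocessing steps listed above can be illustrated with a minimal, standard-library-only sketch. It is not taken from the linked notebooks: the function names, the regular expressions, and the tiny `STOP_WORDS` set are assumptions chosen for illustration, and a real pipeline would more likely rely on a dedicated NLP library.

```python
import re

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "to", "and", "in"}


def sentence_tokenize(text):
    """Split text into sentences after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def word_tokenize(sentence):
    """Return lowercase word tokens, keeping an internal apostrophe (e.g. "don't")."""
    return re.findall(r"[a-z0-9]+(?:'[a-z]+)?", sentence.lower())


def remove_stop_words(tokens, stop_words=STOP_WORDS):
    """Drop tokens that appear in the stop-word set."""
    return [t for t in tokens if t not in stop_words]


if __name__ == "__main__":
    for sentence in sentence_tokenize("NLP is fun. It solves real problems!"):
        print(remove_stop_words(word_tokenize(sentence)))
```

The sentence splitter is naive: abbreviations such as "e.g." will be split incorrectly, which is one reason trained tokenizers are usually preferred.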

Text Processing

Section Sub-Section Description Link
Spell Checking Lexicon-based
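
As a rough illustration of lexicon-based spell checking, the sketch below generates every string one edit away from the input and keeps the most frequent candidate found in a lexicon. The `LEXICON` counts, the function names, and the single-edit limit are all assumptions made for this example; it does not reproduce the repository's Spellcheck code.

```python
import string

# Hypothetical lexicon with made-up frequency counts (illustration only).
LEXICON = {"language": 50, "natural": 40, "processing": 30, "process": 20}


def edits1(word):
    """Return all strings one delete, transpose, replace, or insert away from word."""
    letters = string.ascii_lowercase
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [left + right[1:] for left, right in splits if right]
    transposes = [left + right[1] + right[0] + right[2:]
                  for left, right in splits if len(right) > 1]
    replaces = [left + c + right[1:] for left, right in splits if right for c in letters]
    inserts = [left + c + right for left, right in splits for c in letters]
    return set(deletes + transposes + replaces + inserts)


def correct(word, lexicon=LEXICON):
    """Return word if known, else the most frequent known word one edit away."""
    word = word.lower()
    if word in lexicon:
        return word
    candidates = [w for w in edits1(word) if w in lexicon]
    if not candidates:
        return word  # nothing close enough; leave the input unchanged
    return max(candidates, key=lexicon.get)


if __name__ == "__main__":
    print(correct("langauge"))
    print(correct("procesing"))
```

Limiting the search to one edit keeps the candidate set small; words with two or more errors are returned unchanged in this sketch.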

Information Retrieval

Section Sub-Section Description Link
Pattern-based Recognition Medium
Lexicon-based Recognition Medium
Named Entity Recognition (NER) Pre-trained NER Medium Github
Custom NER
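
Pattern-based and lexicon-based recognition can both be sketched with Python's `re` module. The email pattern and the small `ENTITY_LEXICON` below are hypothetical examples, not the patterns or word lists used in the linked articles, and the email pattern is intentionally simpler than a full address grammar.

```python
import re

# Illustrative pattern for simple email addresses; not a complete validator.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Hypothetical lexicon mapping surface forms to entity labels.
ENTITY_LEXICON = {"python": "LANGUAGE", "new york": "LOCATION", "github": "ORG"}


def find_emails(text):
    """Pattern-based recognition: return every substring matching EMAIL_PATTERN."""
    return EMAIL_PATTERN.findall(text)


def find_lexicon_entities(text, lexicon=ENTITY_LEXICON):
    """Lexicon-based recognition: return (start, matched_text, label) sorted by position."""
    found = []
    for term, label in lexicon.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            found.append((match.start(), match.group(0), label))
    return sorted(found)


if __name__ == "__main__":
    print(find_emails("Contact alice@example.com for details."))
    print(find_lexicon_entities("I push Python code to GitHub from New York."))
```

Both functions only find what the pattern or lexicon anticipates, which is the usual motivation for moving on to a trained NER model.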

Text Summarization

Section Sub-Section Description Link
Extractive Approach Medium Github
Abstractive Approach
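
One simple way to picture the extractive approach is to score each sentence by the average corpus frequency of its words and keep the top-scoring sentences in their original order. This frequency heuristic, the `summarize` name, and the `max_sentences` parameter are assumptions for illustration; the linked article may use a different method.

```python
import re
from collections import Counter


def summarize(text, max_sentences=1):
    """Return the highest-scoring sentences, preserving their original order."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    # Word frequencies over the whole text.
    freq = Counter(re.findall(r"[a-z']+", text.lower()))

    def score(sentence):
        tokens = re.findall(r"[a-z']+", sentence.lower())
        # Average frequency, so long sentences are not favoured automatically.
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return [sentences[i] for i in keep]


if __name__ == "__main__":
    doc = "Cats sleep. Cats chase mice and cats climb trees. Dogs bark."
    print(summarize(doc, max_sentences=2))
```

Because the output is copied verbatim from the input, this is extractive by construction; an abstractive system would generate new sentences instead.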

Distance Measurement

Section Sub-Section Description Link
Euclidean Distance, Cosine Similarity and Jaccard Similarity Medium Github
Edit Distance Medium Github
Word Mover's Distance (WMD) Medium Github
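
The first four measures in this table have short textbook definitions, sketched below in plain Python. The function names are chosen for this example, and the edit distance shown is the Levenshtein variant (insert, delete, substitute, each at cost 1). Word Mover's Distance is omitted because it needs word embeddings and an optimal-transport solver.

```python
import math


def euclidean_distance(u, v):
    """Straight-line distance between two equal-length numeric vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def cosine_similarity(u, v):
    """Cosine of the angle between u and v; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def jaccard_similarity(a, b):
    """Size of the intersection divided by size of the union of two token sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # convention used here: two empty sets are identical
    return len(a & b) / len(a | b)


def edit_distance(s, t):
    """Levenshtein distance computed row by row with dynamic programming."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, 1):
        curr = [i]
        for j, ct in enumerate(t, 1):
            cost = 0 if cs == ct else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(euclidean_distance([0, 0], [3, 4]))   # 5.0
    print(edit_distance("kitten", "sitting"))   # 3
```

Note that Euclidean and edit distance grow as items become less alike, while cosine and Jaccard are similarities that grow as items become more alike.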

Vector Representation

| Section | Sub-Section | Description | Link |
| --- | --- | --- | --- |
| Traditional Method | Bag-of-words (BoW) | | Medium Github |
| | Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) | | Medium Github |
| Character Level | Character Embedding | | Medium Github |
| Word Level | Negative Sampling and Hierarchical Softmax | | |
| | Word2Vec, GloVe, fastText | | Medium Github |
| | Contextualized Word Vectors (CoVe) | | Medium Github |
| | Embeddings from Language Models (ELMo) | | Medium Github |
| Sentence Level | Skip-thoughts | | Medium Github |
| | InferSent | | Medium Github |
| Document Level | lda2vec | | Medium |
| | doc2vec | | |
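
Of the representations listed above, bag-of-words is the simplest to show without external dependencies. The sketch below is an assumption-laden illustration (whitespace tokenization, lowercase folding, hypothetical function names), not the code from the linked notebook; the embedding methods in the table require trained models and are not sketched here.

```python
from collections import Counter


def build_vocabulary(documents):
    """Map each distinct lowercase token to a column index, in sorted order."""
    tokens = sorted({tok for doc in documents for tok in doc.lower().split()})
    return {tok: i for i, tok in enumerate(tokens)}


def bag_of_words(document, vocabulary):
    """Return a count vector for document; tokens outside the vocabulary are ignored."""
    counts = Counter(document.lower().split())
    vector = [0] * len(vocabulary)
    for token, n in counts.items():
        if token in vocabulary:
            vector[vocabulary[token]] = n
    return vector


if __name__ == "__main__":
    docs = ["the cat sat", "the dog sat on the mat"]
    vocab = build_vocabulary(docs)
    for doc in docs:
        print(bag_of_words(doc, vocab))
```

Word order is discarded entirely, which is the main limitation the word-, sentence-, and document-level embeddings in this table try to address.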

Model Interpretation

Section Sub-Section Description Link
ELI5, LIME and Skater Medium Github
SHapley Additive exPlanations (SHAP) Medium Github
Anchors Medium Github

Myth

Section Sub-Section Description Link
Can deep learning resolve every problem? Medium Kaggle

Source Code

Section Sub-Section Description Link
Spellcheck Github
InferSent Github

Languages

Python 60.0%, Jupyter Notebook 36.9%, sed 1.7%, Shell 1.5%