Ananthu27 / NLP-MTECH

NLP-MTECH

M.Tech summer semester NLP project. TOPIC: Apply various techniques (data cleaning, data reduction, pre-processing, feature selection, feature engineering, anomaly detection, data visualization, data type conversion, handling missing data, filtering outliers) to any text data.

PREPROCESSING DONE -> Lowercasing, Stemming, Removing Stopwords, Lemmatization, Part-of-Speech Tagging, and finally Tokenization.
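
As a rough illustration of a few of the steps listed above, here is a minimal sketch, not the repository's actual code. It assumes NLTK is installed and uses its `PorterStemmer`, which needs no extra data download. The `STOPWORDS` set and the `preprocess` function are hypothetical names introduced only for this example. Lemmatization and part-of-speech tagging are left out because they require downloaded NLTK resources, and the step order shown is one common arrangement, not necessarily the one the project uses.

```python
import re

from nltk.stem import PorterStemmer

# Tiny illustrative stopword list (an assumption for this sketch);
# a real project would more likely use NLTK's stopwords corpus.
STOPWORDS = {"a", "an", "the", "is", "are", "was", "were",
             "and", "or", "of", "to", "in", "on"}

_stemmer = PorterStemmer()


def preprocess(text):
    """Lowercase, tokenize, drop stopwords, and stem a piece of text."""
    lowered = text.lower()
    # Simple regex tokenization: keep runs of ASCII letters only.
    tokens = re.findall(r"[a-z]+", lowered)
    kept = [t for t in tokens if t not in STOPWORDS]
    return [_stemmer.stem(t) for t in kept]


print(preprocess("The cats are running in the gardens"))
```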

CLASSIFICATION TECHNIQUES USED -> Term Frequency-Inverse Document Frequency (TF-IDF) for feature extraction, Support Vector Machine (SVM) as the classifier.
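
One way TF-IDF and an SVM might be combined is sketched below with scikit-learn. This is an assumption about tooling: the description does not say which library or SVM variant the project uses. The toy `texts` and `labels` are made up for illustration, and `LinearSVC` is chosen here simply because linear SVMs are a common fit for sparse TF-IDF features.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy data invented for this sketch; not from the project.
texts = [
    "the team won the football match",
    "the striker scored a late goal",
    "the new laptop has a fast processor",
    "the software update fixed the bug",
]
labels = ["sports", "sports", "tech", "tech"]

# TF-IDF turns each document into a weighted term vector;
# the linear SVM then learns a separating hyperplane over those vectors.
model = make_pipeline(
    TfidfVectorizer(lowercase=True, stop_words="english"),
    LinearSVC(),
)
model.fit(texts, labels)

prediction = model.predict(["the goalkeeper saved the match"])
print(prediction[0])
```

Wrapping both steps in a single pipeline keeps the vectorizer's vocabulary tied to the training data, so new text is transformed the same way at prediction time.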

DATA VISUALIZATION TECHNIQUES USED -> Topic Modeling Visualization, Word Cloud, Box Plots.
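
The numbers behind two of these plots can be computed with the standard library alone. The sketch below is illustrative only and uses made-up `docs`: word frequencies are the quantity a word cloud scales its words by, and the quartiles of document length are what a box plot draws. The actual plotting library used by the project is not stated, so no plotting calls are shown.

```python
import statistics
from collections import Counter

# Toy documents invented for this sketch; not from the project.
docs = [
    "the team won the football match",
    "the striker scored a late goal",
    "the new laptop has a fast processor",
    "the software update fixed the bug",
]

tokens_per_doc = [d.split() for d in docs]

# Word cloud input: how often each word occurs across all documents.
word_freq = Counter(t for toks in tokens_per_doc for t in toks)

# Box plot input: quartiles of the per-document token counts.
lengths = [len(toks) for toks in tokens_per_doc]
q1, median, q3 = statistics.quantiles(lengths, n=4)

print(word_freq.most_common(3))
print(lengths, q1, median, q3)
```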

Languages

Language:Python 100.0%