mehulraj19 / toxicCommentClassification

toxicCommentClassification

This is my academic project, in which I used different ML algorithms for text classification. I used TF-IDF vectorizers to turn the text data into numeric features that are then fed to the models. I built one vectorizer over words and one over characters, then combined the two into a union, which is what finally vectorizes the data. The vectorization step also removes stop words.
The data has been taken from here.
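
The word and character vectorizer union described above can be sketched roughly as follows. This is a minimal illustration using scikit-learn's `TfidfVectorizer` and `FeatureUnion`, not the exact code from the notebook: the toy comments, the n-gram range, and the `stop_words="english"` setting are all assumptions made for the example.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import FeatureUnion

# Hypothetical toy comments; the project's real dataset is not shown here.
comments = [
    "You are a wonderful person",
    "I hate you, you idiot",
    "Thanks for the helpful edit",
    "This is the worst article ever",
]

# Word-level TF-IDF; stop-word removal happens here (assumed setting).
word_vectorizer = TfidfVectorizer(analyzer="word", stop_words="english")

# Character-level TF-IDF over short character n-grams (assumed range).
char_vectorizer = TfidfVectorizer(analyzer="char", ngram_range=(2, 4))

# The union concatenates both feature blocks column-wise.
vectorizer = FeatureUnion([("word", word_vectorizer), ("char", char_vectorizer)])
features = vectorizer.fit_transform(comments)

# Look up the fitted vectorizers to inspect their vocabularies.
fitted = dict(vectorizer.transformer_list)
n_word = len(fitted["word"].vocabulary_)
n_char = len(fitted["char"].vocabulary_)
print(features.shape, n_word, n_char)
```

The resulting sparse matrix has one row per comment and one column per word or character n-gram, so any of the models listed below can be trained on it directly.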

Models implemented:

  • MultinomialNB
  • RandomForestClassifier
  • LogisticRegression
  • SGDClassifier
  • Perceptron
  • MLPClassifier
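
As a rough sketch of how the models listed above could be trained on TF-IDF features, the loop below fits each scikit-learn estimator on the same matrix. The toy texts, the single binary label, and all hyperparameters are assumptions for illustration only; the notebook's actual settings and label layout may differ.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression, Perceptron, SGDClassifier
from sklearn.naive_bayes import MultinomialNB
from sklearn.neural_network import MLPClassifier

# Hypothetical toy data: 1 marks a toxic comment, 0 a non-toxic one.
texts = [
    "you are an idiot",
    "thank you for the edit",
    "shut up, stupid fool",
    "great article, well written",
    "I hate you so much",
    "please see the talk page",
    "what a stupid idiot",
    "thanks, this was helpful",
]
labels = [1, 0, 1, 0, 1, 0, 1, 0]

# A plain word-level TF-IDF is used here to keep the sketch short.
X = TfidfVectorizer().fit_transform(texts)

# Hyperparameters are illustrative assumptions, not the notebook's values.
models = {
    "MultinomialNB": MultinomialNB(),
    "RandomForestClassifier": RandomForestClassifier(n_estimators=50, random_state=0),
    "LogisticRegression": LogisticRegression(max_iter=1000),
    "SGDClassifier": SGDClassifier(random_state=0),
    "Perceptron": Perceptron(random_state=0),
    "MLPClassifier": MLPClassifier(hidden_layer_sizes=(16,), max_iter=500, random_state=0),
}

# Fit every model on the same features and collect its predictions.
predictions = {}
for name, model in models.items():
    model.fit(X, labels)
    predictions[name] = model.predict(X)
```

Predicting on the training data, as done here, only shows that each model runs end to end; a real comparison would need a held-out test split.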

Languages

Language:Jupyter Notebook 100.0%