riya-joshi-401 / Sentiment-analysis-on-Disney-land-reviews

EDA and sentiment analysis on disney land reviews

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Sentiment-analysis-on-Disney-land-reviews

  • The dataset includes 42656 reviews of 3 Disneyland branches - Paris, California and Hong Kong, posted by visitors on Trip Advisor.

Column Description:

  • Review_ID: unique id given to each review
  • Rating: ranging from 1 (unsatisfied) to 5 (satisfied)
  • Year_Month: when the reviewer visited the theme park
  • Reviewer_Location: country of origin of visitor
  • Review_Text: comments made by visitor
  • Disneyland_Branch: location of Disneyland Park

  • On this dataset I have performed EDA and sentiment analysis of reviews using metrics like Sentiment Polarity and VADER Polarity in NLP and created wordclouds for better visulaization.This processed data is then feeded to different classifier models to get trained and predict the sentiment of the test reviews.

    Models used:
  • XGBoost - Extreme Gradient Boost alsorithm is based on the Gradient Boosting model which uses the boosting technique of ensemble learning where the underfitted data of the weak learners are passed on to the strong learners to increase the strength and accuracy of the model.
  • Decision Tree - This algorithm works on the basis of creating tree structures to take decisions
  • Random Forest - This algorithm works on the concept of emsemble learning.It used bagging technique to train multiple predictors on the same sampled instances to achieve a higher degree of accuracy.

  • XGBoost gave highest accuracy among all . Test accuracy : 83%

    About

    EDA and sentiment analysis on disney land reviews

    License:GNU General Public License v3.0


    Languages

    Language:Jupyter Notebook 100.0%