Analyzing hotel reviews and performing sentiment analysis using Natural Language Processing.
- TF-IDF vectorizer
Term frequency-inverse document frequency is the text vectorizer used that transforms the text into a usable vector.
- Logistic Regression Classifier
Logistic regression, a supervised learning classification algorithm, is used to predict the probability of the target variable. Here we use binomial logistic regression to predict whether review is 'happy' or 'unhappy'.