SarCode / TripAdvisor

Scoring Reviewer

TripAdvisor-Scoring Reviewer

The dataset used in this project includes quantitative and categorical features from online reviews of 21 hotels located on the Las Vegas Strip, extracted from TripAdvisor.

The dataset contains 504 records and 20 tuned features. There are 24 records per hotel (two per month, randomly selected), all from the year 2015. The CSV file has a header row whose column names correspond to the features.

Several columns hold categorical variables with text labels. Most machine learning algorithms in Python accept only numeric data, so the categorical variables are encoded as numeric labels.
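As a minimal sketch of this encoding step, the snippet below maps a text column to integer codes with scikit-learn's LabelEncoder. The sample rows and the name of the encoded column are made up for illustration and are not taken from the real CSV.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Hypothetical sample rows: the column names mirror the feature list,
# but the values are illustrative and not taken from the real dataset.
df = pd.DataFrame({
    "Traveler type": ["Friends", "Business", "Families", "Friends", "Solo"],
    "Helpful votes": [13, 4, 27, 8, 2],
})

encoder = LabelEncoder()
# fit_transform learns the distinct labels (in sorted order) and
# replaces each label with its integer position in that order.
df["Traveler type (encoded)"] = encoder.fit_transform(df["Traveler type"])

# classes_ holds the original labels; index i is the code for label i.
label_to_code = {str(label): code for code, label in enumerate(encoder.classes_)}
print(label_to_code)
print(df)
```

The integer codes follow the sorted order of the labels, so they carry no meaning beyond identity. Keeping the fitted encoder around allows the codes to be mapped back with inverse_transform.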

Most scoring features

Nr.reviews
Nr.hotel reviews
Helpful votes
Traveler type
Swimming Pool
Exercise Room
Basketball Court
Yoga Classes
Club
Free Wifi
Hotel Name
Hotel stars
Nr.rooms
Member years

Accuracy

Using KNN (K-Nearest Neighbours), the accuracy is 40.59%.
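For readers unfamiliar with how such an accuracy figure is obtained, here is a rough sketch of a KNN evaluation with scikit-learn. It runs on synthetic stand-in data, so it will not reproduce the figure above. The 1 to 5 score target, the 80/20 split, and n_neighbors=5 are all assumptions made for the example, not settings confirmed by this project.

```python
import numpy as np
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)

# Synthetic stand-in for the encoded dataset: 504 rows, 5 numeric features.
X = rng.normal(size=(504, 5))
# Assumed target: a review score from 1 to 5 (random here, for illustration).
y = rng.integers(1, 6, size=504)

# Hold out 20% of the rows for testing (assumed split ratio).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Each test row is classified by majority vote of its 5 nearest training rows.
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_train, y_train)
y_pred = knn.predict(X_test)

# Accuracy is the fraction of test rows whose predicted class is correct.
accuracy = accuracy_score(y_test, y_pred)
print(f"Accuracy: {accuracy:.2%}")
```

Because KNN relies on distances, features on very different scales (for example Nr.rooms versus Hotel stars) can dominate the vote, so scaling the features before fitting is a common precaution.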

Language and Software

Python
Spyder

Libraries

NumPy
pandas

Data Cleaning

LabelEncoder
OneHotEncoder
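Integer labels imply an ordering that unordered categories do not have, which is one common reason to follow label encoding with one-hot encoding. The sketch below shows what OneHotEncoder produces for a single categorical column. The sample values are illustrative only, and handle_unknown="ignore" is an assumed setting.

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder

# Illustrative values for one categorical column (not from the real dataset).
# OneHotEncoder expects a 2D array: one row per record, one column per feature.
traveler_type = np.array([["Friends"], ["Business"], ["Families"], ["Friends"]])

# handle_unknown="ignore" encodes unseen categories as all zeros at transform time.
encoder = OneHotEncoder(handle_unknown="ignore")

# The default output is a sparse matrix; toarray() makes it dense for printing.
one_hot = encoder.fit_transform(traveler_type).toarray()

# categories_[0] lists the categories of the first column, in column order.
categories = [str(c) for c in encoder.categories_[0]]
print(categories)
print(one_hot)
```

Each record ends up with exactly one 1 among the new columns, so distance-based models such as KNN treat every pair of distinct categories as equally far apart.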
