This project was done for the NUS module DSA4262, which focuses on projects centered on medicine and healthcare. I wrote an article on this topic; here is a link.
- This is a natural language processing project
- Each review has a rating from 1 to 10
- The objective is to classify other reviews as positive or negative
- Tokenizing (Cleaning/Preprocessing)
- Exploratory Data Analysis (Exploration)
- Stemming (Cleaning/Preprocessing)
- Lemmatizing (Cleaning/Preprocessing)
- Vectorizing/OneHotEncoding (Cleaning/Preprocessing)
- Modelling (Machine Learning)
- Predicting (Machine Learning)
- Exploration of Final Model (Exploration)
- Feature Analysis (Exploration)
- Logistic Regression
- Decision Trees
- XGBoost
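
To make the workflow above concrete, here is a minimal, dependency-free sketch of a few of the steps: turning a 1-10 rating into a positive/negative label, tokenizing, bag-of-words vectorizing, and fitting a small logistic regression. It is not the project's actual code. The toy reviews, the cut-off of 7 for "positive", and all function names are assumptions made for illustration. Stemming and lemmatizing are left out because they would typically rely on a library such as NLTK, and the logistic regression is a hand-written gradient descent rather than a library model.

```python
import math
import re

# Hypothetical toy reviews with 1-10 ratings; not taken from the project's dataset.
REVIEWS = [
    ("This medication worked great and I feel much better", 9),
    ("Great relief, no side effects at all", 10),
    ("Terrible side effects, felt awful and stopped taking it", 2),
    ("Awful headaches, this made me feel worse", 1),
]

POSITIVE_THRESHOLD = 7  # assumed cut-off; the README does not state one


def label_from_rating(rating, threshold=POSITIVE_THRESHOLD):
    """Map a 1-10 rating to 1 (positive) or 0 (negative)."""
    return 1 if rating >= threshold else 0


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocabulary(token_lists):
    """Assign a column index to every distinct token, in sorted order."""
    vocab = sorted({tok for toks in token_lists for tok in toks})
    return {tok: i for i, tok in enumerate(vocab)}


def vectorize(tokens, vocabulary):
    """Turn a token list into a bag-of-words count vector."""
    row = [0] * len(vocabulary)
    for tok in tokens:
        if tok in vocabulary:  # unseen words are ignored
            row[vocabulary[tok]] += 1
    return row


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic_regression(X, y, learning_rate=0.5, epochs=200):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_features = len(X[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for row, target in zip(X, y):
            score = bias + sum(w * x for w, x in zip(weights, row))
            error = sigmoid(score) - target
            for j, x in enumerate(row):
                grad_w[j] += error * x
            grad_b += error
        weights = [w - learning_rate * g / len(X) for w, g in zip(weights, grad_w)]
        bias -= learning_rate * grad_b / len(X)
    return weights, bias


def predict(text, vocabulary, weights, bias):
    """Return 1 if the review is predicted positive, else 0."""
    row = vectorize(tokenize(text), vocabulary)
    score = bias + sum(w * x for w, x in zip(weights, row))
    return 1 if sigmoid(score) >= 0.5 else 0


token_lists = [tokenize(text) for text, _ in REVIEWS]
vocabulary = build_vocabulary(token_lists)
X = [vectorize(tokens, vocabulary) for tokens in token_lists]
y = [label_from_rating(rating) for _, rating in REVIEWS]
weights, bias = train_logistic_regression(X, y)

print(predict("great relief", vocabulary, weights, bias))
```

The learned `weights` line up with the entries of `vocabulary`, so sorting words by weight gives a rough view of which tokens push a review towards positive or negative, which is the idea behind the feature analysis step.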