collaborative-filtering knnwithmeans mae product-recommender python recommender recommender-system rmse svd svd-matrix-factorisation

Exploring-Product-Recommendation-Systems

pandas: Go to Pandas Installation or use command: pip install pandas
numpy: Go to NumPy Installation or use command: pip install numpy
matplotlib: Go to Matplotlib Installation or use command: pip install matplotlib
seaborn: Go to Seaborn Installation or use command: pip install seaborn
scikit-learn: Go to Scikit-Learn Installation or use command: pip install scikit-learn
surprise: Go to Surprise Installation or use command: pip install scikit-surprise`

EDA Steps

Data loading and initial exploration
Data cleaning and manipulation
Checking for missing values and duplicates
Analyzing the distribution of product ratings

Data Preprocessing Steps and Inspiration

Handling Missing Values: Checked for missing values and duplicates in the dataset.
Subset Selection: Selected a subset of the dataset for analysis to optimize performance.

Graphs

Recommendation Techniques

Popularity-Based Recommender: Recommends products based on their popularity (number of ratings).
Collaborative Filtering:

a. User-Based Filtering:

KNNBasic: A basic K-nearest neighbors algorithm for collaborative filtering based on user similarities.
KNNWithMeans: An enhanced K-nearest neighbors algorithm that takes into account the mean ratings of users for better predictions.

b. Item-Based Filtering:

KNNBasic: A basic K-nearest neighbors algorithm for collaborative filtering based on item similarities.
KNNWithMeans: An enhanced K-nearest neighbors algorithm that takes into account the mean ratings of items for better predictions.

c. Matrix Factorization (SVD): Uses Singular Value Decomposition to predict user ratings for products based on past user ratings.

Assumptions

Ratings provided by users are reliable.
User preferences are consistent over time.
Products with higher ratings are preferred by users.

Evaluation Metrics

RMSE (Root Mean Square Error) was used to evaluate the performance of different models.

Results

Top 30 products as per popularity recommender

SVD was the best model with the least RMSE of 0.898

Top 5 products for a given user(an example) with SVD

Top 5 products similar to a given product(an example) with SVD

Recommendations

Further data collection and feature engineering could improve recommendation accuracy.
Regularly updating the model with new product data can help maintain recommendation relevance.
Implementing user feedback mechanisms to continuously improve recommendations.

Limitations

The dataset may contain biases that could affect the recommendations.
The recommendation performance is limited by the quality and quantity of the available data.

Future Possibilities of the Project

Exploring additional recommendation algorithms and ensemble methods.
Implementing deep learning models for better performance.
Developing real-time recommendation systems based on user interactions.

References

About

Product Recommender

collaborative-filtering knnwithmeans mae product-recommender python recommender recommender-system rmse svd svd-matrix-factorisation

Languages

Language:Jupyter Notebook 100.0%

tgchacko / Exploring-Product-Recommendation-Systems

Exploring-Product-Recommendation-Systems

Table of Contents

Project Overview

Data Sources

Data Description

Attributes

Summary

Tools