suvrashaw / Predicting-Kickstarter-Project-Success

We predict whether a Kickstarter project proposal will reach its fundraising goal, using launch data from 200k+ proposals scraped from Kickstarter.

Home Page: https://share.streamlit.io/suvrashaw/predicting-kickstarter-project-success/prime/app.py


Predicting Kickstarter Project Success


We predict whether a Kickstarter project proposal will meet its fundraising goal, using only information available at project launch. The data consists of 220,000 project proposals scraped from Kickstarter. We compare how well different machine learning models make this prediction from the project category, the fundraising goal, and the short product description.
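As a rough illustration of how the three launch-time inputs could feed a single model, here is a minimal sketch using scikit-learn. The column names (`category`, `goal`, `blurb`, `success`), the toy rows, and the choice of encoders are assumptions made for this example; they are not taken from the project's notebook.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Invented toy rows; the column names are hypothetical.
toy = pd.DataFrame({
    "category": ["games", "music", "games", "film", "music", "film"],
    "goal": [5000.0, 1200.0, 80000.0, 15000.0, 900.0, 250000.0],
    "blurb": [
        "a cooperative card game for families",
        "debut folk album recorded live",
        "open world space simulator",
        "short documentary about city gardens",
        "vinyl pressing of our first record",
        "feature length science fiction epic",
    ],
    "success": [1, 1, 0, 1, 1, 0],
})

# One transformer per input type: one-hot for the category,
# scaling for the goal, TF-IDF for the short description.
features = ColumnTransformer([
    ("category", OneHotEncoder(handle_unknown="ignore"), ["category"]),
    ("goal", StandardScaler(), ["goal"]),
    ("blurb", TfidfVectorizer(), "blurb"),  # a plain string selects a 1-D text column
])

model = Pipeline([
    ("features", features),
    ("clf", LogisticRegression(max_iter=1000)),
])

X_toy = toy.drop(columns="success")
model.fit(X_toy, toy["success"])

# Predicted probability of success for each toy proposal.
proba = model.predict_proba(X_toy)[:, 1]

# A new proposal with a category that was not seen during fitting.
new_proposal = pd.DataFrame({
    "category": ["art"],
    "goal": [3000.0],
    "blurb": ["hand printed poster series"],
})
new_proba = model.predict_proba(new_proposal)[:, 1]
```

Because `handle_unknown="ignore"` is set on the encoder, an unseen category is encoded as all zeros rather than raising an error.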

Sample Data: Download from here.


Blueprint

data

EDA: Some snaps from the notebook

[Ten notebook screenshots, numbered 0 to 9]

Modelling

The baseline model is Logistic Regression. It achieved a precision of 0.9853, a recall of 0.966, an F1 score of 0.9759, and an AUC score of 0.9828. We also plotted the ROC curve (FPR vs. TPR), which gave a train AUC of 0.9792 and a test AUC of 0.9796, and checked the confusion matrix on both the train and test sets. The two models selected for performance are Decision Tree and Gradient Boosting Decision Tree (GBDT):
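The baseline evaluation described above can be sketched as follows. This is a minimal example on synthetic data from `make_classification`, not the Kickstarter data, so its scores will not match the figures reported here; the split ratio and `random_state` values are arbitrary assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (
    confusion_matrix,
    f1_score,
    precision_score,
    recall_score,
    roc_auc_score,
    roc_curve,
)
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the project features and success labels.
X, y = make_classification(n_samples=2000, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

baseline = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Hard predictions for precision/recall/F1, probabilities for AUC.
y_pred = baseline.predict(X_test)
train_scores = baseline.predict_proba(X_train)[:, 1]
test_scores = baseline.predict_proba(X_test)[:, 1]

metrics = {
    "precision": precision_score(y_test, y_pred),
    "recall": recall_score(y_test, y_pred),
    "f1": f1_score(y_test, y_pred),
    "train_auc": roc_auc_score(y_train, train_scores),
    "test_auc": roc_auc_score(y_test, test_scores),
}

# Points of the ROC curve (false positive rate vs. true positive rate).
fpr, tpr, _ = roc_curve(y_test, test_scores)

# Rows are true classes, columns are predicted classes.
cm_train = confusion_matrix(y_train, baseline.predict(X_train))
cm_test = confusion_matrix(y_test, y_pred)
```

The `fpr` and `tpr` arrays can be passed to `matplotlib.pyplot.plot` to draw the curve, and comparing `train_auc` with `test_auc` gives a quick check for overfitting.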

  • Decision Tree with hyperparameter tuning (GridSearchCV):
    1. Using GridSearchCV on the Decision Tree, we found the best parameters to be max_depth = 10 and min_samples_split = 100.
    2. We then plotted a heatmap of the search results, grouped by max_depth and min_samples_split.
    3. With these parameters, the model reached a root_mean_square_error of 0.07 and an accuracy_score of 0.9779.
    4. We plotted the ROC curve (FPR vs. TPR), which gave a train AUC of 0.998 and a test AUC of 0.997, and checked the confusion matrix on both the train and test sets.
  • Gradient Boosting with hyperparameter tuning (GridSearchCV):
    1. Using GridSearchCV on GBDT, we found the best parameters to be learning_rate = 1 and max_depth = 3.
    2. We then plotted a heatmap of the search results, grouped by max_depth and learning_rate.
    3. With these parameters, the model reached a root_mean_square_error of 0.07 and an accuracy_score of 0.9771.
    4. We plotted the ROC curve (FPR vs. TPR), which gave a train AUC of 0.998 and a test AUC of 0.998, and checked the confusion matrix on both the train and test sets.
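The tuning steps in the list above can be sketched as follows: run a grid search, then group the cross-validation results by the two tuned parameters to get a grid suitable for a heatmap. This is a minimal example on synthetic data. The helper `tune_and_pivot`, the small grids, the `roc_auc` scoring, and `n_estimators=20` are assumptions for illustration, and the split parameter is assumed to be scikit-learn's `min_samples_split`.

```python
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the project features and success labels.
X, y = make_classification(n_samples=600, n_features=10, random_state=0)


def tune_and_pivot(estimator, param_grid, row, col):
    """Hypothetical helper: grid-search, then reshape scores into a row x col grid."""
    search = GridSearchCV(estimator, param_grid, scoring="roc_auc", cv=3)
    search.fit(X, y)
    results = pd.DataFrame(search.cv_results_)
    # Group by the two tuned parameters and spread one of them across columns.
    scores = (
        results.groupby([f"param_{row}", f"param_{col}"])["mean_test_score"]
        .max()
        .unstack()
    )
    return search.best_params_, scores


# Decision Tree: tune depth and the minimum samples needed to split a node.
tree_best, tree_scores = tune_and_pivot(
    DecisionTreeClassifier(random_state=0),
    {"max_depth": [3, 10], "min_samples_split": [10, 100]},
    row="max_depth",
    col="min_samples_split",
)

# Gradient boosting: tune learning rate and tree depth.
gbdt_best, gbdt_scores = tune_and_pivot(
    GradientBoostingClassifier(n_estimators=20, random_state=0),
    {"learning_rate": [0.1, 1.0], "max_depth": [2, 3]},
    row="max_depth",
    col="learning_rate",
)
```

Each returned grid can be drawn with `seaborn.heatmap(tree_scores, annot=True)`, and `tree_best` or `gbdt_best` holds the winning parameter combination for refitting.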

Tools Used

Python
Jupyter Notebook
Google Colab
Streamlit
Flask
GitHub
Git Bash

Libraries Used

pandas
numpy
sklearn
matplotlib
seaborn
streamlit
nltk

Hosted on Streamlit: https://share.streamlit.io/suvrashaw/predicting-kickstarter-project-success/prime/app.py


Team Members


Languages

Python: 100.0%