praveendecode / Industrial_Copper_Modeling

Enhance data analysis and machine learning skills in the 'Industrial Copper Modeling' project. Tackle complex sales data challenges, employ regression models for pricing predictions, and master lead classification for targeted customer solutions

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Industrial Copper Modeling

image

Overview

Enhance data analysis and machine learning skills in the 'Industrial Copper Modeling' project. Tackle complex sales data challenges, employ regression models for pricing predictions, and master lead classification for targeted customer solutions

Features

1. Data Preprocessing:

  • Gain a deep understanding of dataset variables and types.
  • Handle missing data with appropriate strategies.
  • Prepare categorical features through encoding and data type conversion.
  • Address skewness and ensure data balance.
  • Identify and manage outliers.
  • Resolve date discrepancies for data integrity.

2. Exploratory Data Analysis (EDA) and Feature Engineering:

  • Visualize and correct skewness.
  • Identify and rectify outliers.
  • Feature improvement and creation for more effective modeling.

3. Classification:

  • Success and Failure Classification: Focusing on 'Won' and 'Lost' status.
  • Algorithm Assessment: Evaluating algorithms for classification.
  • Algorithm Selection: Choosing the Random Forest Classifier.
  • Hyperparameter Tuning: Fine-tuning with GridSearchCV and cross-validation.
  • Model Accuracy and Metrics: Assessing performance and metrics.
  • Model Persistence: Saving the model for future use.

4. Regression:

  • Algorithm Assessment: Identifying algorithms for regression.
  • Algorithm Selection: Opting for the Random Forest Regressor.
  • Hyperparameter Tuning: Fine-tuning with GridSearchCV and cross-validation.
  • Model Accuracy and Metrics: Evaluating regression model performance.
  • Model Persistence: Saving the regression model for future applications.

Getting Started

1. Clone the repository:

      https://github.com/praveendecode/Industrial_Copper_Modeling

2. Install required packages:

      pip install -r requirements.txt

3. Run the Streamlit app:

      streamlit run app.py

4. Access the app in your browser:

      http://localhost:8501

Skills Covered

  • Python
  • Numpy
  • Pandas
  • Scikit-Learn
  • Matplotlib
  • Seaborn
  • Pickle
  • Streamlit
  • Docker

Results

  • Classification: Achieved 98.999% accuracy with ExtraTrees Forest Classifier.
  • Regression: Achieved 98.3% accuracy with ExtraTrees Forest Regressor.

This project not only demonstrates data analysis and machine learning skills but also showcases practical applications for solving complex challenges in the manufacturing industry.

About

Enhance data analysis and machine learning skills in the 'Industrial Copper Modeling' project. Tackle complex sales data challenges, employ regression models for pricing predictions, and master lead classification for targeted customer solutions


Languages

Language:Jupyter Notebook 99.2%Language:Python 0.8%