Demand Forecast

What is the project about?

Products such as electronics, household appliances have varying characteristics and demand cycles. Category managers, responsible for overseeing these items throughout their lifecycle, face challenges in planning purchases, especially when the assortment spans thousands of items. As data volume increases and platform complexity grows, traditional methods of inventory management and demand forecasting become less effective. In response to these challenges, Supermegaretaillite is investing in the development of an automated system, including an ML-based demand forecasting service. To preserve the primacy of the source code, the SRC module is loaded in PyPi for further use in repositories training-pipeline and inference-pipeline.

How use this project?

This project is deployed on the server and is fully ready to work 24/7. If you want to use this service, you need to:

Open this web adress
For example, you would like to know the demand for the product with sku id 20 for the next 7 days. You should fill in the fields in web service: SKU_ID = 20, Stock = 10, Horizon Days = 7, Confidence Level = 0.90
Сlick the buttons: 3.1 "Get how much to order" to find out how much diawara you need to order from the supplier. 3.2 "Get stock level" to find out how much stock you will have in 7 days. 3.3 "Get low stock sku id" to find out which products will be out of stock in 7 days.

Installation

Add your AWS credentials to the .env file in the root directory of the project.

Example of .env file:

AWS_ACCESS_KEY_ID=your_access_key_id
AWS_SECRET_ACCESS_KEY=your_secret_access_key

To run docker-compose:

sudo chown -R 472:472 ./grafana_data
sudo chmod -R 777 ./grafana_data
docker-compose up

Usage

You can visit on accessed URL by Streamlit after run docker-compose.

To look the API documentation, visit:

http://localhost:8000/docs

To look at the metric charts in Grafana, visit:

http://localhost:3000

and login with username - admin and password - admin

To look the application by FastAPI, use:

uvicorn app:app --reload --host localhost --port 8000

To look the application by Streamlit, use:

streamlit run web_app/streamlit_app.py

Documentation

If you want to learn ML Sistem Design Documentation, you can open docs/ML_System_Design.md

If you want to learn Documentation about the project, you can open docs/build/html/index.html

Pipeline this project

Main tools used

Project Organization

.
├── app.py                <--- FastAPI app
├── configs               <--- Configs for this project
│   └── train_config.yaml     <--- Training configuration file
├── data                  <--- Data for this project
│   ├── external              <--- External data sources
│   ├── interim               <--- Intermediate data
│   ├── processed             <--- Processed data ready for analysis
│   │   ├── features_targets.csv  <--- Features and targets for training
│   │   ├── predictions.csv     <--- Model predictions
│   │   └── sku_demand_day.csv  <--- Demand forecast for each SKU
│   └── raw                   <--- Raw data before processing
│       ├── demand_orders.csv  <--- Demand orders data
│       ├── demand_orders_status.csv  <--- Demand orders status data
│       ├── features.csv     <--- Features data
│       └── sales.csv       <--- Sales data
├── data.dvc               <--- DVC data tracking file
├── docker-compose.yml     <--- Docker Compose configuration
├── Dockerfile             <--- Dockerfile for building the project container
├── docs                   <--- Documentation for this project
│   ├── Makefile               <--- Makefile for building docs
│   └── ML_System_Design.md    <--- ML System Design document
├── dvc.lock               <--- DVC lock file
├── dvc.yaml               <--- DVC pipeline definition
├── images                 <--- Images used in documentation
│   └── Demand_Forencast_Pipeline.jpg
├── __init__.py            <--- Package initializer
├── LICENSE                <--- License for this project
├── logs                   <--- Logs generated by the application
│   ├── app.log
├── Makefile               <--- Makefile for various build tasks
├── MANIFEST.in            <--- Manifest for package distribution
├── models                 <--- ML models and associated files
│   ├── losses.json            <--- Model training losses
│   └── model.pkl              <--- Serialized model
├── notebooks              <--- Jupyter notebooks for exploration and analysis
│   └── EDA.ipynb              <--- Exploratory Data Analysis notebook
├── project_structure.txt  <--- Project structure description
├── prometheus_data        <--- Prometheus monitoring configuration
│   ├── app_metrics.py         <--- Application metrics for Prometheus
│   └── prometheus.yml         <--- Prometheus configuration file
├── README.md              <--- Readme file with project overview
├── requirements.txt       <--- Python dependencies
├── setup.cfg              <--- Setup configuration
├── setup.py               <--- Setup script for installing the package
├── src_demand_forecast    <--- Source code for demand forecast
│   ├── data                   <--- Data processing scripts
│   │   ├── __init__.py
│   │   └── split_dataset.py   <--- Script for splitting the dataset
│   ├── download               <--- Data download scripts
│   │   ├── download_from_s3.py<--- Script to download data from S3
│   │   └── __init__.py
│   ├── entities               <--- Entity definitions
│   │   ├── feature_params.py
│   │   ├── __init__.py
│   │   ├── model_params.py  <--- Model parameters
│   │   ├── split_params.py  <--- Split parameters
│   │   ├── train_pipeline_params.py  <--- Training pipeline parameters
│   │   └── validation_params.py  <--- Validation parameters
│   ├── features               <--- Feature engineering scripts
│   │   ├── AddFeatures.py  <--- Script for adding features
│   │   ├── AddTargets.py  <--- Script for adding targets
│   │   ├── build_sku_by_day.py  <--- Script for building SKU demand by day
│   │   ├── build_transformer.py  <--- Script for building a transformer
│   │   └── __init__.py
│   ├── inference              <--- Inference scripts
│   │   ├── __init__.py
│   │   └── make_request.py    <--- Script for making inference requests
│   ├── __init__.py
│   ├── models                 <--- Model training and evaluation scripts
│   │   ├── __init__.py
│   │   ├── repro_experiments.py <--- Reproducibility experiments
│   │   └── train_model.py     <--- Model training script
│   ├── upload                 <--- Data upload scripts
│   │   ├── __init__.py
│   │   └── s3_storage.py      <--- Script to upload data to S3
│   └── visualization          <--- Data visualization scripts
│       ├── __init__.py
│       └── visualize.py       <--- Script for visualizing data
├── tests                  <--- Unit tests for the project
│   ├── app                    <--- Tests for the FastAPI app
│   │   └── test_streamlit_app.py  <--- Tests for the Streamlit app
│   ├── conftest.py          <--- Pytest configuration
│   ├── data                   <--- Tests for data processing
│   │   └── test_data.py     <--- Tests for data processing
│   └── models                 <--- Tests for models
├── tox.ini                <--- Tox configuration for testing
├── train_pipeline.py      <--- Script for running the training pipeline
└── web_app                <--- Web application scripts
    ├── Dockerfile         <--- Dockerfile for the web application
    ├── __init__.py
    ├── requirements.txt  <--- Python dependencies for the web application
    └── streamlit_app.py    <--- Streamlit web application

Edipool / DemandForecast