https://www.kaggle.com/datasets/shree1992/housedata
The dataset contains information about houses in Sidney and Melbourne. It includes the information about the number of bedrooms, bathrooms, size, location, price and other features of the houses.
This project can be run using Asdf and Poetry, or using only Python.
- Install Asdf if you haven't already.
- Clone the repository.
- Run
asdf installthis will install the required versions of Python and Poetry. - Navigate to the project directory.
- Run
poetry installto install the dependencies.
- Install Python if you haven't already.
- Clone the repository.
- Navigate to the project directory.
- Run
pip install -r requirements.txtto install the dependencies.
- Download the dataset from https://www.kaggle.com/datasets/shree1992/housedata
- Unzip the dataset and place it in the data directory with the name
housing.csv - If you are running with Poetry add
poetry runbefore the command. - Start the MLFlow server by running
mlflow ui --backend-store-uri sqlite:///mlflow.db - Run
python main.pyto train and register the models. - Serve the model as an API by running
mlflow models serve -m ./mlruns/1/<artifact_id>/artifacts/random_forest_model -p 8000 --no-conda - Run
python monitor.pyto load a sample data, check for anomalies and generate a report.