luciegaba / data-mining-project

This project is articulated over 2 parts: PCA Clustering Analysis and Regression

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Table of Contents


About the Project

The following project is realized as part of the Data Mining course given by the MoSEF Master at the University of Paris 1 Panthéon-Sorbonne. The main goal is to apply our knowledge in ML and Feature Engineering. For this purpose, the project is articulated over 2 parts: PCA Clustering Analysis and Regression (predict electricity per sector).

Getting datasets

Datasets are from open data sources such as INSEE. You can find them at the following addresses:

Installing the project

In order to execute notebook without problems, you may put datasets in the data folder. You may also use pip install -r requirements.txt

Contact

About

This project is articulated over 2 parts: PCA Clustering Analysis and Regression


Languages

Language:Jupyter Notebook 99.9%Language:Python 0.1%