martingaldeca / Senniors-data-explorer

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Senniors previous data explorer

This repository is a notebook with different purposes.

Explore the data

The main purpose of the repo is to explore the data of the provided csv, and understand it better for future machin learning training.

Visualize the data

In order to visualize the data the repository uses matplotlib and the methods provided by pandas to plot the dataframes.

Maybe in the future it will be migrated to seaborn or plotly

Requirements

You must make sure you have the following:

  • poetry
  • pipenv

Installation

Just go to the project folder and open a terminal.

The run the following:

poetry env use 3.10
poetry install
poetry run jupyter notebook --no-browser

And then some links will appear on terminal, just click on any of them.

Then go to etl_and_visualization.ipynb file and execute all the blocks of the notebook.

About


Languages

Language:Jupyter Notebook 100.0%