GhayathAlobaidi / ETL-Pipeline--HTML-Pandas-SQL

A project that takes a CSV file, cleans the data, converts it into a Pandas DataFrame, and pushes it into a SQL database.

A few months ago, I was tasked with outlining a prototype for automating a system that logs onto the National Center for Education Statistics website, downloads data, cleans it, and saves it in MySQL.

-This was my response to the challenge, and I continue to learn ways to automate a better system.

-The goal is to download the data in CSV format, convert it into a Pandas DataFrame, and then save the data into the appropriate MySQL tables.

-The Jupyter notebook walks through each step of the ETL process.
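
The extract, clean, and load steps described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the sample CSV, the column names, the `schools` table, and the helper functions `extract`, `transform`, and `load` are all hypothetical. It also uses an in-memory SQLite database as a stand-in for MySQL so the example runs without a server.

```python
import io
import sqlite3

import pandas as pd

# Hypothetical sample standing in for a downloaded CSV export.
RAW_CSV = """School Name,State,Enrollment
Lincoln High,CA,1200
Lincoln High,CA,1200
Roosevelt Middle,,850
Jefferson Elementary,TX,not reported
"""


def extract(csv_source):
    """Read a CSV file path or file-like object into a DataFrame."""
    return pd.read_csv(csv_source)


def transform(df):
    """Normalize column names, drop duplicates, and keep only complete rows."""
    out = df.copy()
    out.columns = [c.strip().lower().replace(" ", "_") for c in out.columns]
    out = out.drop_duplicates()
    # Non-numeric enrollment values become NaN so they can be filtered out.
    out["enrollment"] = pd.to_numeric(out["enrollment"], errors="coerce")
    out = out.dropna(subset=["state", "enrollment"])
    out["enrollment"] = out["enrollment"].astype(int)
    return out.reset_index(drop=True)


def load(df, conn, table):
    """Write the DataFrame to a SQL table, replacing any existing table."""
    df.to_sql(table, conn, if_exists="replace", index=False)


# Assumption: SQLite in memory replaces MySQL for this sketch.
conn = sqlite3.connect(":memory:")
clean = transform(extract(io.StringIO(RAW_CSV)))
load(clean, conn, "schools")

rows = conn.execute(
    "SELECT school_name, state, enrollment FROM schools"
).fetchall()
print(rows)
```

For a real MySQL target, `DataFrame.to_sql` also accepts a SQLAlchemy engine in place of the SQLite connection; the connection details would depend on the database setup.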

Languages

Jupyter Notebook 100.0%