arjayjean / superhero_power_stat

SuperHero API ETL

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SuperHeroAPI ETL

🧰 Languages and Tools


I have created an ETL that:

  • Extract: Data is extracted from a SuperHero API by Yoann Cribier
  • Transform: It will be transformed in Python from its JSON format into a form that fits the MySQL database
  • Load: Then loaded into a MySQL database

⚙ ETL Process

Notes:

  • Before I execute the ETL, I need to take more time transforming the data. The data was transformed but I need to replace the redundant data that was loaded. There were entities that were empty values, that needed to be NULL, and names that were repeated in the "alias" and "name" column. This will take time away from analysis that needs to be made. Instead of making those analysis, data cleaning would have to be done before; which is what I have done in MySQL.

About

SuperHero API ETL

License:MIT License


Languages

Language:Python 100.0%