etl-pipeline jupyter-notebook pandas psycopg

Movies-ETL

OVERVIEW

Britta needs your help to create an automated pipeline that takes in new data, performs the appropriate transformations, and loads the data into existing tables. You’ll need to refactor code to create one function that takes in the three files—Wikipedia data, Kaggle metadata, and the MovieLens rating data—and performs the ETL process by adding the data to a PostgreSQL database.

STEPS

1: Write an ETL Function to Read Three Data Files

2: Extract and Transform the Wikipedia Data

3: Extract and Transform the Kaggle data

4: Create the Movie Database

About

creating an automated pipeline that takes in new data, performs the appropriate transformations, and loads the data into existing tables

etl-pipeline jupyter-notebook pandas psycopg

Languages

Language:Jupyter Notebook 100.0%