BaileeRice / Movies-ETL

Creating an automated pipeline that takes in new data, performs the appropriate transformations, and loads the data into existing tables.

Movies-ETL

OVERVIEW

Britta needs your help to create an automated pipeline that takes in new data, performs the appropriate transformations, and loads the data into existing tables. You’ll need to refactor code to create one function that takes in the three files—Wikipedia data, Kaggle metadata, and the MovieLens rating data—and performs the ETL process by adding the data to a PostgreSQL database.
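A minimal sketch of the extract half of such a function follows. The function name `extract_three_files`, its parameter names, and the file formats (a JSON list of records for the Wikipedia data, CSV for the Kaggle metadata and the MovieLens ratings) are assumptions made for illustration; they are not taken from the notebook in this repository.

```python
import json

import pandas as pd


def extract_three_files(wiki_path, kaggle_path, ratings_path):
    """Read the three source files into DataFrames (hypothetical helper)."""
    # Assumption: the Wikipedia data is a JSON file holding a list of records.
    with open(wiki_path, mode="r", encoding="utf-8") as f:
        wiki_movies_raw = json.load(f)
    wiki_movies_df = pd.DataFrame(wiki_movies_raw)

    # Assumption: the Kaggle metadata and MovieLens ratings are CSV files.
    # low_memory=False avoids chunked type inference on wide, mixed-type files.
    kaggle_metadata = pd.read_csv(kaggle_path, low_memory=False)
    ratings = pd.read_csv(ratings_path)

    return wiki_movies_df, kaggle_metadata, ratings
```

Returning the three DataFrames keeps the read step separate from the transformations, so each later step can be checked on its own before anything is written to the database.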

STEPS

1: Write an ETL Function to Read Three Data Files

2: Extract and Transform the Wikipedia Data

3: Extract and Transform the Kaggle Data

4: Create the Movie Database
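The transform and load steps above could be sketched roughly as follows. This is an illustration under stated assumptions, not the notebook's actual code: the column names `imdb_link`, `imdb_id`, and `budget`, the table name `movies`, and all three function names are hypothetical. The `con` argument can be any connection that `DataFrame.to_sql` accepts; for PostgreSQL that would typically be a SQLAlchemy engine, while a `sqlite3` connection is handy for a quick local check.

```python
import sqlite3

import pandas as pd


def transform_wiki(wiki_df):
    """Keep rows with an IMDb link, derive an imdb_id key, drop duplicates."""
    df = wiki_df.dropna(subset=["imdb_link"]).copy()
    # Assumption: the IMDb ID appears in the link as "tt" followed by digits.
    df["imdb_id"] = df["imdb_link"].str.extract(r"(tt\d+)", expand=False)
    return df.dropna(subset=["imdb_id"]).drop_duplicates(subset="imdb_id")


def transform_kaggle(kaggle_df):
    """Coerce the (assumed) budget column to numbers; bad values become NaN."""
    df = kaggle_df.copy()
    df["budget"] = pd.to_numeric(df["budget"], errors="coerce")
    return df


def build_movie_table(wiki_df, kaggle_df, con):
    """Merge the cleaned sources on imdb_id and append them to a table."""
    wiki = transform_wiki(wiki_df)
    kaggle = transform_kaggle(kaggle_df)
    movies = pd.merge(wiki, kaggle, on="imdb_id", suffixes=("_wiki", "_kaggle"))
    # if_exists="append" adds rows to an existing table instead of replacing it.
    movies.to_sql(name="movies", con=con, if_exists="append", index=False)
    return movies


# Local check with an in-memory SQLite database and made-up rows.
wiki_sample = pd.DataFrame({
    "title": ["A", "A (duplicate)", "B"],
    "imdb_link": [
        "https://www.imdb.com/title/tt0000001/",
        "https://www.imdb.com/title/tt0000001/",
        None,
    ],
})
kaggle_sample = pd.DataFrame({
    "imdb_id": ["tt0000001", "tt0000002"],
    "budget": ["1000", "n/a"],
})
con = sqlite3.connect(":memory:")
movies = build_movie_table(wiki_sample, kaggle_sample, con)
```

Using `if_exists="append"` matches the goal of loading into existing tables; the trade-off is that running the pipeline twice on the same input inserts the same rows twice, so a real run would need either a cleared table or a deduplication step.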

Languages

Language: Jupyter Notebook 100.0%