Kyrylo-Ktl / PySpark

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PySpark & AWS tutorial

Udemy course link

You can find the course using this link

Topics

  • RDD - snippets for working with PySpark resilient distributed datasets
  • DF - snippets for working with PySpark data frames
  • Streaming - snippets for working with PySpark RDD anf DF streaming
  • ETL - pipeline for extracting, transforming and loading data from a text file into an AWS PostgreSQL database
  • CDC - change data capture or replication on going, read more here

Course certificate

img

About


Languages

Language:Python 99.8%Language:Shell 0.2%