You can find the course using this link
- RDD - snippets for working with PySpark resilient distributed datasets
- DF - snippets for working with PySpark data frames
- Streaming - snippets for working with PySpark RDD anf DF streaming
- ETL - pipeline for extracting, transforming and loading data from a text file into an AWS PostgreSQL database
- CDC - change data capture or replication on going, read more here