mikaelahonen-solita / aws-glue-tutorial

AWS Glue tutorial for data developers.

AWS Glue tutorial

This is the companion repository for the AWS Glue tutorial with Spark and Python for data developers.

DynamicFrame vs DataFrame in AWS Glue

Note the difference between a DynamicFrame and a DataFrame. A DataFrame is Spark's native table-like structure. The DynamicFrame class is AWS's attempt to address limitations of the DataFrame, such as the need to specify a schema before any data is loaded.

DynamicFrames are handy for reading and writing data. The processing itself is often more efficient with standard PySpark functions on a DataFrame.
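
The pattern described above could be sketched roughly as follows: read with a DynamicFrame, convert to a DataFrame for the processing, then convert back to write. This is a minimal sketch and not code from the tutorial. The function name `clean_table`, its parameters, the frame name "result" and the choice of S3 with Parquet as the output are assumptions made for illustration. The filtering and de-duplication step is only a placeholder for real processing logic.

```python
def clean_table(glue_context, database, table_name, output_path, required_column):
    """Read with a DynamicFrame, process as a DataFrame, write with a DynamicFrame.

    All argument names are hypothetical; glue_context is expected to be an
    awsglue.context.GlueContext created by the Glue job.
    """
    # Imported lazily because awsglue is only available in the Glue runtime.
    from awsglue.dynamicframe import DynamicFrame
    from pyspark.sql import functions as F

    # 1. Read: a DynamicFrame from a table in the Glue Data Catalog.
    source = glue_context.create_dynamic_frame.from_catalog(
        database=database, table_name=table_name
    )

    # 2. Process: convert to a Spark DataFrame and use standard PySpark functions.
    #    Placeholder logic: drop rows missing the required column, then de-duplicate.
    df = source.toDF()
    df = df.filter(F.col(required_column).isNotNull()).dropDuplicates()

    # 3. Write: convert back to a DynamicFrame and use the Glue writer.
    result = DynamicFrame.fromDF(df, glue_context, "result")
    glue_context.write_dynamic_frame.from_options(
        frame=result,
        connection_type="s3",                      # assumed target
        connection_options={"path": output_path},
        format="parquet",                          # assumed output format
    )
    return result
```

In a Glue job this would be called with the job's own GlueContext, for example `clean_table(glue_context, "my_database", "my_table", "s3://my-bucket/output/", "id")`, where the database, table, bucket and column names are placeholders.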

Languages

Language:Python 100.0%