-
Lambda function Demonstration using Container Image, This Lambda function achieve following task
-
Very Basic high level overview of this lambda function
- Lambda function
Ghactivity_ingestor
Get Data from GHArchive, save all json files to s3 table(landing/ghactivity)
. - Lambda function
Ghactivity_transfomer
will be triggered automatically onS3 PUT Event (landing/ghactivity)
- Lambda function
Ghactivity_transfomer
read all json files convert intoparquet file
and store onS3 table raw/ghactivity/
- Create
Glue Crawler
to crawl incremental data toAthena
table and run adhoc queries
- Create Lambda function in AWS console, set following Environment variable Run :
Goto AWS CONSOLE SET ENVIRONMENT VAR Based on Need
BUCKET_NAME : <YOUR_VAL>
FOLDER: <YOUR_VAL>
JOB_ID: <YOUR_VAL>
SOURCE_FOLDER : <YOUR_VAL>
TGT_FOLDER: <YOUR_VAL>
JOB_ID_1: <YOUR_VAL>
PYTHONPATH:/var/task/app