kimtth / Data-Engineering-AWS-Cloud

Repository from GitHub: https://github.com/kimtth/Data-Engineering-AWS-Cloud

Data-Engineering-AWS-Cloud

  • AWS Data Pipeline (Managing Jobs / Job Scheduler)
  • AWS Lambda (Trigger Glue Job)
  • AWS Glue (Job / ETL / PySpark / Crawler)
  • AWS Athena (querying unstructured data in S3 with SQL)
  • AWS Redshift (vs Google BigQuery): DWH / Transaction / Cost by the number of nodes / Table is required / No partitions, but distribution and sort keys
  • AWS Redshift Spectrum (Data Enrichment / Multiple datasources)
  • QuickSight (BI)
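
The "AWS Lambda (Trigger Glue Job)" step in the list above could look roughly like the following. This is a minimal sketch, not the code from this repository: it assumes the Lambda function is invoked by an S3 object-created event, and the job name `example-etl-job`, the `GLUE_JOB_NAME` environment variable, and the `--source_path` job argument are hypothetical names chosen for illustration. The only AWS call used is boto3's Glue `start_job_run`.

```python
import os
from urllib.parse import unquote_plus


def lambda_handler(event, context, glue_client=None):
    """Start one Glue job run per S3 object listed in the event.

    glue_client is injectable so the handler can be exercised without AWS;
    in Lambda it is left as None and a real boto3 client is created.
    """
    if glue_client is None:
        import boto3  # available in the AWS Lambda Python runtime

        glue_client = boto3.client("glue")

    # Hypothetical configuration: the job name comes from an environment variable.
    job_name = os.environ.get("GLUE_JOB_NAME", "example-etl-job")

    run_ids = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys in S3 event notifications are URL-encoded.
        key = unquote_plus(record["s3"]["object"]["key"])

        response = glue_client.start_job_run(
            JobName=job_name,
            # Hypothetical argument name; the Glue script would have to read it.
            Arguments={"--source_path": f"s3://{bucket}/{key}"},
        )
        run_ids.append(response["JobRunId"])

    return {"started": run_ids}
```

Passing the client as an optional parameter keeps the handler testable locally with a stub object. In a real deployment the function's IAM role would also need permission to start the Glue job.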

HLA

Dataset: Brazilian e-commerce data from Kaggle

Languages

Language: Python 100.0%