Notes taken and exercises done on the Data Engineering on Google Cloud Platform Specialization
- Duration: five-weeks
- Content: 5 course of 1 week each
Introduction to the Big Data and Machine Learning capabilities of Google Cloud Platform (GCP)
Create and manage computing clusters to run Hadoop, Spark, Pig and/or Hive jobs on Google Cloud Platform. Access various cloud storage options from their compute clusters and integrate Google’s machine learning capabilities into their analytics programs
Carry out no-ops data warehousing, analysis and pipeline processing
Machine learning (ML) and TensorFlow concepts
Streaming data pipelines using Google Cloud Pub/Sub and Dataflow to enable real-time decision making