mammhoud / Data-engineering

This repo contains materials and exercises for the Data Engineering Master Class for Sprints.ai.

Data Engineering Master Class

My journey has taken me through:

  • Self-directed learning: Countless online resources, tutorials, and experimentation fueled my initial passion.
  • Hands-on projects: From personal data analysis tasks to contributing to open-source data initiatives, I honed my skills in real-world scenarios.
  • Industry mentorship: Learning from seasoned data engineers provided invaluable guidance and insights.

Expert Guidance Every Step of the Way

Meet the esteemed mentors who will accompany you on this journey:

  • ENG. Amro Saleh
  • ENG. Ahmed Reda
  • ENG. Mohamed Essam

Combining these experiences and skills, this training course provides:

  • A strong foundation in basic data engineering concepts.
  • Practical skills in data exploration, cleaning, transformation, and ingestion.
  • The ability to use artificial intelligence and cloud solutions to analyze data at scale.
  • Experience in visualizing data to tell compelling stories.

Dive Deep into the Data Universe

This program equips you with the tools to tackle any data challenge:

  • Conquer the Power of Databases: Master both relational (MySQL) and NoSQL solutions to store valuable data efficiently.
  • Become a Data Ingestion Guru: Learn the art of seamlessly bringing data from various sources into an analytical environment.
  • Data Wrangling: From Messy to Marvelous: Discover the magic of data cleaning and transformation, turning raw data into actionable insights.
  • Demystify AI and Cloud Solutions: Explore the exciting world of Artificial Intelligence and leverage the power of cloud platforms to scale data endeavors.
  • Tame the Beast: Cloudera Big Data Solutions: Unleash the power of Cloudera's big data platform, designed to handle massive datasets easily.
  • Data Visualization: Tell Compelling Stories: Master the art of data visualization, transforming complex data into clear and impactful visuals.
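To make the ingestion, wrangling, and storage topics above a little more concrete, here is a minimal sketch using only the Python standard library. It is not taken from the course materials: the column names, the `orders` table, and the sample rows are made up for illustration, and SQLite stands in for whichever database the course actually uses.

```python
import csv
import io
import sqlite3

# Made-up raw input with stray whitespace, mixed casing, and a missing value.
RAW_CSV = """city,day,orders
 cairo ,2024-01-01,120
Cairo,2024-01-02,
GIZA,2024-01-01,300
"""

def clean_rows(raw_text):
    """Parse CSV text and return cleaned (city, day, orders) tuples."""
    cleaned = []
    for row in csv.DictReader(io.StringIO(raw_text)):
        city = row["city"].strip().title()   # normalize whitespace and casing
        orders = row["orders"].strip()
        if not orders:                        # drop rows with a missing value
            continue
        cleaned.append((city, row["day"].strip(), int(orders)))
    return cleaned

def load_rows(conn, rows):
    """Create the (hypothetical) target table if needed and insert the rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders (city TEXT, day TEXT, orders INTEGER)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()

rows = clean_rows(RAW_CSV)
conn = sqlite3.connect(":memory:")  # in-memory database, nothing written to disk
load_rows(conn, rows)
totals = conn.execute(
    "SELECT city, SUM(orders) FROM orders GROUP BY city ORDER BY city"
).fetchall()
print(totals)
```

The cleaning step is kept separate from the loading step so each can be tested on its own, which is one common way to structure a small ingestion pipeline.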

Graduation Project: Big Data Debut

Project Title: Analyzing COVID-19 Data Streams

Technologies: Big Data, Hadoop, Cloudera, Power BI, and HiveQL (HQL), used to analyze COVID-19 data streams, gain valuable insights, and contribute to the fight against this global pandemic.
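As a rough illustration of what aggregating a data stream can look like, the sketch below keeps a running total per key as records arrive. It is plain Python rather than the Hadoop, Cloudera, and HQL stack the project uses, and the `running_totals` helper and the placeholder records are assumptions for illustration, not real figures or the project's actual code.

```python
from collections import defaultdict

def running_totals(records):
    """Yield (key, cumulative_count) after each incoming (key, count) record."""
    totals = defaultdict(int)
    for key, count in records:
        totals[key] += count
        yield key, totals[key]

# Made-up placeholder records standing in for a live feed; not real data.
sample_stream = [("A", 5), ("B", 2), ("A", 3)]
snapshots = list(running_totals(sample_stream))
print(snapshots)
```

Because `running_totals` is a generator, it processes one record at a time and never needs the whole stream in memory, which is the basic idea behind streaming aggregation.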

Languages

  • Jupyter Notebook: 98.8%
  • Python: 0.8%
  • HiveQL: 0.4%