sudarshan-koirala / data-engineering-resources

Repo with data engineering resources

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

data-engineering-resources

Repo with data engineering resources

  • To be clear, this is not a roadmap for getting started with Data Engineering.
  • I am not covering the books you should study, university studies, certificates, etc.
  • I assume you have satisfactory understanding of Python and SQL. Scala good to have.
  • Having basics understanding of CI/CD is needed ( There is DevOps / MLOps to help offcourse )
  • After knowing the basics and how things work, it's upon you, what to do ( Or lets say if it's your cup of tea / coffee or not )

Remember one thing, knowing and implementing Data Engineering tools are different thing, try to implement if it is a simple program or project.

Choose between

Modern Open Source Data Stack

image

Do some research on what sorts of company you want to apply job and what tools they use ( you can achieve this by just going through the job description of those companies) Example:


Books

  • There are many books but if you want me to suggest one, go for Fundamentals of Data Engineering by Joe Reis, Matt Housley.

Github

There are many repos with greate content / links. Some of them 👇 Suggestion: Just search data engineering and find the best ones,

Cloud Computing

Youtube ( Free University )

  • There is unlimited knoweledge you can grasp, try to find the best ones and follow them instead of jumping among videos.

Main thing I want to highlight, practice practice and practice, take help with AI assistants 👇

AI Assistants ( Remember, personal use or enterprise use )


This page will be updated over time. Cheers !!

About

Repo with data engineering resources

License:MIT License