PrathameshNimkar / Big-Data-Analysis-using-the-Hadoop-Ecosystem

Learn and implement the Hadoop Ecosystem to drive Big Data Analytics.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Big Data Analysis using the Hadoop Ecosystem

Learn and implement the Hadoop Ecosystem to drive Big Data Analytics.

I've created a set of tutorials for you to begin your journey into the Big Data world. If you're new to data engineering within the Hadoop Ecosystem, you've come to the right place!



Screenshot

The above image is the pipeline I'll be using/following for my upcoming tutorials on Big Data analytics using the Hadoop Ecosystem. To begin with, let’s learn about the architectures of each individual system/tool.

We'll build upon this knowledge by implementing a practical real-life project within the aviation domain. Now, for reference, I know very little about this domain, so this should be an interesting challenge to tackle not just for you but for me as well. I'm super excited, hope you are as well...

If you're new to Big Data, I highly recommend to go through the below (in order). So, without further delay, let's begin...



Tutorials:

  1. Hadoop Distributed File System (HDFS)
  2. Sqoop
  3. Flume
  4. MapReduce
  5. Pig
  6. Hive
  7. HBase
  8. Spark
  9. Hue
  10. Tableau



Lastly:

  • Should you have any feedback/suggestions, I would love to hear them.
  • If you'd like to contribute, feel free to submit a pull request.
  • If you'd like for me to address a specific topic in further detail, do not hesitate to connect with me.

Originally published on my Medium account here

About

Learn and implement the Hadoop Ecosystem to drive Big Data Analytics.