mysahgithub's repositories
shell-scripting-tutorial
A complete begineers guide to learn shell scripting from scratch which includes Videos, Practice scenarios and project idea.
emr-serverless-samples
Example code for running Spark and Hive jobs on EMR Serverless.
terraform-aws-emr-cluster
Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS
winutils
winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows
emr-on-eks
Run EMR workloads on EKS
aws-data-engineering
Resources for the free AWS Data Engineering course on youtube
Big-Data-Systems-Intelligence-Analytics
Python, SQL, Snowflake, AWS services, Google Cloud Platform, Hadoop, HDFS, Mapreduce, Hive, Pig, MongoDB, HBase, Apache kafka, Apache Airflow, Docker, Tableau
YouTube-Data-Analysis-using-Hadoop
Hadoop, HDFS, MapReduce, Hive, Pig, Java, Eclipse(Maven), Ubuntu(Linux commands)
hive-metastore
Hadoop/Hive/Spark container to perform CI tests
kafka-streaming-click-analysis
Use Kafka and Apache Spark streaming to perform click stream analytics
Hadoop-Spark-Environment
spark standalone-vagrant
vagrant-spark-cluster
Spark standalone cluster creation automatically using Vagrant and Ansible.
vagrant-hadoop
Vagrant for Hadoop
vagrant-hbase
A Vagrantfile to get up and running with Hadoop and HBase development.