Analysis of Production Data
Requirements: Vmware CentOs Hadoop Apache Spark Scala
Importing both the tables(orders and order_items from retail_db database) from mysql to HDFS using sqoop.
Then perform necessary analytics using apache spark.
Problem statement, get the revenue and number of orders from order_items on daily basis.
Analysis of Production Data
Requirements: Vmware CentOs Hadoop Apache Spark Scala
Importing both the tables(orders and order_items from retail_db database) from mysql to HDFS using sqoop.
Then perform necessary analytics using apache spark.