abhishekmsharma / big-data-electricity-consumption-analysis-apache-spark

Developed for analysing and visualizing trends related to electricity and energy consumption

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

big-data-electricity-consumption-analysis-apache-spark

Developed for analysing and visualizing trends related to electricity and energy consumption

The project worked on a dataset containing more than 2 million records about electricity consumption on a per minute basis. The plethora of data was read and processed using Apache Spark Streaming. Spark Machine Learning Library (MLlib) was used for analyzing the usage patters, clustering the data points, and predicting the trends in electricity consumption.

Technology used: Apache Hadoop, Apache Spark, Spark MLlib, Java

Data: https://archive.ics.uci.edu/ml/datasets/individual+household+electric+power+consumption

About

Developed for analysing and visualizing trends related to electricity and energy consumption


Languages

Language:Java 100.0%