QuentinAmbard / timeseries

timeseries forecasting with spark ML

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Timeseries forecasting with Spark ML

  • Download https://zeppelin.apache.org, Spark > 2.0 and load the notebook.
  • The timeseries can be loaded from .csv file instead of the C* table:
val df: DataFrame = spark.read
  .format("csv")
  .option("header", "true")
  .option("dateFormat", "M/d/YYYY")
  .schema(StructType(Array(StructField("time", DateType, true), StructField("amount", FloatType, true))))
  .csv("/home/quentin/projects/spark-ml/ml/src/main/resources/sale_timeserie2.csv")

df.createOrReplaceTempView("sales") 
  • This project is a very basic demo for an introduction to timeseries and spark ML

About

timeseries forecasting with spark ML