praateekmahajan / tempo

This project provides an API for manipulating time series on top of Apache Spark. Functionality includes featurization using lagged time values, rolling statistics (mean, sum, count, etc.), AS OF joins, downsampling, and interpolation. It has been tested on terabyte-scale historical data and is covered by unit tests.
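To illustrate what an AS OF join means, here is a minimal sketch in plain Python. It is not this project's API and does not use Spark. The function name `as_of_join` and the sample `trades` and `quotes` data are hypothetical, chosen only to show the semantics: each left-hand row is paired with the most recent right-hand row whose timestamp is at or before it.

```python
from bisect import bisect_right


def as_of_join(left, right):
    """Pair each left row with the latest right value at or before its timestamp.

    Both inputs are lists of (timestamp, value) tuples.
    Assumption: `right` is already sorted by timestamp.
    """
    right_ts = [ts for ts, _ in right]
    joined = []
    for ts, value in left:
        # Position of the first right-hand timestamp strictly greater than ts.
        idx = bisect_right(right_ts, ts)
        # idx == 0 means there is no right-hand row at or before ts.
        match = right[idx - 1][1] if idx > 0 else None
        joined.append((ts, value, match))
    return joined


# Hypothetical sample data: (timestamp, value) pairs.
trades = [(3, "buy"), (7, "sell"), (1, "buy")]
quotes = [(2, 100.0), (5, 101.5), (7, 102.0)]
result = as_of_join(trades, quotes)
```

A left-hand row with no earlier right-hand row gets `None`, which mirrors how a left-style AS OF join usually leaves unmatched rows empty. A distributed implementation would typically partition by a key and sort within partitions rather than use a binary search over an in-memory list.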

Home Page: https://github.com/databrickslabs/tempo

