California-Data-Collaborative / RateComparison

Easily compare the revenue, equity, and demand implications of different water rate structures.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Show in graph when the real data ends and predictions start

christophertull opened this issue · comments

Maybe add a vertical dotted line where the true data ends? or change to dashed line chart for predicted values?

I like the idea of a dashed line for predicted values though both would work. If we're really smart we'll show error bars for the predicted values ;)

Will take some thought about how we want to do error bars. Right now our forecasts are mean usage for each customer in each month. One option would be deriving error bars from the standard deviation of the residuals for each predicted month.

Yeah I think we might want to hold off on error bars while we're doing this sort of arithmetic accounting type forecast. When we get into more sophisticated econometric type forecasting there's natural links to error bars with OLS and (I think?) also ARIMA.

Might also be some stuff we can learn from the Water Demand Forecasting tool @fwhigh and the DataKind folks built. See here: https://github.com/California-Data-Collaborative/waterd/blob/master/app/train.R

Note also something we discussed with @EricSchles

Going to close this for now. @anudeepvanjavakam1 added a nice looking line to demarcate the predictions in dde330a:
image

Created a new issue for discussion of error bounds here: #31