AidanG1 / datathon2022

Winner of First Place Underclassmen Prize at Rice Datathon 2022

Home Page:https://breezee.me/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

πŸƒBreezee

Any Way The Wind Blows

Created by Aidan Gerber, Noah Spector, Sai Mettupalli, and Jacob Kasner

Table of Contents πŸ“•

  1. Objectives
  2. Cleaning
  3. Modeling
  4. Live Hosting
  5. Citations

Objectives 🧠

Breezee seeks to establish predictions for 72 hours into the future regarding "any way the wind blows," and how fast. We leverage the National Data Buoy Center's network of sensors to obtain data from the past 45 days, then display it in our dashboard alongside the future prediction model trained using the historical data.

Cleaning 🧼

The datasets for each individual station presented themselves as text files on the National Oceanographic and Atmospheric Association's website. We developed a pipeline to request the text file given the station name and parse it into a pandas DataFrame.

Modeling πŸ‹οΈ

We split the wind speed and wind direction predictions into two separate models.

For wind speed, in order to arrive at the Gated Recurrent Unit model, we first needed to attempt other options.

  1. Extreme Learning Machine initially seemed promising, but the methodology focuses primarily on classification.
  2. Markov Chains drew from our experience in COMP 140, yet only allowed for tracking one value across time, rather than incorporating other features.
  3. Simple Linear Regression across datetime provided a decent line of best fit, but could not account for the oscillating data pattern.
  4. Long Short-term Memory showed promise, but it had difficulty making any predictions besides the average.

Gated Recurrent Units performed best.

For wind direction, we repurposed our already-written Markov chain to give a prediction of 8 possible directions.

Live Hosting πŸ’»

  • Deta Deployment for Dashboard
  • Google Colab for Model Training and Predictions
  • JupyterNotebooks for Data Exploration and Cleaning Methodology
  • Gets data live from buoys and makes new predictions

Citations πŸ§‘β€πŸ«

https://www.ndbc.noaa.gov/

https://www.tensorflow.org/tutorials/structured_data/time_series

Colab link running project : https://colab.research.google.com/drive/1-ocY5TOAvqVGfvS2pJiQYma6IBsST9zN#scrollTo=gl36zBdPpfIY

https://www.tensorflow.org/guide/keras/rnn

https://www.tensorflow.org/tfx/tutorials/serving/rest_simple

https://towardsdatascience.com/quickly-share-ml-webapps-from-google-colab-using-ngrok-for-free-ae899ca2661a

About

Winner of First Place Underclassmen Prize at Rice Datathon 2022

https://breezee.me/


Languages

Language:Python 97.7%Language:Svelte 2.2%Language:JavaScript 0.1%Language:HTML 0.1%