Dan Vatterott's repositories
BMM_attentional_CNN
A CNN with an attentional module that I built while attending the Brains, Minds and Machines summer course
jupyter_notebooks
Some of my Jupyter notebooks
Kodi_addons
Repository for Kodi addons that I build
baseball_matchup_predictions
predicting baseball game outcomes using a collaborative filtering approach
explore_feature_automation
Jupyter notebook exploring the Featuretools library.
mlb_errors
Quick analysis of retrosheet.org data to see if some baseball players are more likely to hit into errors than others.
presidential_speeches
Analysis of presidential speeches across time
reminder_email
Create weekly reminder emails on a random day
stackex_sum
Sifting the Overflow, a product I created as a Data Science Fellow at Insight (www.siftingtheoverflow.com)
amazon_helpfulness
looking at whether I can predict helpfulness of reviews
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
data-science-ipython-notebooks
Continually updated data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
data_challenge
generating fake data for data challenges
dataPipeline_presentation
Short presentation about building data-pipelines
dvatterott.github.io
website
fantasy_football
scraping and modeling fantasy football data
opiate_obits
Public health awareness project involving NLP of obituaries of people who died from opioid addiction.
pyspark_tutorial
Prepare.ai tutorial on building a data science pipeline with PySpark.
reveal_external
Plugin for reveal.js to import external sections.
sql_presentation
SQL Presentation at Insight
tv_vs_movies
investigating whether the quality of TV and movies has changed over time