Alvin Thai's repositories
amazon_review_summarizer
A product comparison tool for Amazon.com
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
d3-3.x-api-reference
An archive of the D3 3.x API Reference.
data_sketches
A collection of data related UX sketches made by Krisztina Szerovay
custom_preprocessors
custom preprocessors for sklearn
histoviewer
Dynamic histograms for pandas DataFrames
MLB_pitch_type_predictor
Multi-classification predictions and explanations for MLB play-by-play data
OrderedOVRClassifier
API for performing Ordered One-Vs-Rest Classification with scikit-learn
auto_ml
Automated machine learning for analytics & production
BreakoutDetection
Breakout Detection via Robust E-Statistics
DecisionTreeExplorer
Simple Shiny App for visualizing simplicity/performance tradeoffs in decision trees
guess_the_actor
Guess the actor who appears in a pair of movies!
lantern
Data exploration glue
Laurae
Advanced High Performance Data Science Toolbox for R by Laurae
lifelines
Survival analysis in Python
meaningful_use_of_ehrs
An analysis of the Medicare EHR Incentive Program from 2011-2017
oreilly-intro-to-predictive-clv
Repo that contains the supporting material for O'Reilly Webinar "An Intro to Predictive Modeling for Customer Lifetime Value" on Feb 28, 2017
pandas-profiling
Create HTML profiling reports from pandas DataFrame objects
probabilistic-programming-from-scratch
Notebook version of an article on the Fast Forward Labs blog
pyflux
Open source time series library for Python
random_slides
test repo for remarkjs presentations
selenium-python
Selenium Python Bindings Documentation
storytelling-with-data-ggplot
Recreation of Cole Nussbaumer Knaflic's Storytelling with Data plots using R an ggplot2