Scott Hajek's repositories
greenplum-ds-tutorial
Tutorials on how to use Greenplum and MADlib for data science.
ibis-docker-postgres
Specification for a Docker image containing PostgreSQL with PL/Python and postGIS installed.
pyspark-uploader
Enables rapid development of packages to be used via PySpark on a Spark cluster by uploading a local Python package to the cluster.
nbstripout
strip output from Jupyter and IPython notebooks
acrobat-prefs
Preferences and settings for Adobe Acrobat, including Redaction search patterns
aws-pricing
Ways to programmatically fetch current pricing for AWS services.
convert-encoding
Command line script to Convert file encodings
data-science-knowledge
Knowledge base of concepts helpful for data scientists
interpret-community
The Interpret Community extends Interpret repo with additional interpretability techniques and utility functions to handle real-world datasets and workflows.
karabiner-config
Configurations for Karabiner-Elements https://pqrs.org/osx/karabiner/
nlp-cs224n
Code and exercises related to Stanford cs224n course
postgres-docker
Docker Official Image packaging for Postgres
spark-lessons-learned
Examples related to Apache Spark
uci-mlr-practice
Practice machine learning techniques on data in the UCI Machine Learning Repository
visualization-python
Demonstrate ways to visualize data in Python Jupyter notebooks
wordle-model
Experiments in models and algorithms for solving wordle puzzles