Scott Hajek's repositories
wordle-model
Experiments in models and algorithms for solving wordle puzzles
data-science-knowledge
Knowledge base of concepts helpful for data scientists
pip
The Python package installer
nbstripout
strip output from Jupyter and IPython notebooks
kedro-viz
Visualise your Kedro data pipelines.
pypa.io
Sphinx src for pypa.io
aws-pricing
Ways to programmatically fetch current pricing for AWS services.
bundle
Bundles any python application into package deployable in Docker and Kubernetes.
convert-encoding
Command line script to Convert file encodings
interpret-community
The Interpret Community extends Interpret repo with additional interpretability techniques and utility functions to handle real-world datasets and workflows.
nbdime
Tools for diffing and merging of Jupyter notebooks.
pyspark-uploader
Enables rapid development of packages to be used via PySpark on a Spark cluster by uploading a local Python package to the cluster.
uci-mlr-practice
Practice machine learning techniques on data in the UCI Machine Learning Repository
ibis
Productivity-centric Python data analysis framework for SQL systems and the Hadoop platform. Co-founded by the creator of pandas
snarfler
Web-scraping and SOAP API examples.
moves
Data Science Demo: Real-time model scoring as a service using Pivotal Cloud Foundry
greenplum-ds-tutorial
Tutorials on how to use Greenplum and MADlib for data science.
acrobat-prefs
Preferences and settings for Adobe Acrobat, including Redaction search patterns
karabiner-config
Configurations for Karabiner-Elements https://pqrs.org/osx/karabiner/
ibis-docker-postgres
Specification for a Docker image containing PostgreSQL with PL/Python and postGIS installed.
postgres-docker
Docker Official Image packaging for Postgres
visualization-python
Demonstrate ways to visualize data in Python Jupyter notebooks
spark-lessons-learned
Examples related to Apache Spark
nlp-cs224n
Code and exercises related to Stanford cs224n course