Ryan M White's repositories
ansible-role-data-science
Ansible role to provision Data Science related tooling
ansible-role-kubernetes
Ansible role to provision Kubernetes client side tools
ansible-role-pachyderm
Ansible role to install Pachyderm via Helm
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
cython-cmake-example
Utilities and example for using CMake to build Cython modules.
faker
Faker is a Python package that generates fake data for you.
FEBRL-0.4.2-fork
Fork of the Freely Extensible Biomedical Record Linkage program https://sourceforge.net/projects/febrl
hello-world
Just a test repo
histbook
Versatile, high-performance histogram toolkit for Numpy.
lvm-rsync-backup
Scripts for efficiently and safely backing up a linux system using LVM snapshots and rsync
m-layer-proto
M-Layer concept and digital object register with protocol buffers
messytables
Tools for parsing messy tabular data. (See also successors https://github.com/frictionlessdata/tabulator-py and https://github.com/frictionlessdata/tableschema-py)
physt
P(i/y)thon h(i/y)stograms.
practical-python
Practical Python Programming (course by @dabeaz)
prefect
The Prefect Core automation engine
pyudorandom
Mix up items without guarantees.
tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
transitions
A lightweight, object-oriented finite state machine implementation in Python
vimfiles
Provisioning vim