peter's starred repositories
numerical-linear-algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
vim-anywhere
Use Vim everywhere you've always wanted to
mathematics_dataset
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
PyHamcrest
Hamcrest matchers for Python
pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
spark-structured-streaming-internals
The Internals of Spark Structured Streaming
PySpark-Boilerplate
A boilerplate for writing PySpark Jobs
spark-style-guide
Spark style guide
followthemoney
Data model and processing tools for investigative entity data
machine-failure-detection
PCA- and DBSCAN-based anomaly and outlier detection method for time series data.
azure-apim-deployment-utils
Python utilities to extract, update and deploy to and from Azure API Management instances
greenbutton-python
Python parser for ESPI ("Green Button") files.
docker-aci-workshop
Docker and Azure Container Instances workshop
adv-diagnostics
Course repository for XBUS-511 - Diagnostics for More Informed Machine Learning
intro-to-dl
Course repository for XBUS-512 - Introduction to AI and Deep Learning
music-mining
Datasets and analysis for recordings that have charted globally and been nominated for a Grammy.