Georg Walther's repositories
distributable_docker_sql_on_hadoop
Toy Hadoop cluster combining various SQL-on-Hadoop variants
distributed_docker_hadoop
Toy example of a distributed Hadoop cluster
backup.waltherg.github.io
Personal website.
industry-machine-learning
A curated list of applied machine learning and data science notebooks and libraries across different industries.
libextract
Extract data from websites using basic statistical magic
non-smoking-berlin
Places in Berlin that are non-smoking
scikit-learn
scikit-learn: machine learning in Python
dataproc-initialization-actions
Initialization actions run on all nodes of your cluster before the cluster starts, letting you customize it
DeepLearningMovies
Kaggle's competition for using Google's word2vec package for sentiment analysis
glove-python
Toy Python implementation of GloVe (http://www-nlp.stanford.edu/projects/glove/)
hadoop-docker
Hadoop Docker image
incubator-airflow
Apache Airflow (Incubating)
katacoda-scenarios
Katacoda Scenarios
pandoc-book-template
A simple Pandoc template to build documents and ebooks.
phusion-anaconda
A Docker image based on phusion/baseimage that bundles some nice dotfiles and the Anaconda Python distribution
pytest-cookies
The Pytest Plugin for your Cookiecutters
pytorch-forecasting
Time series forecasting with PyTorch
ubuntu-browsers
A recent version of Ubuntu packaged with various browsers.
wetterdienst
Open weather data for humans