Mirko Prescha's repositories
spark-zeppelin-docker
docker image with spark and zeppelin
spark-json-to-table
Sample ETL process written in Spark 2.1 using dataset type safety including unittests. Runs on docker image providing spark and zeppelin.
alexa-cookbook
A series of sample code projects to be used for educational purposes during Alexa hackathons and workshops, and as a reference for tutorials and blog posts.
alexa-my-reminders
alexa-skill reading from my s3-bucket
alexa-read-bucket
read text files placed in s3
Algorithms
A collection of algorithms
amazon-redshift-utils
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment
aws-cloudformation-deploy
Copy cloudformation template to s3 and submit --create command if no stack exists or --update if it exists already
cp-helm-charts
The Confluent Platform Helm charts enable you to deploy Confluent Platform services on Kubernetes for development, test, and proof of concept environments.
dbt-external-tables
dbt macros to stage external sources
iglu-central
Contains all JSON Schemas, Avros and Thrifts for Iglu Central
jekyll-simpleyyt
myDataBlog
kafka-connect-ftp
A Kafka Connect Source for FTP servers - Monitors files on an FTP server and feeds changes into Kafka
kpi_retention
Spark(Scala) App to join tab delimited master data with events and do some aggregations
moto
Moto is a library that allows your python tests to easily mock out the boto library
mySnippets
useful snippets for me
pandas-videos
Jupyter notebook and datasets from the pandas Q&A video series
scala-course
my snippets created during coursera scala course
scala-school
examples used in the scala school sessions
snowplow
Cloud-native web, mobile and event analytics, running on AWS and GCP
Spark-Array-Relationalizer
SPark based Array eXPLODER - relationalize any array values into separate rows
spark-json-schema
JSON schema parser for Apache Spark