jwittbold's repositories

gdelt-gkg-databricks

ETL Pipeline to ingest and transform GDELT GKG 2.0 records

Language: Python | Stargazers: 3 | Issues: 1 | Issues: 0

gdelt-gkg

GDELT Global Knowledge Graph Pipeline

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

airflow-log-analyzer

A simple Python script that analyzes Airflow log files and returns the errors found in them.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

hadoop_streaming_mapreduce

Basic Hadoop Streaming MapReduce project

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

hdinsight-spark-miniproject

Deploying HDInsight Spark Cluster on Azure

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

mini_pipeline

Python ETL script.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

riskybank

A simple mock ATM/Banking program

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

spring_capital

ETL pipeline for stock data using Spark on Azure

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

autoinc-hdfs-spark

Refactoring a MapReduce project to utilize Spark on HDFS.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

azure-data-factory-ELT

Working with Azure Data Factory ELT

Stargazers: 0 | Issues: 0 | Issues: 0

dsc_intro

Exercises to accompany the free Springboard introductory data science "taster" course.

Stargazers: 0 | Issues: 0 | Issues: 0

euro_cup_2016_mini_project

SQL queries for euro_cup_2016 mini project

Stargazers: 0 | Issues: 0 | Issues: 0

gdelt

Exploring GDELT

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

spark-optimization

Optimizing a Spark SQL query

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

SQL_optimization_mini_project

Optimization of six SQL queries

Stargazers: 0 | Issues: 0 | Issues: 0