Doug Balog (dougb)

Company: @valor-engineering

Location: Pittsburgh, PA

Home Page: http://www.balog.net/~doug

Twitter: @6nop

Doug Balog's repositories

djbdns

D. J. Bernstein's DNS servers

Language: C, Stargazers: 1, Issues: 1, Issues: 0

bytecask

Key/value database inspired by Bitcask

Language: Scala, Stargazers: 0, Issues: 0, Issues: 0

data-validator

A tool to validate data built around Apache Spark.

Language: Scala, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

fantasy-football

Choosing a fantasy football team using Spark, Hive, Python, and really just about anything.

Language: Java, Stargazers: 0, Issues: 0, Issues: 0

getting-started

This repository is a getting started guide to Singer.

Language: Makefile, Stargazers: 0, Issues: 0, Issues: 0

hdfs

A native Go client for HDFS

Language: Go, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

imap_tools

Work with email over IMAP

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

kafka-connect-cassandra

Kafka Connect Cassandra Connector. This project includes source/sink connectors for Cassandra to/from Kafka.

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

netqmail

netqmail

Language: C, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

pennsylvania-vaccines

This is a centralized repository for the Pennsylvania Vaccine Updates bots.

Language: JavaScript, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

petuum

SailingLab's Petuum project.

Language: C++, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

Language: Python, Stargazers: 0, Issues: 1, Issues: 1

PowerGraph

PowerGraph: A framework for large-scale machine learning and graph computation.

Language: C++, Stargazers: 0, Issues: 1, Issues: 0

rich

Rich is a Python library for rich text and beautiful formatting in the terminal.

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

scala-chart

Scala Chart Library

Language: Scala, License: LGPL-3.0, Stargazers: 0, Issues: 1, Issues: 0

silt

SILT: A Memory-Efficient, High-Performance Key-Value Store

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

singer-python

Writes the Singer format from Python

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

spark

Mirror of Apache Spark

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 2, Issues: 0

spark-deep-learning

Deep Learning Pipelines for Apache Spark

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

sqlmesh

SQLMesh is a DataOps framework that brings the benefits of DevOps to data teams. It enables data scientists, analysts, and engineers to efficiently run and deploy data transformations written in SQL or Python.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

tap-framework

A framework for rapidly prototyping new Singer taps

Language: Python, Stargazers: 0, Issues: 1, Issues: 0

tap-shopify

Singer.io tap for extracting Shopify data

Language: Python, License: AGPL-3.0, Stargazers: 0, Issues: 0, Issues: 0

wumpus

Wumpus is an information retrieval system developed at the University of Waterloo. Its main purpose is to study issues that arise in the context of indexing dynamic text collections in multi-user environments.

Language: C++, License: GPL-2.0, Stargazers: 0, Issues: 0, Issues: 0