Patrick Duin (patduin)

patduin

Geek Repo

Company:@HotelsDotCom @ExpediaGroup

Location:Krakow

Github PK Tool:Github PK Tool

Patrick Duin's repositories

GameOfLife

Playing around and learning scala

Language:ScalaLicense:NOASSERTIONStargazers:1Issues:1Issues:0

aws-glue-data-catalog-client-for-apache-hive-metastore

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cascading

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows on a Hadoop cluster. See https://github.com/Cascading/cascading for the release repository.

Language:JavaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

corc

An ORC File Scheme for the Cascading data processing platform.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

jdeb

This library provides an Ant task and a Maven plugin to create Debian packages from Java builds in a truly cross platform manner.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

plunger

A unit testing framework for the Cascading data processing platform.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0