Alex Holmes (alexholmes)

alexholmes

User data from Github https://github.com/alexholmes

Location:California

Home Page:https://twitter.com/grep_alex

GitHub:@alexholmes

Alex Holmes's repositories

hadoop-book

Source code to accompany the book "Hadoop in Practice", published by Manning.

Language:JavaLicense:Apache-2.0Stargazers:202Issues:42Issues:9

hiped2

Source code that accompanies the book "Hadoop in Practice, Second Edition".

Language:JavaLicense:Apache-2.0Stargazers:79Issues:22Issues:5

vagrant-hadoop-spark-hive

Vagrant project to spin up a single virtual machine running current versions of Hadoop, Hive and Spark

Language:ShellLicense:Apache-2.0Stargazers:74Issues:6Issues:6

hdfs-file-slurper

Utility to easily copy files into HDFS

Language:JavaLicense:Apache-2.0Stargazers:69Issues:13Issues:12

json-mapreduce

InputFormat that can split multi-line JSON

Language:JavaLicense:Apache-2.0Stargazers:49Issues:8Issues:3

avro-maven

A simple example of how to use the Avro Maven plugin to generate Avro sources.

hadoop-utils

A set of Hadoop utilities to make working with Hadoop a little easier.

Language:JavaLicense:Apache-2.0Stargazers:26Issues:9Issues:2

hsync

HDFS rsync-like utility to replicate data between HDFS clusters

htuple

A library to simplify compound field partitioning, sorting and grouping in MapReduce.

Language:JavaLicense:Apache-2.0Stargazers:13Issues:1Issues:6

avro-sorting

Examples of built-in and customizable sorting in Avro and Hadoop.

Language:JavaLicense:Apache-2.0Stargazers:6Issues:2Issues:0
Language:HTMLLicense:NOASSERTIONStargazers:2Issues:2Issues:0

filecrush

Remedy small files by combining them into larger ones.

Language:JavaStargazers:1Issues:1Issues:0

java-external-sort

sort large files in Java

props4j

Use Java Annotations to load properties into your beans

redline

Pure Java Rpm Library

Language:JavaLicense:MITStargazers:1Issues:1Issues:0

storm-trending-words

Quick and dirty trending words example on Storm.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:1Issues:0
Language:JavaStargazers:0Issues:1Issues:0

hdfscompact

A HDFS file compacter.

License:Apache-2.0Stargazers:0Issues:1Issues:0

mleap

MLeap: Deploy ML Pipelines to Production

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0