Aaron Binns (aaronbinns)

Location: San Francisco, CA

Aaron Binns's repositories

bacon

Experimenting with Apache Pig.

jbs

Builds Lucene/Solr indexes out of NutchWAX segments and revisit records via Hadoop.

Language: Java | License: Apache-2.0 | Stargazers: 6 | Issues: 0 | Issues: 0

tnh

(T)he (N)ew (H)otness. Improved full-text search of archival web data.

Language: Java | License: Apache-2.0 | Stargazers: 6 | Issues: 0 | Issues: 0

slarpy

(s)o(l)r+(ar)c+(py)thon

Language: Python | Stargazers: 4 | Issues: 0 | Issues: 0

db-deploy

Scripts and stuff to make Databricks deployments easier for MMC customers

Stargazers: 2 | Issues: 0 | Issues: 0

waimea

Full-text indexing pipeline of Pig scripts.

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0

db-repo-path

Demonstration of an approach to access Python modules on workers in a Git repo

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

db-test

Testing rando stuff for Databricks

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

elasticsearch-dump

Import and export tools for elasticsearch

Language: JavaScript | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

heritrix3

Local hacks and patches to IA Heritrix3

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0

ia-hadoop-tools

Clone of the iof ia-hadoop-tools repo, limited to the zipnum branch, with new features for zipnum and cluster merging.

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0

opennlp

Mirror of Apache OpenNLP (Incubating)

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pegasus

VM based deployment for prototyping Big Data tools on Amazon Web Services

Language: Shell | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0

wayback

Fork of IA wayback with some local patches/hacks.

Language: Java | Stargazers: 0 | Issues: 1 | Issues: 0