Matt Burgess's repositories
pdi-html-to-xml-plugin
A plugin for Pentaho Data Integration that uses JTidy to parse HTML into XHTML/XML
pdi-memcached-plugin
Plugins for Pentaho Data Integration (PDI) allowing reading from and writing to memcached
pdi-neo4j-query
Plugin(s) for Pentaho Data Integration (PDI) to enable queries of the Neo4j graph database
blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
cdc
Community Distributed Cache
crash
Common ReusAble SHell
csv-jdbc-plugin
A plugin for Pentaho products that allows JDBC access to CSV files (see http://csvjdbc.sourceforge.net/)
ferry
Ferry lets you define, run, and deploy big data applications on your local machine using Docker
github-api
Java API for GitHub
gradle-console
Opens a console window that allows the user to play with his project code and dependencies
groovy-vfs
A DSL for Groovy on top of Apache VFS2
hbase
Mirror of Apache Hadoop HBase
hw-sandbox-storm-provision
Storm provisioning for Hortonworks Sandbox 2.0 VM
kettle-spock
A collection of Spock tests for testing Pentaho Data Integration (aka Kettle)
log-synth
Generates more or less realistic log data for testing simple aggregation queries.
optiq
Dynamic data management framework
parquet-format
Columnar file format for hadoop
pdi-annotations
An annotation processor for easily creating plugins for Pentaho Data Integration (PDI)
pdi-detect-change
A plugin for Pentaho Data Integration that detects changes in fields for incoming rows
pdi-log-synth-plugin
A plugin for Pentaho Data Integration that uses log-synth to generate rows
pdi-random-row-distributions
A set of Row Distribution plugins for Pentaho Data Integration that use probabilities for row distribution (Markov Chain, e.g.)
pdi-socket-row-plugin
A set of PDI step plugins to produce and consume rows via a socket transport
pdi-valuemeta-map
A plugin that allows collections of key/value pairs (a map) to be used as a value type in Pentaho Data Integration
pdi-vfs
Kettle-specific implementation of Apache VFS
pentaho-coding-standards
Repository for IDE specific code-style formatters as well as the master CheckStyle template
ProxyChain
A Java project aimed at injecting behavior into black-box object systems (JDBC drivers, e.g.)