zhangyf-neu / i2mapreduce

incremental MapReduce for mining evolving big data

i2mapreduce

We extended Hadoop and developed i2MapReduce, which supports

  • fine-grained incremental processing (at the key-value pair level, rather than the task level as in Incoop)
  • general-purpose iterative processing (extending the original iMapReduce to support more iterative algorithms)
  • incremental processing for iterative computation.
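
To give a rough picture of what incremental processing at the key-value pair level can look like, here is a minimal single-process sketch in plain Java. It is not the i2MapReduce API and does not use Hadoop. The class `KvIncrementalSketch`, its methods, and the word-count job are all hypothetical and chosen only for illustration. The sketch preserves the intermediate key-value pairs emitted for each input record, and when one record changes it re-runs the reduce function only for the keys that record touched.

```java
import java.util.*;

// Hypothetical illustration only; none of these names come from i2MapReduce.
class KvIncrementalSketch {
    // Preserved intermediate state: reduce key -> (input record id -> value emitted for it).
    private final Map<String, Map<Integer, Integer>> preserved = new HashMap<>();
    // Which reduce keys each input record contributed to, so old pairs can be retracted.
    private final Map<Integer, Set<String>> keysByRecord = new HashMap<>();
    // Current job output: reduce key -> reduced value.
    private final Map<String, Integer> output = new TreeMap<>();

    // Map function of a word-count job: one (word, count-in-line) pair per distinct word.
    private static Map<String, Integer> map(String line) {
        Map<String, Integer> emitted = new HashMap<>();
        for (String word : line.trim().split("\\s+")) {
            if (!word.isEmpty()) emitted.merge(word, 1, Integer::sum);
        }
        return emitted;
    }

    // Reduce function: sum all values preserved for one key.
    private static int reduce(Collection<Integer> values) {
        int sum = 0;
        for (int v : values) sum += v;
        return sum;
    }

    // Insert, update (newLine != null) or delete (newLine == null) one input record.
    // Returns the reduce keys that had to be recomputed.
    Set<String> apply(int recordId, String newLine) {
        Set<String> affected = new TreeSet<>();
        // Retract the pairs the old version of this record emitted.
        Set<String> oldKeys = keysByRecord.remove(recordId);
        if (oldKeys != null) {
            for (String key : oldKeys) {
                preserved.get(key).remove(recordId);
                affected.add(key);
            }
        }
        // Run map on the new version only and preserve its pairs.
        if (newLine != null) {
            Map<String, Integer> emitted = map(newLine);
            keysByRecord.put(recordId, new HashSet<>(emitted.keySet()));
            for (Map.Entry<String, Integer> e : emitted.entrySet()) {
                preserved.computeIfAbsent(e.getKey(), k -> new HashMap<>())
                         .put(recordId, e.getValue());
                affected.add(e.getKey());
            }
        }
        // Re-run reduce for the affected keys only; all other output is reused.
        for (String key : affected) {
            Map<Integer, Integer> pairs = preserved.get(key);
            if (pairs == null || pairs.isEmpty()) {
                preserved.remove(key);
                output.remove(key);
            } else {
                output.put(key, reduce(pairs.values()));
            }
        }
        return affected;
    }

    Map<String, Integer> output() {
        return Collections.unmodifiableMap(output);
    }

    public static void main(String[] args) {
        KvIncrementalSketch job = new KvIncrementalSketch();
        job.apply(1, "a b a");
        job.apply(2, "b c");
        // Record 2 changes; only the keys it touched are recomputed.
        Set<String> affected = job.apply(2, "c d");
        System.out.println("re-reduced keys: " + affected);
        System.out.println("output: " + job.output());
    }
}
```

In this run the update to record 2 recomputes only the keys `b`, `c` and `d`. The result for `a` is reused as-is, whereas a task-level scheme would redo the whole task that contains the changed record.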

For more information, please refer to our i2MapReduce paper in TKDE 2015.

  • Yanfeng Zhang, Shimin Chen, Qiang Wang, Ge Yu. i2MapReduce: Incremental MapReduce for Mining Evolving Big Data [J]. IEEE Transactions on Knowledge and Data Engineering (TKDE), 27(7), July, 2015, pp. 1906-1919.

This project is a prototype implementation of the i2MapReduce idea, based on Hadoop 1.0.3. Only the src directory is committed. To run i2MapReduce, compile it the same way you would compile Hadoop on your own machine. The prototype is intended for research use; we do not recommend using it in production.

The sample-runshell directory contains shell scripts for running the PageRank example: naive PageRank, iterative PageRank, PageRank data update (randomly generated changes), and incremental PageRank.
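
The scripts themselves are not described further here. As a rough illustration of why an incremental PageRank run can be cheaper than a naive one, the following single-process Java sketch recomputes PageRank after a small graph change twice: once from default ranks and once starting from the previously converged ranks. Everything in it is an assumption made for illustration (the class `PageRankWarmStartSketch`, the tiny graph, the unnormalized update rule with damping 0.85, the convergence threshold). It does not reproduce the Hadoop jobs the scripts launch, and reusing converged state is only one ingredient of incremental iterative processing.

```java
import java.util.*;

// Hypothetical illustration only; the sample scripts run Hadoop jobs instead.
class PageRankWarmStartSketch {
    static final double DAMPING = 0.85;   // assumed damping factor
    static final double EPSILON = 1e-9;   // assumed convergence threshold (L1 change)

    static class Result {
        final Map<Integer, Double> ranks;
        final int iterations;
        Result(Map<Integer, Double> ranks, int iterations) {
            this.ranks = ranks;
            this.iterations = iterations;
        }
    }

    // One iteration. The "map" part spreads each node's rank over its out-links;
    // the "reduce" part sums the contributions per destination and applies damping.
    // Every destination node is expected to also appear as a key of the graph.
    static Map<Integer, Double> step(Map<Integer, int[]> graph, Map<Integer, Double> ranks) {
        Map<Integer, Double> sums = new HashMap<>();
        for (Map.Entry<Integer, int[]> e : graph.entrySet()) {
            int[] outs = e.getValue();
            if (outs.length == 0) continue;
            double share = ranks.get(e.getKey()) / outs.length;
            for (int dst : outs) sums.merge(dst, share, Double::sum);
        }
        Map<Integer, Double> next = new TreeMap<>();
        for (Integer node : graph.keySet()) {
            next.put(node, (1 - DAMPING) + DAMPING * sums.getOrDefault(node, 0.0));
        }
        return next;
    }

    // Iterate until the total rank change drops below EPSILON.
    // Nodes missing from `start` begin with rank 1.0.
    static Result run(Map<Integer, int[]> graph, Map<Integer, Double> start) {
        Map<Integer, Double> ranks = new TreeMap<>();
        for (Integer node : graph.keySet()) ranks.put(node, start.getOrDefault(node, 1.0));
        int iterations = 0;
        while (true) {
            Map<Integer, Double> next = step(graph, ranks);
            iterations++;
            double delta = 0;
            for (Integer node : graph.keySet()) delta += Math.abs(next.get(node) - ranks.get(node));
            ranks = next;
            if (delta < EPSILON) return new Result(ranks, iterations);
        }
    }

    public static void main(String[] args) {
        Map<Integer, int[]> graph = new HashMap<>();
        graph.put(0, new int[] {1, 2});
        graph.put(1, new int[] {2});
        graph.put(2, new int[] {0});
        graph.put(3, new int[] {2});
        Result base = run(graph, new HashMap<>());

        // A small change to the graph: node 3 gains a link to node 0.
        graph.put(3, new int[] {0, 2});
        Result cold = run(graph, new HashMap<>()); // naive: start over
        Result warm = run(graph, base.ranks);      // reuse the converged ranks

        System.out.println("cold-start iterations: " + cold.iterations);
        System.out.println("warm-start iterations: " + warm.iterations);
        System.out.println("ranks: " + warm.ranks);
    }
}
```

Both runs converge to the same ranks, because the update rule has a single fixed point for a given graph. The starting point only affects how many iterations are needed to get there.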
