twitter / sbf

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SBF (Sparse Binary Factorization)

SBF is a solution to the community discovery problem from undirected graphs that uses Metropolis-Hastings sampling (you don't need to know anything about that in order to use this package). The discovered communities can be overlapping or disjoint. The implementation runs faster than and scales to bigger graphs than any other community discovery packages we have tried - for example, it has been used to discover 500,000 communities from a graph with 100 Million nodes and 5 Billion edges in less than 2 hours using 16 threads.

Quick Start

After cloning the repo, build as follows (for the impatient, use option -DskipTests to skip tests):

$ mvn package 

If you don't have maven on your computer, you'll need to first install that.

You can see two examples in the examples folder. For example, unweighted8node.config has the config needed to run on the simple 8-node (unweighted) graph specified in unweighted8node.txt. You can run it as follows, assuming mvn package succeeded.

$ cd examples
$ java -jar ../target/sbf-1.0.0.jar unweighted8node.config

Once the application finishes, you can see the output in unweighted8node.assignments.

Similarly, you can also run it using weighted8node.config.

examples/runExamples.sh does all of this.

About

License:Apache License 2.0


Languages

Language:Java 100.0%