gSpan

gSpan algorithm in data mining

Features:

Support basic sub-structure mining.
Target: frequent subgraphs, connected, labeled, undirected, single-machine, embedded instead of induced
Require: all graphs can be stored in memory(if not, algorithm may have to change, the memory cost usually < 1G)

Usage:

use make to generate the executable, that is, dig:

make

run on your graph.data with the support and you will get the result:

./dig file 0.07 [ans.txt]

0.1 for big graph, while 0.5 for small graph

Notice:

Notice the difference between > and >= when considering minsup, and the float2integer is also a problem! this may cause result different, but this is not our fault!

TODO:

bugs exist(less results)
induced version
distributed version

LOGS: 11/01/2016

finish the basic version of this project
do some optimizations

Reference:

http://www.cs.ucsb.edu/~xyan/software/gSpan.htm
http://www.tuicool.com/articles/FvaEJju
source codes on github

About

implement of gSpan algorithm, to search frequent graph patterns

GNU General Public License v3.0

Languages

Language:GAP 99.0%Language:C++ 0.9%Language:Makefile 0.0%Language:Shell 0.0%