zyc1314 / konect-extr

Network dataset extraction library – part of the KONECT project by Jérôme Kunegis, University of Namur

Home Page:http://konect.cc/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

KONECT Extraction
=================

This collection of network extraction tools is part of the KONECT
project, i.e., the Koblenz Network Collection, a network analysis
project by Jérôme Kunegis.

http://konect.cc/ 

This package contains code for generating the network datasets used in
KONECT.

In each subdirectory of extr/ you can find code to extract one group of
datasets, usually from one source, and generate a TSV file from it.  If
you are looking for the code for a specific network, you can find the
name of the subirectory in the field "extr" of the meta.* file, which
comes with the TSV files you can download from our network pages.

Please note that not every network has the extraction code publically
available.  This is only the code we can make available.  Also, many
datasets have been extracted so long ago that we don't have the code
anymore.  Also, some datasets have been contributed by people that
didn't give us their code.  (If you contribute a dataset to KONECT, the
best is to give us a self-contained directory that uses Stu to generate
the dataset in the correct format.)

Keep in mind that most code in here WAS EXECUTED JUST ONCE.  The code is
of corresponding quality.  We provide the code for the purpose of full
disclosure -- not in order to deliver a full-features extraction
library. 

Some directories here also contain code for analysing the datasets,
which has been used in a few papers.  


Usage
=====

To build the datasets, execute "make" or "stu" inside each subdirectory
of extr/, depending on whether a 'Makefile' or a 'main.stu' is present.
The code downloads the datasets from their online sources and converts
them to the KONECT format.

Note that many directories have additional requirements.  Read the
'Makefile' or 'main.stu' (or rarely, 'README') for more information. 


Requirements
============

You may need to install the following additional software packages,
depending on the dataset: 

    - Stu:  https://github.com/kunegis/stu/
    - Perl 5
    - Some other tools; depending on the dataset


Extraction Tools
================

The directories sh/ and c/ contain general-purpose functions for
transforming data into the KONECT format.  They are used by various
directories under extr/, and you may also use them to extract your own
software.


Contributing
============

See CONTRIBUTING. 

If you use and like KONECT, please consider contributing to it by making a
donation to:

https://www.paypal.com/donate/?hosted_button_id=Q9JY2FB3AFHR6	


License
=======

The extraction code and library is distributed under the terms of the
GNU General Public License version 3, which you can find provided in the
file 'COPYING' in this directory. 


Stu
===

We're converting the directories to Stu, a replacement for Make, also
written by Jérôme Kunegis.  Stu can be installed from

https://github.com/kunegis/stu/

Stu targets:

default target:  generate all data.

@deploy:  copy over to konect/.  These copy the relevant files into
konect/dat/, but usually don't create the symlinks in uni/.


Code Quality
============

Most code in the directories is broken, because it was last executed
just once, and that was a long time ago in most cases.  Many things have
changed in the time.  If you want to execute any of this code, you'll
most likely need to hack it.  

In particular:

- The URL from which KONECT gets the data is not available anymore. 
- Scripts have been moved to sh/, but uses of it have not been updated
- Extraction directories are now all under extr/, but code has not been
  changed.
- Some code is really old and was written long before I learned proper
  practices for writing shell scripts, etc.

Also, some directories use Stu and older ones Make.  

If you need help, please politely ask Jérôme Kunegis
<konectproject@outlook.com> and he may update the code to work.  (Up to
websites that have disappeared; Jérôme can't do anything about those ;)

About

Network dataset extraction library – part of the KONECT project by Jérôme Kunegis, University of Namur

http://konect.cc/

License:GNU General Public License v3.0


Languages

Language:MATLAB 31.9%Language:Perl 27.2%Language:Shell 14.9%Language:Makefile 11.0%Language:TeX 10.6%Language:C++ 2.9%Language:Java 1.0%Language:sed 0.3%Language:Python 0.2%