tombinic / DarkWeb

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DarkWeb Graph Analysis 🌐

alt text

Network Image

Table of Contents 📖

Overview 👀

Graph theory has long been a favored tool for analyzing social relationships as well as quantifying engineering properties such as searchability. For both reasons, there have been numerous graph-theoretic analyses of the World Wide Web (www) from the seminal to the modern. These analysis have been repeated for the notorious “darkweb”. The darkweb is sometimes loosely defined as “anything seedy on the Internet”, but we define the darkweb strictly, as simply all domains underneath the “.onion” psuedo-top-level-domain, i.e., we define the darkweb to be synonymous with the onionweb.

The graph represents the DarkWeb network:
  • A node represents a domain.
  • An edge (u to v) means there exists a page within domain u linking to a page within domain v.
  • The weight of the edge from u to v is the number of pages on domain u linking to pages on domain v.

Project Folders Description 💾

  • Dataset: in this folder there are some useful csv files:
    • Darkweb.csv: this is the darkweb graph, which is composed by 4 column (Source, Target, Type, Weight)
    • Darkweb_scc.csv: the same as before with some added info like Id and Label (exported from Gephi)
  • Gephi: in this folder there is Gephi project that you can import in the homonym software
  • SourceCodePython: in this folder there is Python source code that is necessary to run most project analysis
  • SourceCodeMatlab: in this folder there is Matlab source code that is necessary to run community analysis, especially all what concerns persistance probabilities
  • Results: in this folder you can find all results carried out by the whole analysis process. Results from Gephi, Python and Matlab
  • Presentation: in this folder you can find the pptx file related to the project presentation
  • Paper: in this folder you can find the original paper related to this project in which my analysis is based on

Graph Analysis 🔸

Some initial stats:

Network Image

Network Image

Obviously, the DarkWeb is a very danger zone: so in this project what I would to point out is understanting what is the potential way to destroy it. We can analyze what is the network response regarding failures (random attacks) and targeted attacks: the metrics taken into account are the size of the largest connected component and the efficiency value.

Network Image

Network Image

If you want to deep into the whole analysis you are free to explore the entire repo. Have fun! 🎆

References 🔸

[1] Dataset DarkWeb (Dataset DarkWeb)
[2] Graph Theoretic Properties of the Darkweb, V. Griffith, Y. Xu, C. Ratti, 2017. (arxiv.org)
[3] Finding and testing network communities by lumped Markov chains, C. Piccardi, 2011. (journals.plos.org)
[4] Profiling core-periphery network structure by random walkers, F. Della Rossa, F. Dercole, C. Piccardi, 2013. (nature.com)

About


Languages

Language:MATLAB 55.2%Language:Python 44.8%