NNetwork
is a Custom graph/network/multi-weighted network class optimized for scalable subgraph sampling and searching algorithms. NNetwork stores a dictionary that maps each node to a list of its neighbors to allow for O(1) access for finding neighbors.
The efficiency of neighbor access is import for sampling algorithm such as random walks and Markov chain Monte Carlo motif sampling on graphs, which rely on accessing neighborhood information at every iteration of sampling. In comparison, many packages rely on calculations involving powers of adjacency matrices to calculate random walks of length k.
The default class of NNetwork
encodes a network with weighted edges, which can also have list-valued edge weights as its 'color'.
Update for 0.1.0:
Built-in functions contain sampling algorithms for mesoscale network patches using various MCMC motif sampling algorithms [1]. At stationary distribution, it computes a uniformly chosen k-walk in the graph, which can optionally enforced to be non-backtraking, and the induced adjacency pattern is returned as a k x k matrix. Algorithimically, a given k-walk is randomly updated using a suitable MCMC algorithm. The so-computed k x k mesoscale patches
are basis of subgraph analysis and network dictionary learning in [2].
By Josh Vendrow and Hanbaek Lyu
To install NNetwork, run this command in your terminal:
$ pip install -U NNetwork
This is the preferred method to install NNetwork, as it will always install the most recent stable release.
If you don't have pip installed, these installation instructions can guide you through the process.
Undirected Graphs
Create an undirected (weighted) graph from an edgelist:
>>> from NNetwork import NNetwork
>>> edgelist = [[1,2],[2,3],[3,4]]
>>> G = NNetwork()
>>> G.add_edges(edgelist)
>>> G.has_edge(2,3)
True
>>> G.get_edge_weight(2,3)
1
Get the neighbors of a node:
>>> G.neighbors(3)
[2,4]
Find the intersection of edges with another network:
>>> edgelist2 = [[2,3],[3,4],[5,7]]
>>> G2 = NNetwork()
>>> G2.add_edges(edgelist2)
>>> G.intersection(G2)
[[2,3],[3,4]]
Weighted Graphs
Create a weighted graph from an edgelist:
>>> from NNetwork import NNetwork
>>> edgelist = [[1,2,0.5],[2,3,0.8]]]
>>> G = NNetwork()
>>> G.add_wtd_edges(edgelist)
>>> G.get_edge_weight([2,3])
0.8
Convert weighted graph to an unweighed graph by thresholding
>>> G_simple = G.threshold2simple(0.7)
>>> G_simple.edges()
[[2,3]]
Mesoscale patch computation
>>> edgelist = [[1,2],[2,3],[1,3],[1,4],[1,5]]
>>> G = nn.NNetwork()
>>> G.add_edges(edgelist)
>>> print(G.vertices)
['1', '2', '3', '4', '5']
>>> print(G.edges)
{"['1', '2']": 1, "['2', '1']": 1, "['2', '3']": 1, "['3', '2']": 1, "['1', '3']": 1, "['3', '1']": 1, "['1', '4']": 1, "['4', '1']": 1, "['1', '5']": 1, "['5', '1']": 1}
>>> X, embs = G.get_patches(k=3, sample_size=4, skip_folded_hom=False)
>>> print(X) # each column is a vectorizaiton of k x k induced adjacency matrix
array([[0., 0., 0., 0.],
[1., 1., 1., 1.],
[0., 1., 0., 0.],
[1., 1., 1., 1.],
[0., 0., 0., 0.],
[1., 1., 1., 1.],
[0., 1., 0., 0.],
[1., 1., 1., 1.],
[0., 0., 0., 0.]])
>>> print(embs) # four consecutive 3-walks in G
[array(['2', '3', '2'], dtype='<U32'),
array(['1', '2', '3'], dtype='<U32'),
array(['4', '1', '4'], dtype='<U32'),
array(['4', '1', '4'], dtype='<U32')]
If you use our work in an academic setting, please cite our papers:
[1] Hanbaek Lyu, Facundo Memoli, and David Sivakoff, “Sampling random graph homomorphisms and applications to network data analysis.” https://arxiv.org/abs/1910.09483 (2019)
[2] Hanbaek Lyu, Yacoub Kureh, Joshua Vendrow, and Mason A. Porter, “Learning low-rank mesoscale structures of networks” https://arxiv.org/abs/2102.06984 (2021)
See CONTRIBUTING.md for information related to developing the code.
master
is for the most up-to-date development, very rarely should you directly commit to this branch. Your day-to-day work should exist on branches separate frommaster
. It is recommended to commit to development branches and make pull requests to master.4. It is recommended to use "Squash and Merge" commits when committing PR's. It makes each set of changes tomaster
atomic and as a side effect naturally encourages small well defined PR's.
-
Create an initial release to test.PyPI and PyPI.
- Follow This PyPA tutorial, starting from the "Generating distribution archives" section.
-
Create a blank github repository (without a README or .gitignore) and push the code to it.
-
Delete these setup instructions from
README.md
when you are finished with them.