DOC PREVIEW
MIT ESD 342 - Basic Network Metrics

This preview shows page 1-2-14-15-29-30 out of 30 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 30 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 30 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 30 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 30 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 30 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 30 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 30 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

IntroductionDegree CorrelationJoint Degree DistributionK-nearest NeighborsPearson Degree CorrelationRich Club MetricNetwork RewiringGenerating a Graph with a Specified Degree SequenceBetween-nessFinding CommunitiesUsing Mo-Han Hsieh's Newman-Girvan programExamples: Random Graph and V-8 EngineAppendix 1: Calculating Pearson Degree CorrelationReferencesBasic Network Metrics Notes by Daniel Whitney January 5, 2008 Introduction Researchers are constantly seeking metrics to characterize networks (and other complex things) as a way of, or in the hope of, summarizing complex situations and behaviors in a convenient and compact way. As indicated below, this does not always work, and we should not be surprised at this. Interest in structural metrics has been strong in the network research community in part because networks tend to be very large and in need of some kind of summary characterization. Another reason is the hope that structure, captured conveniently in metrics, can be linked to observable or even hidden behaviors. Interest has been very strong in understanding the internet, by far the largest and potentially most interesting network in existence. The structure of this network has been studied for at least 10 years and different researchers have come up with different proposed structures. There is at present no way to validate these different proposals. Several methods have been used to estimate its degree sequence and most researchers agree that it follows a power law: p(k)~k−γ where γ~2.5−3. [Barabasi and Albert] Since such a degree distribution implies a few nodes of very high degree, some researchers concluded that these nodes linked preferentially to each other, creating a network of centrally located hubs, which presumably were major servers. However, it has emerged that while structure drives structural metrics, the metrics themselves imply little or nothing about the actual structure, and networks with highly different structure can have the same degree distribution, average nodal degree, and power law coefficient γ. [Li, Alderson, et al] In these notes, we describe a number of metrics that seek to characterize networks in one way or another. Some of these provide metrics about individual nodes whereas others return a value that applies only to the network as a whole. Some of the simplest metrics are given in the notes called Network Models and Basic Operations. These include the number of nodes, number of edges, and average nodal degree. In the present notes, the metrics relate to structural topics such as how nodes of different degrees tend to link to each other or tend to cluster into relatively tightly connected subregions of the network even while the network as a whole remains connected. Also included in these notes are methods for generating graphs that have specified degree sequences or have degree sequences that are characterized statistically as being drawn from an ensemble with a given probabilistic distribution.Degree Correlation Degree correlation is a basic structural metric that calculates the likelihood that nodes link to nodes of similar or dissimilar nodal degree. The former case is called positive degree correlation while the latter is called negative degree correlation. In the social sciences, a network with positive degree correlation is called assortative while one with negative degree correlation is called dis-assortative. Three ways of characterizing the amount of degree correlation are used, each containing less detail and expressing the result in more compact terms. These are the joint degree distribution, the k-nearest neighbors metric, and the Pearson Degree Correlation. These are discussed below in turn. Joint Degree Distribution The joint degree distribution examines each pair of connected nodes and notes their respective nodal degrees. Each time a pair has degrees k1,k2(), one is added to the entry of the matrix JDD. If 12 nodes of degree 3, for example, are linked, then there will be a 12 in the (3 entry in JDD. If many nodes of similar degree are linked, then JDD will have large entries along and near its diagonal. Conversely, if many node of dissimilar degree are linked, then JDD will have large entries on the anti-diagonal. k1,k2(),3) The text of the JDD routine JDD1 is below: function [JDD]=JDD1(A,name) %computes joint degree dist of A kvA=kvec(A); kmax=max(kvA); JDD=zeros(kmax); N=size(A,1); M=sum(kvA); for i = 1:N for j=1:N if A(i,j)==1 k1=kvA(i); k2=kvA(j); JDD(k1,k2)=JDD(k1,k2)+1; end end end JDD=JDD/M; figure;pcolor(JDD);colorbar shading interp xlabel('k1') ylabel('k2') title(['Joint Degree Distribution for ',name]) Here is code that creates a simple undirected network from a nodelist file, plots it (see result in Figure 1), and calculates its JDD (see result in Figure 2): >> test3=dlmread('testmatrix3.csv')test3 = 1 2 3 5 2 1 4 5 3 1 0 0 4 2 5 0 5 1 2 4 >> test3adj=adjbuildn(test3) test3adj = 0 1 1 0 1 1 0 0 1 1 1 0 0 0 0 0 1 0 0 1 1 1 0 1 0 >> CNet(test3adj) %CNet plots a network with node 1 at 3 o’clock, nodes numbered counter-clockwise >> JDD1(test3adj,'test3adj') ans = 0 0 0.0833 0 0 0.1667 0.0833 0.1667 0.5000 >> Figure 1. A Simple Undirected Network (test3adj)Figure 2. The JDD for the Network in Figure 1. In this figure, larger entries in matrix JDD get redder colors. K-nearest Neighbors Routine knn calculates the k-nearest neighbor statistic. Instead of recording every pair of nodes, as JDD does, knn simply averages the degrees of the neighbors of each node of a given degree and plots the results as linear, semi-log, and log-log plots. If a degree is missing, it is skipped in the graph. The result (linear graph only) of knn on test3adj is shown in Figure 3. The Matlab output is below, consisting of the neighbor degree averages for nodes of degree 1, 2, and 3 respectively. >> knn(test3adj,'test3adj') ans = 3.0000 3.0000 2.5556Figure 3. knn for test3adj. On average, nodes of degree = 1 link to nodes of degree = 3. The same is true for nodes of degree = 2. Nodes of degree = 3 link on average no nodes of degree = 2.55. If knn rises as nodal degree rises, this indicates that nodes of


View Full Document

MIT ESD 342 - Basic Network Metrics

Documents in this Course
Load more
Download Basic Network Metrics
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Basic Network Metrics and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Basic Network Metrics 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?