Top Banner
Lecture 6: Network centrality Slides are modified from Lada Adamic
34
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Lecture 6:

Network centrality

Slides are modified from Lada Adamic

Page 2: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Measures and Metrics

Knowing the structure of a network, we can calculate various useful quantities or measures that capture particular features of the network topology. basis of most of such measures are from social network analysis

So far, Degree distribution, Average path length, Density

Centrality Degree, Eigenvector, Katz, PageRank, Hubs, Closeness,

Betweenness, ….

Several other graph metrics Clustering coefficient, Assortativity, Modularity, …

2

Page 3: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Characterizing networks:Who is most central?

?

?

?

3

Page 4: Lecture 6: Network centrality Slides are modified from Lada Adamic.

network centrality

Which nodes are most ‘central’?

Definition of ‘central’ varies by context/purpose

Local measure: degree

Relative to rest of network: closeness, betweenness, eigenvector (Bonacich power

centrality), Katz, PageRank, …

How evenly is centrality distributed among nodes? Centralization, hubs and autthorities, …

4

Page 5: Lecture 6: Network centrality Slides are modified from Lada Adamic.

centrality: who’s important based on their network position

indegree

In each of the following networks, X has higher centrality than Y according to

a particular measure

outdegree betweenness closeness

5

Page 6: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Outline

Degree centrality Centralization

Betweenness centrality Closeness centrality

Eigenvector centrality Bonacich power centrality

Katz centrality PageRank Hubs and Authorities

6

Page 7: Lecture 6: Network centrality Slides are modified from Lada Adamic.

He who has many friends is most important.

degree centrality (undirected)

When is the number of connections the best centrality measure?o people who will do favors for youo people you can talk to (influence set, information access, …)o influence of an article in terms of citations (using in-degree)

7

Page 8: Lecture 6: Network centrality Slides are modified from Lada Adamic.

degree: normalized degree centrality

divide by the max. possible, i.e. (N-1)

8

Page 9: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Prestige in directed social networks

when ‘prestige’ may be the right word admiration influence gift-giving trust

directionality especially important in instances where ties may not be reciprocated (e.g. dining partners choice network)

when ‘prestige’ may not be the right word gives advice to (can reverse direction) gives orders to (- ” -) lends money to (- ” -) dislikes distrusts

9

Page 10: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Extensions of undirected degree centrality - prestige

degree centrality indegree centrality

a paper that is cited by many others has high prestige a person nominated by many others for a reward has high prestige

10

Page 11: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Freeman’s general formula for centralization: (can use other metrics, e.g. gini coefficient or standard deviation)

CD CD (n

*) CD (i) i1

g[(N 1)(N 2)]

centralization: how equal are the nodes?

How much variation is there in the centrality scores among the nodes?

maximum value in the network

11

Page 12: Lecture 6: Network centrality Slides are modified from Lada Adamic.

degree centralization examples

CD = 0.167

CD = 0.167CD = 1.0

12

Page 13: Lecture 6: Network centrality Slides are modified from Lada Adamic.

degree centralization examples

example financial trading networks

high centralization: one node trading with many others

low centralization: trades are more evenly distributed

13

Page 14: Lecture 6: Network centrality Slides are modified from Lada Adamic.

when degree isn’t everything

In what ways does degree fail to capture centrality in the following graphs?

ability to broker between groups likelihood that information originating anywhere in the

network reaches you…

14

Page 15: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Outline

Degree centrality Centralization

Betweenness centrality

Closeness centrality

15

Page 16: Lecture 6: Network centrality Slides are modified from Lada Adamic.

betweenness: another centrality measure

intuition: how many pairs of individuals would have to go through you in order to reach one another in the minimum number of hops?

who has higher betweenness, X or Y?

XY

16

Page 17: Lecture 6: Network centrality Slides are modified from Lada Adamic.

CB (i) g jk (i) /g jkjk

Where gjk = the number of geodesics connecting j-k, and

gjk = the number that actor i is on.

Usually normalized by:

CB' (i)CB (i ) /[(n 1)(n 2) /2]

number of pairs of vertices excluding the vertex itself

betweenness centrality: definition

17

betweenness of vertex ipaths between j and k that pass through i

all paths between j and k

directed graph: (N-1)*(N-2)

Page 18: Lecture 6: Network centrality Slides are modified from Lada Adamic.

betweenness on toy networks

non-normalized version:

A B C ED

A lies between no two other vertices B lies between A and 3 other vertices: C, D, and E C lies between 4 pairs of vertices (A,D),(A,E),(B,D),(B,E)

note that there are no alternate paths for these pairs to take, so C gets full credit

18

Page 19: Lecture 6: Network centrality Slides are modified from Lada Adamic.

betweenness on toy networks

non-normalized version:

19

Page 20: Lecture 6: Network centrality Slides are modified from Lada Adamic.

betweenness on toy networks

non-normalized version:

20

broker

Page 21: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Nodes are sized by degree, and colored by betweenness.

example

Can you spot nodes with high betweenness but relatively low degree?

What about high degree but relatively low betweenness?

21

Page 22: Lecture 6: Network centrality Slides are modified from Lada Adamic.

betweenness on toy networks

non-normalized version:

A B

C

E

D

why do C and D each have betweenness 1?

They are both on shortest paths for pairs (A,E), and (B,E), and so must share credit:

½+½ = 1

Can you figure out why B has betweenness 3.5 while E has betweenness 0.5?

22

Page 23: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Alternative betweenness computations

Slight variations in geodesic path computations inclusion of self in the computations

Flow betweenness Based on the idea of maximum flow

edge-independent path selection effects the results May not include geodesic paths

Random-walk betweenness Based on the idea of random walks Usually yields ranking similar to geodesic betweenness

Many other alternative definitions exist based on diffusion, transmission or flow along network edges

23

Page 24: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Extending betweenness centrality to directed networks

We now consider the fraction of all directed paths between any two vertices that pass through a node

Only modification: when normalizing, we have (N-1)*(N-2) instead of (N-1)*(N-2)/2, because we have twice as many ordered pairs as unordered pairs

CB (i) g jkj ,k

(i) /g jk

betweenness of vertex ipaths between j and k that pass through i

all paths between j and k

CB

' (i)CB(i) /[(N 1)(N 2)]

24

Page 25: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Directed geodesics

A node does not necessarily lie on a geodesic from j to k if it lies on a geodesic from k to j

k

j

25

Page 26: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Outline

Degree centrality Centralization

Betweenness centrality

Closeness centrality

26

Page 27: Lecture 6: Network centrality Slides are modified from Lada Adamic.

closeness: another centrality measure

What if it’s not so important to have many direct friends?

Or be “between” others

But one still wants to be in the “middle” of things, not too far from the center

27

Page 28: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Closeness is based on the length of the average shortest path between a vertex and all vertices in the graph

Cc (i) d(i, j)j1

N

1

)1)).((()(' NiCiC CC

Closeness Centrality:

Normalized Closeness Centrality

closeness centrality: definition

28

depends on inverse distance to other vertices

Page 29: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Cc' (A)

d(A, j)j1

N

N 1

1

1 2 3 4

4

1

10

4

1

0.4

closeness centrality: toy example

A B C ED

29

Page 30: Lecture 6: Network centrality Slides are modified from Lada Adamic.

closeness centrality: more toy examples

30

Page 31: Lecture 6: Network centrality Slides are modified from Lada Adamic.

degree number of

connections denoted by size

closeness length of shortest

path to all others denoted by color

how closely do degree and betweenness correspond to closeness?

31

Page 32: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Closeness centrality

Values tend to span a rather small dynamic range typical distance increases logarithmically with network size

In a typical network the closeness centrality C might span a factor of five or less It is difficult to distinguish between central and less central

vertices a small chance in network might considerably affect the

centrality order

Alternative computations exist but they have their own problems

32

Page 33: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Influence range

The influence range of i is the set of vertices who are reachable from the node i

33

Page 34: Lecture 6: Network centrality Slides are modified from Lada Adamic.

Extensions of undirected closeness centrality

closeness centrality usually implies all paths should lead to you paths should lead from you to everywhere else

usually consider only vertices from which the node i in question can be reached

34