Top Banner
Jure Leskovec ([email protected]) Computer Science Department Cornell University / Stanford University Tutorial at ICML 2009
156
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Netwoks icml09

Jure Leskovec ([email protected])Computer Science DepartmentCornell University / Stanford University

Tutorial at ICML 2009

Page 2: Netwoks icml09

Introduce properties, models and tools for modeling and analysis of large real-world networks

Goal: find patterns, rules, clusters, outliers, … in large static and evolving graphs

Acknowledgements: Jon Kleinberg, Christos Faloutsos, Ravi Kumar, Andrew Tomkins, Lars Backstrom, Michael Mahoney, Anirban Dasgupta, Kevin Lang, Zoubin Ghahramani, LiseGetoor, Deepay Chakrabarti, Eric Horvitz

6/14/2009 2Jure Leskovec, ICML '09

Page 3: Netwoks icml09

Internet (a) Citation network (b) World Wide Web (c)

(b) (c)(a)

(d)(e)

Sexual network (d) Dating network (e)

Jure Leskovec, ICML '09 36/14/2009

Page 4: Netwoks icml09

Network data is increasingly available: Large on-line computing applications where data

can naturally be represented as a network: On-line communities: Facebook (120 million users) Communication: Instant Messenger (~1 billion users) News and Social media: Blogging (250 million users) Also in systems biology, health, medicine, …

Network is a set of weakly interacting entities Links give added value: Google realized web-pages are connected Collective classification

6/14/2009 Jure Leskovec, ICML '09 4

Page 5: Netwoks icml09

Information networks: World Wide Web: hyperlinks Citation networks Blog networks

Social networks: Organizational networks Communication networks Collaboration networks Sexual networks Collaboration networks

Technological networks: Power grid Airline, road, river networks Telephone networks Internet Autonomous systems

Florentine families Web graph

Collaboration networkFriendship network

Jure Leskovec, ICML '09 56/14/2009

Page 6: Netwoks icml09

Biological networks: metabolic networks food webs neural networks gene regulatory

networks Language networks: Semantic networks

Software networks: Call graphs

Yeast proteininteractions

Semantic network

Language networkSoftware network

Jure Leskovec, ICML '09 66/14/2009

Page 7: Netwoks icml09

The emergence of ‘cyberspace’ and the World Wide Web is like the discovery of a new continent.

Jim Gray, 1998 Turing Award address

Complex networks as phenomena, not just designed artifacts

What are the common patterns that emerge?

Jure Leskovec, ICML '09 76/14/2009

Page 8: Netwoks icml09

We want Kepler’s Laws of Motion for the Web.Mike Steuerwalt, NSF KDI workshop, 1998

Need statistical methods to quantify large networks

What do we hope to achieve from models of networks? Patterns and statistical properties of network data Design principles and models Understand why networks are organized the way

they are (predict behavior of networked systems)

Jure Leskovec, ICML '09 86/14/2009

Page 9: Netwoks icml09

Mining social networks has a long history in social sciences: Wayne Zachary’s PhD work (1970-72): observe social ties and

rivalries in a university karate club During his observation, conflicts led the group to split Split could be explained by a minimum cut in the social network

Jure Leskovec, ICML '09 96/14/2009

Page 10: Netwoks icml09

Traditional obstacle:Can only choose 2 of 3: Large-scale Realistic Completely mapped

Now: large on-line systems leave detailed records of social activity On-line communities: MyScace, Facebook, LiveJournal Email, blogging, electronic markets, instant messaging On-line publications repositories, arXiv, MedLine

Jure Leskovec, ICML '09 106/14/2009

Page 11: Netwoks icml09

Network data spans many orders of magnitude: 436-node network of email exchange over 3-months

at corporate research lab [Adamic-Adar, SocNets ‘03] 43,553-node network of email exchange over 2 years

at a large university [Kossinets-Watts, Science ‘06] 4.4-million-node network of declared friendships on

a blogging community [Liben-Nowell et al., PNAS ‘05, Backstrom et at., KDD ‘06] 240-million-node network of all IM communication

over a month on Microsoft Instant Messenger [Leskovec-Horvitz, WWW ‘08]

Jure Leskovec, ICML '09 116/14/2009

Page 12: Netwoks icml09

How does massive network data compare to small-scale studies?

Massive network datasets give us more and less: More: can observe global phenomena that are

genuine, but literally invisible at smaller scales Less: don’t really know what any node or link means.

Easy to measure things, hard to pose right questions Goal: Find the point where the lines of research

converge

Jure Leskovec, ICML '09 126/14/2009

Page 13: Netwoks icml09

What have we learned about large networks?

Structure: Many recurring patterns Scale-free, small-world, locally clustered, bow-tie,

hubs and authorities, communities, bipartite cores, network motifs, highly optimized tolerance

Processes and dynamics:Information propagation, cascades, epidemic thresholds, viral marketing, virus propagation, diffusion of innovation

Jure Leskovec, ICML '09 136/14/2009

Not in today’s tutorial

Page 14: Netwoks icml09

Structure and models for networks What are properties of real graphs?

How to model them?

Part 1: Modeling global network structure “Mechanistic” approaches (math, physics)

Part 2: Modeling local network structure (links): Statistical/ML approaches

Part 3: Modeling network structure at the level of groups of nodes Graph partitioning/clustering

Jure Leskovec, ICML '09 146/14/2009

Page 15: Netwoks icml09

Erdos-Renyi random graphs Preferential attachment Small-world model Power-law degree distributions Local clustering Six degrees of separation Kronecker graphs

6/14/2009 Jure Leskovec, ICML '09 15

Page 16: Netwoks icml09

How do large network “look like”? Empirical: statistical tools to quantify structure networks Models: mechanisms that reproduce such properties

(models also make “predictions” about other properties) 3 parts/goals: Large scale statistical properties of large networks Models that help understand these properties Predict behavior of networked systems based on measured

structural properties and local rules governing individual nodes

6/14/2009 Jure Leskovec, ICML '09 16

Page 17: Netwoks icml09

What is the simplest way to generate a graph? Erdos-Renyi Random Graph model [Erdos-Renyi, ‘60] aka.: Poisson/Bernoulli random graphs

Two variants: Gn,p: graph on n nodes and each edge (u,v) appears i.i.d. with

prob. p. So a graph with m edges appears with prob. pm(1-p)M-m, where M=n(n-1)/2 is the max number of edges

Gn,m: graphs with n nodes, m uniformly at random picked edges

6/14/2009 Jure Leskovec, ICML '09 17

What kinds of networks does such process produce?

Page 18: Netwoks icml09

Degree distribution is Binomial (Poisson in the limit). Let pk denote a fraction of nodes with degree k

Diameter is O(log n): G has expansion α if∀ S⊆V: #edges leaving S≥ α⋅|S| Let Sj be a set of nodes within j steps of v. Then

|Sj+1| ≥ α|Sj|. So in O(log n) steps |Sj| grows to Θ(n).

Emergence of giant component: avg. degree k=2m/n: k=1-ε: all components are of size Ω(log n) k=1+ε: 1 component of size Ω(n), others have size Ω(log n)

6/14/2009 Jure Leskovec, ICML '09 18

!)1(

kezpp

kn

pzk

knkk

−− ≈−

=

Page 19: Netwoks icml09

Take real network plot a histogram of pk vs. k

6/14/2009 Jure Leskovec, ICML '09 19

Flickr social network

n= 584,207, m=3,555,115

[Leskovec et al. KDD ‘08]

Page 20: Netwoks icml09

Plot the same data on log-log axis:

6/14/2009 Jure Leskovec, ICML '09 20

Flickr social network

n= 584,207, m=3,555,115

[Leskovec et al. KDD ‘08]

Page 21: Netwoks icml09

Degrees are heavily skewed: Distribution is heavy tailed:

Various names, kinds and forms: Long tail, Heavy tail, Zipf’s law, Pareto’s law

6/14/2009 Jure Leskovec, ICML '09 21

Many other quantities follow heavy-tailed distributions

Page 22: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 22

Page 23: Netwoks icml09

Power law degree exponent is typically 2 < α < 3 Web graph [Broder et al. 00]: αin = 2.1, αout = 2.4

Autonomous systems [Faloutsoset al. 99]: α = 2.4

Actor collaborations [Barabasi-Albert 00]: α = 2.3

Citations to papers [Redner 98]: α ≈ 3

Online social networks [Leskovecet al. 07]: α ≈ 2

6/14/2009 Jure Leskovec, ICML '09 23

Page 24: Netwoks icml09

Tails are heavy: E[x] If α ≤ 2 : E[x]= ∞ If α ≤ 3 : Var[x]=∞

Estimating power-law exponent α from data:1. Fit a line on log-log axis using least squares2. Plot Complementary CDF P(X>x), then α=1+α*

where α* is the slope of P(X>x). E.i., if P(X=x) x-α

then P(X>x)=x-(α-1)

3. Use MLE: xi is degree of node i

6/14/2009 Jure Leskovec, ICML '09 24

BAD!

For further details see [Clauset-Shalizi-Newman 2007]

Ok

Ok

Page 25: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 25

Linear scaleLog scale, α=1.75

CCDF, Log scale, α=1.75

CCDF, Log scale, α=1.75,

exp. cutoff

Flickr

Page 26: Netwoks icml09

Random network Scale-free (power-law) networkFunction is scale free if:f(ax) = c f(x)

(Erdos-Renyi random graph)

Degree distribution is Binomial

Degree distribution is Power-law

Jure Leskovec, ICML '09 Part 1-266/14/2009

Page 27: Netwoks icml09

Preferential attachment [Price 1965, Albert-Barabasi 1999]: A new node creates m out-links Prob. of linking to node i is

proportional to its degree ki

Herbert Simon’s result Power-laws arise from “Rich get

richer” (cumulative advantage)

Examples [Price 65]: Citations: new citations of a

paper are proportional to the number it already has

6/14/2009 Jure Leskovec, ICML '09 27

Gives graphs with power-law degree distribution: α=3

3−∝ kpk

Page 28: Netwoks icml09

Preferential attachment is a key ingredient Extensions: Early nodes have advantage: node fitness Geometric preferential attachment

Copying model [Kleinberg et al.]: Picking a node proportional to

the degree is same as picking an edge at random (pick node and then it’s neighbor)

6/14/2009 Jure Leskovec, ICML '09 28

Page 29: Netwoks icml09

4 online social networks with exact edge arrival sequence

Directly observe mechanisms leading to global network properties

6/14/2009 Jure Leskovec, ICML '09 29

(F)(D)(A)(L)

[Leskovec et al. KDD 08]

and so on for millions…

Page 30: Netwoks icml09

Jure Leskovec, ICML '09

τkkpe ∝)(Gnp

PA

(D)

(F)(L)

(A)

Network τ

Gnp 0

PA 1

F 1

D 1

A 0.9

L 0.6

We unroll the true network edge arrivals Measure node degrees where edges attach

6/14/2009 30

[Leskovec et al. KDD 08]

PA holds! (with a little caveat)

Page 31: Netwoks icml09

Real-world networks are resilient to random node attacks One has to remove all web-pages of degree > 5 to disconnect the web But this is a very small percentage of web pages

Random network has better resilience to targeted attacks

Fraction of removed nodes

Mea

n pa

th le

ngth

Fraction of removed nodes

Internet (Autonomous systems)

Randomremoval

Preferential node removal

Jure Leskovec, ICML '09 Part 1-316/14/2009

[Albert et al. Nature ‘00]

Random network

Randomremoval

Preferential node removal

Page 32: Netwoks icml09

Since web graph is scale-free (and not random) outliers (high-degree webpages) are common

Thus ranking webpages based on the link structure of the web graph works: PageRank Hubs and Authorities

6/14/2009 Jure Leskovec, ICML '09 32

Page 33: Netwoks icml09

Six degrees of separation[Milgram 60s]: Random people in Nebraska

were asked to send letters to stock brokers in Boston

Letters can only be passed to first-name acquaintances

On average letters reached the goal in 6 steps

6/14/2009 Jure Leskovec, ICML '09 33

Page 34: Netwoks icml09

Network: who talks to whom on MSN messenger240M nodes, 1.3 billion edges

346/14/2009 Jure Leskovec, ICML '09

[Leskovec-Horvitz WWW ‘08]

Page 35: Netwoks icml09

35

MSN Messenger

Average path length is 6.690% of nodes is reachable <8 steps

Hops Nodes

0 1

1 10

2 78

3 3,96

4 8,648

5 3,299,252

6 28,395,849

7 79,059,497

8 52,995,778

9 10,321,008

10 1,955,007

11 518,410

12 149,945

13 44,616

14 13,740

15 4,476

16 1,542

17 536

18 167

19 71

20 29

21 16

22 10

23 3

24 2

25 36/14/2009 Jure Leskovec, ICML '09

[Leskovec-Horvitz WWW ‘08]

Page 36: Netwoks icml09

36

MSN Messenger

Average path length is 6.690% of nodes is reachable <8 steps

Hops Nodes

0 1

1 10

2 78

3 3,96

4 8,648

5 3,299,252

6 28,395,849

7 79,059,497

8 52,995,778

9 10,321,008

10 1,955,007

11 518,410

12 149,945

13 44,616

14 13,740

15 4,476

16 1,542

17 536

18 167

19 71

20 29

21 16

22 10

23 3

24 2

25 3

We already saw that diameters of networks tend to be small.

But edges in social networks tend to be local/clustered.

6/14/2009 Jure Leskovec, ICML '09

[Leskovec-Horvitz WWW ’08]

Page 37: Netwoks icml09

uw

v

Just before the edge (u,v) is placed how many hops is between u and v?

Jure Leskovec, ICML '09

Network % Δ

Flickr 66%

Delicious 28%

Answers 23%

LinedIn 50%

GnpPA

(D)

(F)

(L) (A)

Fraction of triad closing edges

PA holds but edges are local. Most close triangles!6/14/2009 37

[Leskovec et al. KDD ‘08]

Page 38: Netwoks icml09

How to have local edges (lots of triangles) and small diameter?

Small-world model [Watts-Strogatz 1998]: Start with a low-dimensional

regular lattice Rewire: Add/remove edges to create shortcuts

to join remote parts of the lattice For each edge with prob. p move the

other end to a random vertex

6/14/2009 Jure Leskovec, ICML '09 38

[Watts-Strogatz Nature ‘98]

Page 39: Netwoks icml09

Rewiring allows to interpolate between regular lattice and a random graph

Jure Leskovec, ICML '09 Part 1-39

High clusteringHigh diameter

High clusteringLow diameter

Low clusteringLow diameter

6/14/2009

[Watts-Strogatz Nature ‘98]

Page 40: Netwoks icml09

Clu

ster

ing

coef

ficie

nt, C

= 1

/n ∑

Ci

Prob. of rewiring, p

Ci=1 Ci=1/3 Ci=0

6/14/2009 40Jure Leskovec, ICML '09

[Watts-Strogatz Nature ‘98]

Page 41: Netwoks icml09

Conseqences of Milgram’s experiment: Short paths exist in networks People are able to find them! How?

6/14/2009 Jure Leskovec, ICML '09 41

[Milgram ‘67]

Actual path of the letter traveling from Nebraska to Boston [Milgram ‘67]

Page 42: Netwoks icml09

Networks are navigable! Model: People on a grid, each

node u connects to 4 neighbors and has 1 long range link to node v with prob. d(u,v)-α

Greedy navigation algorithm: given (x,y) location of the target node forward the packet to the neighbor geographically closest to target

6/14/2009 Jure Leskovec, ICML '09 42

[Kleinberg Nature ’01, Dodds et al. Science ‘03, LibenNovell et al. PNAS ’05]

Page 43: Netwoks icml09

Result: If probability of a long link d(u,v)-α, then for α=2 greedy navigation will find the target in poly-log time WHP.

Proof idea: If α too small: too many long links If α too large: too many short links

Application in P2P networksfor search

6/14/2009 Jure Leskovec, ICML '09 43

[Kleinberg Nature‘01]

Page 44: Netwoks icml09

Prior models and intuition say that the network diameter slowly grows (like log N, log log N)

time

diam

eter

diam

eter

size of the graph

Internet

Citations

Diameter shrinks over time as the network grows the

distances between the nodes slowly decrease

44

[Leskovec et al. KDD 05]

6/14/2009 Jure Leskovec, ICML '09

Page 45: Netwoks icml09

Networks are denser over time Densification Power Law:

a … densification exponent (1 ≤ a ≤ 2)

What is the relation between the number of nodes and the edges over time?

Prior models assume: constant average degree over time

Internet

Citations

a=1.2

a=1.6

N(t)

E(t)

N(t)

E(t)

45

[Leskovec et al. KDD 05]

6/14/2009 Jure Leskovec, ICML '09

Page 46: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 46

diam

eter

size of the graph

Erdos-Renyirandom graph

Densification exponent a =1.3

Densifying random graph has increasing diameter⇒ There is more to shrinking diameter

than just densification

Is shrinking diameter just a consequence of densification?

Page 47: Netwoks icml09

Compare diameter of a: True network (red) Random network with

the same degree distribution (blue)

47

diam

eter

size of the graphdi

amet

er

Citations

Densification + degree sequence give shrinking diameter

6/14/2009 Jure Leskovec, ICML '09

Page 48: Netwoks icml09

Models: Forest Fire [Leskovec et al. KDD 05] Based on iterative node attachment

Kronecker graphs (coming next) Affiliation networks [Lattanzi-Sivakumar

STOC 09] Build a scale free bipartite network B(Q,U) Power-law in- and out-degree distribution

G(Q, E): fold edges of B: Pair of nodes in G is connected if they share

a neighbor in B

6/14/2009 Jure Leskovec, ICML '09 48

B(Q,U)

G(Q,U)

Page 49: Netwoks icml09

Want to generate realistic networks:

6/14/2009 Jure Leskovec, ICML '09 49

Why synthetic graphs? Anomaly detection, Simulations, Predictions, Null-

model, Sharing privacy sensitive graphs, …

Q: Which network properties do we care about? Q: What is a good model and how do we fit it?

Compare graphs properties, e.g., degree distribution

Given a real network

Generate a synthetic network

Page 50: Netwoks icml09

Kronecker product of matrices A and B is given by

We define a Kronecker product of two graphs as a Kronecker product of their adjacency matrices

N x M K x L

N*K x M*L

506/14/2009 Jure Leskovec, ICML '09

Page 51: Netwoks icml09

Kronecker graph: a growing sequence of graphs by iterating the Kronecker product

Each Kronecker multiplication exponentially increases the size of the graph

One can easily use multiple initiator matrices (K1

’, K1’’, K1

’’’ ) that can be of different sizes

51

[Leskovec et al. PKDD ‘05]

6/14/2009 Jure Leskovec, ICML '09

K1

Page 52: Netwoks icml09

Kronecker graphs mimic real networks: Theorem: Power-law degree distribution, Densification,

Shrinking/stabilizing diameter, Spectral properties

Initiator

(9x9)(3x3)

(27x27)

52

pij

Edge probability Edge probability

Starting intuition: Recursion & self-similarity

[Leskovec et al. PKDD ‘05]

6/14/2009 Jure Leskovec, ICML '09

K1

Page 53: Netwoks icml09

53

[Leskovec et al. PKDD ‘05]

6/14/2009 Jure Leskovec, ICML '09

Page 54: Netwoks icml09

Initiator matrix K1 is a similarity matrix Node u is described with k binary

attributes: u1, u2 ,…, uk Probability of a link between nodes u, v:

P(u,v) = ∏ K1[ui, vi]

54

=1K a bc d

a b

c d

a b

c d

v

u = (0,1,1,0)

P(u,v) = b·d·c·b

0 101 v = (1,1,0,1)

u

[Leskovec et al. 09]

6/14/2009 Jure Leskovec, ICML '09

Page 55: Netwoks icml09

Given a real network GWant to estimate initiator matrix:

Method of moments [Owen ‘09]

Compare counts of and solve system of equations.

Maximum likelihood [Leskovec-Faloutsos ICML ‘07]

arg max P( | G1) SVD [VanLoan-Pitsianis ‘93]

Can solve using SVD

6/14/2009 Jure Leskovec, ICML '09 55

=1K a bc d

211min

FKKG ⊗−

Page 56: Netwoks icml09

Maximum likelihood estimation

Naïve estimation takes O(N!N2): N! for different node labelings: Our solution: Metropolis sampling: N! (big) const

N2 for traversing graph adjacency matrix Our solution: Kronecker product (E << N2): N2 E

Do gradient descent

=1K a bc d

1KP( | ) Kronecker

arg max

Estimate the model in O(E)56

1G

[Leskovec-Faloutsos ICML ‘07]

6/14/2009 Jure Leskovec, ICML '09

Page 57: Netwoks icml09

Real and Kronecker are very close:

57

=1K0.99 0.54

0.49 0.13

[Leskovec-Faloutsos ICML ‘07]

6/14/2009 Jure Leskovec, ICML '09

Page 58: Netwoks icml09

We can generate realistic looking networks: Simulations of new algorithms where real graphs are

hard/impossible to collect Anomaly detection – abnormal behavior, evolution Predictions – predicting future from the past Hypothesis testing Graph sampling – many real world graphs are too large to

deal with “What if” scenarios

6/14/2009 Jure Leskovec, ICML '09 58

Page 59: Netwoks icml09

Link prediction in networks Hierarchical random graphs Exponential random graphs Statistical relational learning

6/14/2009 Jure Leskovec, ICML '09 59

Page 60: Netwoks icml09

Network modeling is all about predicting links but so far we have not tackled this problem directly

Task: predict missing links in a network In a evolving network In a static network

2 types of approaches: Node distance approaches: define a distance function, closer nodes are more likely to link

Statistical approaches: Design a model of link creation and fit to data

6/14/2009 Jure Leskovec, ICML '09 60

Page 61: Netwoks icml09

Link prediction in a evolving network:

Task: Given G[t0,t0’] a graph on edges up to time t0’ output a ranked list L of links (not in G[t0,t0’]) that are predicted to appear in G[t1,t1’]

Evaluation: n=|Enew|: # new edges that appear during the test period [t1,t1’]Take top n elements of L and count correct edges

6/14/2009 Jure Leskovec, ICML '09 61

[LibenNowell-Kleinberg CIKM ‘03]

Page 62: Netwoks icml09

Predict links a evolving collaboration network

Core: since network data is very sparse Consider only nodes with in-degree and out-

degree of at least 3

6/14/2009 Jure Leskovec, ICML '09 62

[LibenNowell-Kleinberg CIKM ‘03]

Page 63: Netwoks icml09

Rank potential links (x,y) based on:

6/14/2009 Jure Leskovec, ICML '09 63

[LibenNowell-Kleinberg CIKM ‘03]

Γ(x) … degree of node x

Page 64: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 64

[LibenNowell-Kleinberg CIKM’ 03]

Page 65: Netwoks icml09

Hierarchical model of network structure Tree D: Leaves of D correspond to nodes of the network Internal nodes of D have Bernoulli parameters θi

associated with them (edge probability) Prob. of edge (u,v) is θx where x is the least common

ancestor of leaves u and v

6/14/2009 Jure Leskovec, ICML '09 65

[Clauset et al. Nature ‘08]

Tree: shade corresponds to value of θi

1

4

3

2

5

6

7

Corresponding graph

Page 66: Netwoks icml09

Graphs and corresponding hierarchies:

6/14/2009 Jure Leskovec, ICML '09 66

[Clauset et al. Nature ‘08]

Page 67: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 67

[Clauset et al. Nature ‘08]

D1

Li, Ri: # edges in left/right subtreeEi: # edges between the subtrees

G

D2

Given a graph G and a model D How do we compute the likelihood L(D)=P(G|D)?

Example:

Page 68: Netwoks icml09

How estimate model parameters θi? Just count number of edges between the subtrees:

How to learn the tree? Markov Chain Monte Carlo to the rescue to

stochastically search over the network structures Each internal node i can be in one of the 3

configurations:

6/14/2009 Jure Leskovec, ICML '09 68

[Clauset et al. Nature ‘08]

Li, Ri: # edges in left/right subtreeEi: # edges between the subtrees

Algorithm:1. Randomly pick internal node i2. Randomly pick one of the two

alternative configurations3. Accept the change based on

likelihood ratio

i i

Page 69: Netwoks icml09

The model has linear number (n) parameters Possible problem due to overfitting Solution: Model averaging 1: do MCMC so that it converges to stationary

distribution 2: sample models from stationary distribution and then

out Majority Consensus model (tree): Each model Di has a score (likelihood L(Di)) Use tree consensus procedure

6/14/2009 Jure Leskovec, ICML '09 69

[Clauset et al. Nature ‘08]

Page 70: Netwoks icml09

Setting: Given a static network G on m edges Create graph G’ on a random subset of x edges Estimate the model on G’ Output m-x most likely edges

Results AUC: 3 networks with tens of nodes

6/14/2009 Jure Leskovec, ICML '09 70

[Clauset et al. Nature ‘08]

Terrorist network Metabolic network Food web

Page 71: Netwoks icml09

Results: Improvement over random

6/14/2009 Jure Leskovec, ICML '09 71

[Clauset et al. Nature’ 08]

Page 72: Netwoks icml09

Mainly used by statistics and traditional social network analysis community

Descriptive model: numerical summary measures Nodal level: centrality, node attributes Configuration level: cycles, triads, reciprocity Network level: clustering, core-periphery

Generative: Test alternative hypotheses Extrapolate and simulate from the model

6/14/2009 Jure Leskovec, ICML '09 72

Page 73: Netwoks icml09

Log-linear model over graph configurations: Unit of analysis: an edge (dyad) Observations (edges) are dependent

In the most general case:

where : A: (labeled) configurations θA: parameter for configuration A gA(y): if configuration A is present gA(y)=1 else gA(y)=0 Z: normalizing constant (sum over all possible graphs!)

6/14/2009 Jure Leskovec, ICML '09 73

))(exp(1)( ∑==A AA yg

ZyYP θ

(We usually replace g(y) with g(y)-g(yobs))

Page 74: Netwoks icml09

Attributes of nodes: Characteristics of a group: activity Individual’s characteristics

Attributes of links: Characteristics of links: duration, type

Configurations: Node degree: Cycles: Common neighbors:

6/14/2009 Jure Leskovec, ICML '09 74

Edge independent

terms

Edge dependent

terms

Page 75: Netwoks icml09

Edge independence model:

where: y: observed graph adjacency matrix φ: the expected number of edges ρ: tendency toward reciprocation αi: productivity of a node (out-degree) βi: attractiveness of a node (in-degree)

6/14/2009 Jure Leskovec, ICML '09 75

[Holland-Leinhardt ‘81]

Page 76: Netwoks icml09

Problem: normalizing constant

6/14/2009 Jure Leskovec, ICML '09 76

))(exp( graphs

possible all∑∑=

A AA

y

ygZ θ

For the graph on left we have to sum over 7,547,924,849,643,082,704,483,109,161,

976,537,781,833,842,440,832,880,856,752,412,600,491,248,324,784,297,704,172,253,450,355,317,535,082,936,750,061,527,689,799,541,169,259,849,585,265,122,868,502,865,392,087,298,790,653,952 terms (graphs)

Page 77: Netwoks icml09

Suppose we fix θ0 then log-likelihood:log E[exp((θ0- θ)g(Y))]= l(θ)- l(θ0)

Law of large numbers says we can approximate true mean by a sample mean

Thus:

where Y1, Y2, …, Ym is a random sample of networks from the distribution defined by the model with parameters θ0

6/14/2009 Jure Leskovec, ICML '09 77

∑=

−≈−m

iiYg

mll

100 ))()exp((1)()( θθθθ

[Hunter ‘06]

Page 78: Netwoks icml09

Goal: simulate random networks Y from the p* model

Use Markov Chain Monte Carlo: Repeat for a long time: Select a pair of nodes (i,j) at random Calculate likelihood ratio:

π = P(Yij changes) / P(Yij does not change)accept the change with prob. min{1, π}

Convergence is agonizingly slow

6/14/2009 Jure Leskovec, ICML '09 78

[Hunter ‘06]

Page 79: Netwoks icml09

Graph of relations between Florentine families (n=16,m=19)

Decide on the features: Density, Two-star,

Three-star, Triangle

Parameter estimates: θ < 0: edges occur relatively

rarely, especially if they are notpart of higher order structures τ > 0: business tries tend to occur

in triangular structuresJure Leskovec, ICML '09 Part 1-79

[Robins et al. ‘06]

6/14/2009

Page 80: Netwoks icml09

Scalability: for graphs up to 1,000 nodes MCMC converges very slowly Computation of features can be expensive

Model degeneracy: Very small number of graphs

has high probability This is a problem for networks

with high transitivity (i.e., socialnetworks) as the model clumps triangles together.

6/14/2009 Jure Leskovec, ICML '09 80

Page 81: Netwoks icml09

Types of predictive tasks addressed by SRL: Object classification: predict category of an object

based on its attributes and links Link classification: predict type of a link Link existence: predict whether a link exists or not Link cardinality estimation: predict the number of links

of a node Approaches use directed and undirected

graphical models See Introduction to statistical relational learning

by Taskar and Getoor

6/14/2009 Jure Leskovec, ICML '09 81

Page 82: Netwoks icml09

Will a paper get accepted? Templated Bayes network: each entity defines

a little graphical model

6/14/2009 Jure Leskovec, ICML '09 82

Length

Mood

Author

Good Writer

Paper

Quality

Accepted

Review

Smart

[Getoor et al.]

Page 83: Netwoks icml09

Data on papers, authors & reviews instantiates a big Bayes net

Some labels are given

Infer the missing labels

Collective classification

6/14/2009 Jure Leskovec, ICML '09 83

Author A1Paper P1Author: A1Review: R1

Review R2

Review R1

Author A2

Paper P2Author: A1Review: R2

Paper P3Author: A2Review: R2

Good Writer

Smart

Length

Mood

Quality

Accepted

Length

Mood

Review R3

Length

Mood

Quality

Accepted

Quality

Accepted

Good Writer

Smart

[Getoor et al.]

Page 84: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 84

Author3Fame

nodes = domain variablesedges = mutual influence

Author1Fame

Author4Fame

Author2Fame

0.6

f2

f4

f2

0.3

1.5

0.3

f4f2

f4

f4

f2

F4F2 φ(F2,F4)

Potentials measurecompatibility

)4,3()4,2()3,1()2,1(14321 34241312 ffffffffZ

)f,,ff,P(f φφφφ=

Good news: no acyclicity constraintsBad news: global normalization (1/Z)

[Taskar et al. NIPS ’01]

Page 85: Netwoks icml09

Web-KB: university webpages 2954 pages from Stanford, Berkeley, MIT Webpage classes: organization, student, research

group, faculty, course, research project, research scientist, staff Predict relation type: Advisor, Member, Teach, TA Train on two universities, predict on one

Does link structure help in classifying the type of a webpage?

6/14/2009 Jure Leskovec, ICML '09 85

[Taskar et al. NIPS ‘01]

Page 86: Netwoks icml09

Link structure helps with prediction

6/14/2009 Jure Leskovec, ICML '09 86

[Taskar et al. NIPS ‘01]

(predict links independently LogReg)(cliques over triangles in the link graph)

(cliques over sections of the page)

Page 87: Netwoks icml09

SLR in a nutshell: Inside each node/edge we have a

small graphical model Link structure defines additional

dependencies between the variables A big graphical model

Very good for collective classification predicting node/edge types

Inference is hard but many clever ideas on exploiting templated structure of the graphical model

For modeling network structure one has to consider all possible edges and then for each infer its presence/absence.

6/14/2009 Jure Leskovec, ICML '09 87

Y

N

N

Page 88: Netwoks icml09

Group formation Finding groups/communities/clustersModular structure in networks Consequences

6/14/2009 Jure Leskovec, ICML '09 88

Page 89: Netwoks icml09

In a social network nodes explicitly declare group membership: Facebook groups Publication venue

Can think of groups as node colors Gives insights into social dynamics: Recruits friends? Memberships spread

along edges Doesn’t recruit? Spread randomly

What factors influence a person’s decision to join a group?

What factors indicate that a group will grow in membership?

6/14/2009 Jure Leskovec, ICML '09 89

[Backstrom et al. KDD ‘06]

Page 90: Netwoks icml09

Analogous to diffusion: Group memberships spread over the network: Red circles represent

existing group members Yellow squares may join

Question: How does prob. of joining

a group depend on the number of friends already in the group?

[Backstrom et al. KDD ‘06]

6/14/2009 90Jure Leskovec, ICML '09

Page 91: Netwoks icml09

Diminishing returns: Probability of joining increases with the number of

friends in the group But increases get smaller and smaller

LiveJournal: 1 million users250,000 groups

DBLP: 400,000 papers2000 conferences,100,000 authors

[Backstrom et al. KDD ‘06]

6/14/2009 91Jure Leskovec, ICML '09

Page 92: Netwoks icml09

Connectedness of friends: x and y have three friends in the group x’s friends are independent y’s friends are all connected

Who is more likely to join?x y

6/14/2009 92Jure Leskovec, ICML '09

[Backstrom et al. KDD ‘06]

Page 93: Netwoks icml09

Competing sociological theories Information argument [Granovetter ‘73] Social capital argument [Coleman ’88]

Information argument: Unconnected friends give independent support

Social capital argument: Safety/truest advantage in having friends who

know each other

x y

6/14/2009 93Jure Leskovec, ICML '09

[Backstrom et al. KDD ‘06]

Page 94: Netwoks icml09

LiveJournal: 1 million users, 250,000 groups

[Backstrom et al. KDD ‘06]

6/14/2009 94Jure Leskovec, ICML '09

Social capital argument wins!Prob. of joining increases with

adjacent members.

Page 95: Netwoks icml09

Predict whether a user will join a group Important features: Group activity level in LiveJournal (# posts). Internal connectedness of friends. Other topological features of the friendship graphs

LiveJournal DBLP

[Backstrom et al. KDD ‘06]

6/14/2009 95Jure Leskovec, ICML '09

Page 96: Netwoks icml09

Predict whether group will grow significantly: Less than 9% vs. greater than 18%

Predicting based on fringe size does not do well

Using more sophisticated features gives good performance Number of closed triads Number of people with at least 10

friends in the group Total number of friendships

6/14/2009 Jure Leskovec, ICML '09 96

Feature AUCFringe Size 0.559Group Size 0.521Fringe/Group 0.562Above Three 0.601All network features

0.771

[Backstrom et al. KDD ‘06]

Page 97: Netwoks icml09

Findings so far suggest that network groups are tightly connected

Network communities: Sets of nodes with lots of

connections inside and few to outside (the rest of the network)

97

Communities, clusters, groups, modules

6/14/2009 Jure Leskovec, ICML '09

Page 98: Netwoks icml09

How to automatically find such densely connected groups of nodes?

Ideally such automatically detected clusters would then correspond to real groups

For example:

98

Communities, clusters, groups, modules

6/14/2009 Jure Leskovec, ICML '09

Page 99: Netwoks icml09

Zachary’s Karate club network: Observe social ties and rivalries in a university karate club During his observation, conflicts led the group to split Split could be explained by a minimum cut in the network

Jure Leskovec, ICML '09 Part 1-996/14/2009

Page 100: Netwoks icml09

Find micro-markets by partitioning the “query x advertiser” graph:

advertiser

quer

y

1006/14/2009 Jure Leskovec, ICML '09

Page 101: Netwoks icml09

Many methods: Linear (low-rank) methods: If Gaussian, then low-rank space is good

Kernel (non-linear) methods: If low-dimensional manifold, then kernels are good

Hierarchical methods: Top-down and bottom-up – common in social sciences

Graph partitioning methods: Define “edge counting” metric – conductance,

expansion, modularity, etc. – and optimize!

1016/14/2009 Jure Leskovec, ICML '09

Page 102: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 102

What is a good notion that would extract such clusters?

Page 103: Netwoks icml09

Divisive hierarchical clustering based on the notion of edge betweenness:

Number of shortest paths passing through the edge Remove edges in decreasing betweenness

6/14/2009 Jure Leskovec, ICML '09 103

[Girvan-Newman PNAS ‘02]

4933

11

Page 104: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 104

[Girvan-Newman PNAS ‘02]

Page 105: Netwoks icml09

Zachary’s Karate club: hierarchical decomposition

6/14/2009 Jure Leskovec, ICML '09 105

[Newman-Girvan PhysRevE ‘03]

Page 106: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 106

Communities in physics collaborations

[Newman-Girvan PhysRevE ‘03]

Page 107: Netwoks icml09

Want to compute betweenness of paths starting at node A

6/14/2009 Jure Leskovec, ICML '09 107

Breath first search starting from A:

Page 108: Netwoks icml09

Count the number of shortest paths from A to all other nodes of the network:

6/14/2009 Jure Leskovec, ICML '09 108

Page 109: Netwoks icml09

Compute betweenness by working up the tree: If there are multiple paths count them fractionally

6/14/2009 Jure Leskovec, ICML '09 109

1 path to KSplit evenly

1+0.5 paths to JSplit 1:2

1+1 paths to HSplit evenly

• Repeat the BFS procedure for each node of the network• Add edge scores

Page 110: Netwoks icml09

Hierarchical random graphs can be used to extract hierarchical community structure

6/14/2009 Jure Leskovec, ICML '09 110

[Clauset et al. Nature ‘08]

Simple network Corresponding dendrogram Grassland species networkNode shapes: plants, herbivores, parasitoids and hyperparasitoids

Page 111: Netwoks icml09

Communities: Sets of nodes with lots of

connections inside and few to outside (the rest of the network)

111

Question:Are large networks

really like this?

Hierarchical community structure

6/14/2009 Jure Leskovec, ICML '09

Page 112: Netwoks icml09

Community (cluster) structure of networks

112

Physics collaborations Tiny part of a large social network

How does community structure scale from small to large networks?

6/14/2009 Jure Leskovec, ICML '09

Page 113: Netwoks icml09

Conductance (normalized cut):

How community-like is a set of nodes?

How good of a community is a set of nodes?

Small Φ(S) == more community-like sets of nodes

S

S’

1136/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 114: Netwoks icml09

Score: Φ(S) = # edges cut / # edges inside

What is “best” community of

5 nodes?

1146/14/2009 Jure Leskovec, ICML '09

Page 115: Netwoks icml09

Score: Φ(S) = # edges cut / # edges inside

Bad community

Φ=5/6 = 0.83

What is “best” community of

5 nodes?

1156/14/2009 Jure Leskovec, ICML '09

Page 116: Netwoks icml09

Score: Φ(S) = # edges cut / # edges inside

Better community

Φ=5/7 = 0.7

Bad community

Φ=2/5 = 0.4

What is “best” community of

5 nodes?

1166/14/2009 Jure Leskovec, ICML '09

Page 117: Netwoks icml09

Score: Φ(S) = # edges cut / # edges inside

Better community

Φ=5/7 = 0.7

Bad community

Φ=2/5 = 0.4

Best communityΦ=2/8 = 0.25

What is “best” community of

5 nodes?

1176/14/2009 Jure Leskovec, ICML '09

Page 118: Netwoks icml09

Define:Network community profile (NCP) plot

Plot the score of best community of size k

118

Community size, log k

log Φ(k)Φ(5)=0.25

Φ(7)=0.18

k=5 k=7

[Leskovec et al. 08]

6/14/2009 Jure Leskovec, ICML '09

Page 119: Netwoks icml09

119

Community size, log k

Com

mun

ity sc

ore,

log Φ

(k)

• Every dot represents a cut on k nodes• Lower envelope gives score of best community on k nodes

[Leskovec et al. WWW ‘08]

Page 120: Netwoks icml09

Idea: Use approximation algorithms for NP-hard graph partitioning problems as experimental probes of network structure. Spectral (quadratic approx): confuses “long paths” with

“deep cuts” Multi-commodity flow (log(n) approx): difficulty with

expanders SDP (sqrt(log(n)) approx): best in theory Metis (multi-resolution heuristic): common in practice X+MQI: post-processing step on, e.g., MQI of Metis

Local Spectral - connected and tighter sets (empirically)

Metis+MQI - best conductance (empirically)

1206/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 121: Netwoks icml09

d-dimensional meshes California road network

1216/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 122: Netwoks icml09

Manifold learning dataset (Hands)

122

[Leskovec et al. WWW ‘08]

Page 123: Netwoks icml09

Zachary’s university karate club social network

1236/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 124: Netwoks icml09

Collaborations between scientists in Networks [Newman, 2005]

124

Community size, log k

Cond

ucta

nce,

log

Φ(k

)

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 125: Netwoks icml09

125

[Ravasz-Barabasi 03]

[Clauset-Moore-Newman 08]6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 126: Netwoks icml09

Natural hypothesis about NCP: NCP of real networks slopes

downward Slope of the NCP corresponds to

the dimensionality of the network

126

What about large networks?

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 127: Netwoks icml09

Typical example: General Relativity collaborations(n=4,158, m=13,422)

1276/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 128: Netwoks icml09

6/14/2009 Jure Leskovec, ICML '09 128

[Leskovec et al. WWW ‘08]

Page 129: Netwoks icml09

Φ(k

), (c

ondu

ctan

ce)

k, (community size)

Better and better communities

Communities get worse and worse

Best community has ~100 nodes

1296/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. WWW ‘08]

Page 130: Netwoks icml09

Each new edge inside the community costs more

NCP plot

Φ=2/4 = 0.5

Φ=8/6 = 1.3

Φ=64/14 = 4.5

Each node has twice as many children

Φ=1/3 = 0.33

1306/14/2009 Jure Leskovec, ICML '09

Page 131: Netwoks icml09

Definition: Whisker is a maximal set of nodes connected to the network by a single edge

Whiskers are responsible for

downward slope of NCP plot

NCP plot

Best community.How does it scale

with network size?

1316/14/2009 Jure Leskovec, ICML '09

Page 132: Netwoks icml09

Each dot is a different network132

Practically constant!

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 133: Netwoks icml09

133

Whiskers:Edge to cut

Whiskers in real networks are non-trivial (richer than trees)

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 134: Netwoks icml09

134

Whiskers Whiskers in real networks are larger than expected based on density and degree sequence

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 135: Netwoks icml09

1356/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 136: Netwoks icml09

136

Nothing happens!Now we have 2-edge connected whiskers to

deal with. Indicates the recursiveness of our core-periphery structure: as we remove the

periphery, the core itself breaks into core and the periphery

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 137: Netwoks icml09

Network structure: Core-periphery (jellyfish, octopus)

Whiskers are responsible for

good communities

Denser and denser core of

the network

Core contains ~60% nodes and

~80% edges

1376/14/2009 Jure Leskovec, ICML '09

Page 138: Netwoks icml09

What if we allow cuts that give disconnected communities?

• Compose communities out of whiskers• How good “community” do we get?

1386/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 139: Netwoks icml09

LiveJournal

Rewired network

Local spectral

Bag-of-whiskers

Metis+MQI

1396/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 140: Netwoks icml09

Regularization properties: spectral embeddings stretch along directions in which the random-walk mixes slowly Resulting hyperplane cuts have "good" conductance

cuts, but may not yield the optimal cuts

spectral embedding flow based embedding

1406/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 141: Netwoks icml09

Metis+MQI (red) gives sets with better conductance.

Local Spectral (blue) gives tighter and more well-rounded sets.

141

ext/

int

Dot

s ar

e co

nnec

ted

clus

ters

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 142: Netwoks icml09

Two ca. 500 node communities from Local Spectral:

Two ca. 500 node communities from Metis+MQI:

1426/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 143: Netwoks icml09

... can be computed from: Spectral embedding

(independent of balance) SDP-based methods (for

volume-balanced partitions)

1436/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 144: Netwoks icml09

Core-periphery

Small good communities

Denser and denser core of

the network

144

So, what’s a good model?

6/14/2009 Jure Leskovec, ICML '09

Page 145: Netwoks icml09

What do estimated parameters tell us about the network structure?

145

=1K a bc d a edges d edges

b edges

c edges

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv 09]

Page 146: Netwoks icml09

What do estimated parameters tell us about the network structure?

146

Core0.9 edges

Periphery0.1 edges

0.5 edges

0.5 edges

Core-periphery

=1K 0.9 0.50.5 0.1

6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 147: Netwoks icml09

Small and large networks are very different:

147

0.99 0.540.49 0.13

0.99 0.170.17 0.82

K1 = K1 =6/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 148: Netwoks icml09

Small communities: Largest have ≈100 nodes Community size is independent of network size

Core: 60% of the nodes, 80% edges Core has little structure (hard to cut) Still more structure than the random network

6/14/2009 Jure Leskovec, ICML '09 148

Page 149: Netwoks icml09

Compare to networks where nodes explicitly declare group membership: LiveJournal12: users create and explicitly join on-line groups

DBLP co-authorships: publication venues can be viewed as communities

Amazon product co-purchasing: each item belongs to one or more hierarchically organized

categories, as defined by Amazon IMDB collaboration: countries of production and languages may be viewed as

communities

6/14/2009 Jure Leskovec, ICML '09 149

Page 150: Netwoks icml09

LiveJournal DBLP

Amazon IMDB

RewiredNetworkGround truth

1506/14/2009 Jure Leskovec, ICML '09

[Leskovec et al. Arxiv ‘09]

Page 151: Netwoks icml09

Community structure of large networks: Recursive Core-periphery structure Scale to natural community size: Dunbar number 150 individuals is maximum community size

Model: Kronecker graphs Analytically tractable: provable properties Can efficiently estimate parameters from data

1516/14/2009 Jure Leskovec, ICML '09

Page 152: Netwoks icml09

Large social & information networks No large clusters: no/little hierarchical structure Can’t be well embedded – no underlying geometry

Are fundamentally different from small networks and manifolds

So… in large networks… Manifold learning won’t really work Semi-supervised learning ideas won’t really work

(in the core)

152

Page 153: Netwoks icml09

Statistical properties of networks across various domains Key to understanding the behavior of many

“independent” nodes Models of network structure and growth Help explain, think and reason about properties

Prediction, understanding of the structure Fitting the models

Jure Leskovec, ICML '09 1536/14/2009

Page 154: Netwoks icml09

How to systematically characterize the network structure?

How do properties relate to one another?

Is there something else we should measure?

Jure Leskovec, ICML '09 1546/14/2009

Page 155: Netwoks icml09

Why are networks the way they are?

Steer the network evolution

Predictive modeling of large communities Online massively multi-player games are closed

worlds with detailed traces of activity

Design systems (networks) that will Be robust to node failures Support local search (navigation): P2P networks

6/14/2009 Jure Leskovec, ICML '09 155

Page 156: Netwoks icml09

Why are networks the way they are? Only recently have basic properties been

observed on a large scale Confirms social science intuitions; calls others

into question

What are good tractable network models? Builds intuition and understanding

Benefits of working with large data Observe structures not visible at smaller scales

1566/14/2009 Jure Leskovec, ICML '09