Top Banner
Significant scales in community structure V.A. Traag 1,2 , G. Krings 3 , P. Van Dooren 4 1 KITLV, Leiden, the Netherlands 2 e-Humanities, KNAW, Amsterdam, the Netherlands 3 Real Impact, Brussels, Belgium, 4 UCL, Louvain-la-Neuve, Belgium September 17, 2013 e Royal Netherlands Academy of Arts and Sciences Humanities
21

Significant scales in community structure

Jun 19, 2015

Download

Science

Vincent Traag

Presentation at ECCS 2013, Barcelona, September 17, 2013
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Significant scales in community structure

Significant scales in community structure

V.A. Traag1,2, G. Krings3, P. Van Dooren4

1KITLV, Leiden, the Netherlands2e-Humanities, KNAW, Amsterdam, the Netherlands

3Real Impact, Brussels, Belgium,4UCL, Louvain-la-Neuve, Belgium

September 17, 2013

eRoyal Netherlands Academy of Arts and SciencesHumanities

Page 2: Significant scales in community structure

Community Detection

Contant Potts Model (CPM)

• Minimize H(γ) = −∑ij(Aij − γ)δ(σi , σj)

• Resolution-limit-free

• Internal density pc > γ

• Density between pcd < γ

Page 3: Significant scales in community structure

Community Detection

Contant Potts Model (CPM)

• Minimize H(γ) = −∑ij(Aij − γ)δ(σi , σj)

• Resolution-limit-free

• Internal density pc > γ

• Density between pcd < γ

Page 4: Significant scales in community structure

Community Detection

Contant Potts Model (CPM)

• Minimize H(γ) = −∑ij(Aij − γ)δ(σi , σj) = −∑

c(ec − γn2c)

• Resolution-limit-free

• Internal density pc > γ

• Density between pcd < γ

Page 5: Significant scales in community structure

Community Detection

Contant Potts Model (CPM)

• Minimize H(γ) = −∑ij(Aij − γ)δ(σi , σj) = −∑

c(ec − γn2c)

• Resolution-limit-free

• Internal density pc > γ

• Density between pcd < γ

Page 6: Significant scales in community structure

Community Detection

Contant Potts Model (CPM)

• Minimize H(γ) = −∑ij(Aij − γ)δ(σi , σj) = −∑

c(ec − γn2c)

• Resolution-limit-free

• Internal density pc > γ

• Density between pcd < γ

Page 7: Significant scales in community structure

Community Detection

Contant Potts Model (CPM)

• Minimize H(γ) = −∑ij(Aij − γ)δ(σi , σj) = −∑

c(ec − γn2c)

• Resolution-limit-free

• Internal density pc > γ

• Density between pcd < γ

How to choose γ?

Page 8: Significant scales in community structure

Resolution profile

10−3 10−2 10−1 100103

104

105

106

γ

N E

Page 9: Significant scales in community structure

Significance

How significant is a partition?

Page 10: Significant scales in community structure

Significance

E = 14

E = 9

Fixed partition

E = 11

Better partition

Page 11: Significant scales in community structure

Significance

E = 14

E = 9

Fixed partition

E = 11

Better partition

• Not: Probability to find E edges in partition.

• But: Probability to find partition with E edges.

Page 12: Significant scales in community structure

Subgraph probability

Decompose partition

• Probability to find partition with E edges.

• Probability to find communities with ec edges.

• Asymptotic estimate

• Probability for subgraph of nc nodes with density pc

Pr(S(nc , pc) ⊆ G (n, p)) ≈ exp[−n2cD(pc ‖ p)

]

Significance

• Probability for all communities Pr(σ) ≈∏c

exp[−n2cD(pc ‖ p)

].

• Significance S(σ) = − log Pr(σ) =∑c

n2cD(pc ‖ p).

Page 13: Significant scales in community structure

Subgraph probability

Decompose partition

• Probability to find partition with E edges.

• Probability to find communities with ec edges.

• Asymptotic estimate

• Probability for subgraph of nc nodes with density pc

Pr(S(nc , pc) ⊆ G (n, p)) ≈ exp[−n2cD(pc ‖ p)

]

Significance

• Probability for all communities Pr(σ) ≈∏c

exp[−n2cD(pc ‖ p)

].

• Significance S(σ) = − log Pr(σ) =∑c

n2cD(pc ‖ p).

Page 14: Significant scales in community structure

Subgraph probability

Decompose partition

• Probability to find partition with E edges.

• Probability to find communities with ec edges.

• Asymptotic estimate

• Probability for subgraph of nc nodes with density pc

Pr(S(nc , pc) ⊆ G (n, p)) ≈ exp[−n2cD(pc ‖ p)

]

Significance

• Probability for all communities Pr(σ) ≈∏c

exp[−n2cD(pc ‖ p)

].

• Significance S(σ) = − log Pr(σ) =∑c

n2cD(pc ‖ p).

Page 15: Significant scales in community structure

Subgraph probability

Decompose partition

• Probability to find partition with E edges.

• Probability to find communities with ec edges.

• Asymptotic estimate

• Probability for subgraph of nc nodes with density pc

Pr(S(nc , pc) ⊆ G (n, p)) ≈ exp[−n2cD(pc ‖ p)

]

Significance

• Probability for all communities Pr(σ) ≈∏c

exp[−n2cD(pc ‖ p)

].

• Significance S(σ) = − log Pr(σ) =∑c

n2cD(pc ‖ p).

Page 16: Significant scales in community structure

Subgraph probability

Decompose partition

• Probability to find partition with E edges.

• Probability to find communities with ec edges.

• Asymptotic estimate

• Probability for subgraph of nc nodes with density pc

Pr(S(nc , pc) ⊆ G (n, p)) ≈ exp[−n2cD(pc ‖ p)

]

Significance

• Probability for all communities Pr(σ) ≈∏c

exp[−n2cD(pc ‖ p)

].

• Significance S(σ) = − log Pr(σ) =∑c

n2cD(pc ‖ p).

Page 17: Significant scales in community structure

Subgraph probability

Decompose partition

• Probability to find partition with E edges.

• Probability to find communities with ec edges.

• Asymptotic estimate

• Probability for subgraph of nc nodes with density pc

Pr(S(nc , pc) ⊆ G (n, p)) ≈ exp[−n2cD(pc ‖ p)

]

Significance

• Probability for all communities Pr(σ) ≈∏c

exp[−n2cD(pc ‖ p)

].

• Significance S(σ) = − log Pr(σ) =∑c

n2cD(pc ‖ p).

Page 18: Significant scales in community structure

Significance

10−3 10−2 10−1 100103

104

105

106

γ

N E

Page 19: Significant scales in community structure

Significance

10−3 10−2 10−1 100103

104

105

106

γ

N E S

Page 20: Significant scales in community structure

Benchmark

0.25

0.5

0.75

1

NM

In = 5000, Small

0

1S S∗

0 0.2 0.4 0.6 0.8 101

µ

S∗ 〈S〉

CPM+SigSignificanceModularity

InfomapOSLOM

Page 21: Significant scales in community structure

Conclusions

• Scan γ efficiently.

• Significance applicable in all methods.

• Correct comparison to random graph.

Traag, Krings, Van Dooren Significant scales in Community StructurearXiv:1306.3398

Thank you!Questions?

e-mail: [email protected] twitter: @vtraag