Top Banner
Competing for attention: branching-process models of meme popularity James P. Gleeson MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland #branching www.ul.ie/gleeson [email protected] @gleesonj NetSci14, Berkeley, 5 June 2014
40

NetSci14 invited talk: Competing for attention

Aug 23, 2014

Download

Science

James Gleeson

Invited talk at @Netsci14 (5 June 2014). Branching-process models of meme popularity.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: NetSci14 invited talk: Competing for attention

Competing for attention: branching-process models of meme popularity

James P. Gleeson

MACSI, Department of Mathematics and Statistics, University of Limerick, Ireland

#branching

www.ul.ie/gleeson [email protected]

@gleesonj

NetSci14, Berkeley, 5 June 2014

Page 2: NetSci14 invited talk: Competing for attention

Branching processes for meme popularity models Overview

Ξ¦

Memory

Network Competition π‘π‘˜

Page 3: NetSci14 invited talk: Competing for attention

Branching processes for meme popularity models Part 1

Memory

Network Competition

Page 4: NetSci14 invited talk: Competing for attention

Motivating examples from empirical work on Twitter

Twitter 15M one-year dataset: collaboration with R. BaΓ±os and Y. Moreno

𝛼 = 2

fraction of hashtags with popularity β‰₯ 𝑛 at age π‘Ž

Page 5: NetSci14 invited talk: Competing for attention

Branching processes for meme popularity models Part 2

Memory

Network Competition

Page 6: NetSci14 invited talk: Competing for attention

Simon’s model

β€’ Simon, β€œOn a class of skew distribution functions”, Biometrica, 1955 β€’ The basis of β€œcumulative advantage” and β€œpreferential attachment” models;

see Simkin and Roychowdhury, Phys. Rep., 2011 β€’ During each time step, one word is added to an ordered sequence

β€’ With probability πœ‡, the added word is an innovation (a new word)

β€’ With probability 1 βˆ’ πœ‡, a previously-used word is copied; the copied word is

chosen at random from all words used to date

time

Page 7: NetSci14 invited talk: Competing for attention

β€’ Simulation results at age π‘Ž = 25000: seed time is 𝜏, observation time is Ξ© = 𝜏 + 25000

β€’ Early-mover advantage; fixed-age distributions have exponential tails

[Simkin and Roychowdyury, 2007]

Simon’s model

πœ‡ = 0.02

Page 8: NetSci14 invited talk: Competing for attention

Simon’s model as a branching process

β€’ During each time step, one word is added to an ordered sequence β€’ With probability πœ‡, the added word is an innovation (a new word) β€’ With probability 1 βˆ’ πœ‡, a previously-used word is copied; the copied word is

chosen at random from all words used to date

𝑑 = 𝜏 𝑑 = Ξ©

Page 9: NetSci14 invited talk: Competing for attention

A word on probability generating functions (PGFs)

β€’ PGFs are β€œtransforms” of probability distributions: define PGF 𝑓(π‘₯) by β€’ …but β€œinverse transform” usually requires numerical methods, e.g. Fast

Fourier Transforms [Cavers, 1978] β€’ Some properties:

β€’ PGF for the sum of independent random variables is the product of the

PGFs for each of the random variables e.g., H. S. Wilf, generatingfunctionology, CRC Press, 2005

𝑓 π‘₯ = οΏ½π‘π‘˜π‘₯π‘˜βˆž

π‘˜=0

𝑓 1 = οΏ½π‘π‘˜

∞

π‘˜=0

= 1 𝑓𝑓 1 = οΏ½π‘˜ π‘π‘˜

∞

π‘˜=0

= 𝑧

Page 10: NetSci14 invited talk: Competing for attention

Branching processes solution of Simon’s model

β€’ Define π‘žπ‘›(𝜏,Ξ©) as the probability that the word born at time 𝜏 has been used a total of 𝑛 times by the observation time Ξ©

β€’ Define 𝐻(𝜏,Ξ©, π‘₯) as the PGF for the popularity distribution

𝐻 𝜏,Ξ©, π‘₯ = οΏ½π‘žπ‘› 𝜏,Ξ© π‘₯π‘›βˆž

𝑛=1

β€’ Define 𝐺 𝜏,Ξ©, π‘₯ as the PGF for the excess popularity distribution, so that

𝐻 𝜏,Ξ©, π‘₯ = π‘₯ 𝐺 𝜏,Ξ©, π‘₯

and 𝐺 Ξ©,Ξ©, π‘₯ = 1

Page 11: NetSci14 invited talk: Competing for attention

Outcome for seed word Probability Contribution to 𝐺 𝜏,Ξ©, π‘₯

Copied at 𝜏 + Ξ”πœ (1 βˆ’ πœ‡) Ξ”π‘‘πœ

π‘₯ 𝐺 𝜏 + Δ𝑑 2

Not copied 1 βˆ’ (1 βˆ’ πœ‡) Ξ”π‘‘πœ

𝐺 𝜏 + Δ𝑑

𝐺 𝜏,Ξ©, π‘₯ = 1 βˆ’ πœ‡

π›₯π‘‘πœπ‘₯ 𝐺 𝜏 + π›₯𝑑,𝛺, π‘₯ 2 + 1 βˆ’ (1 βˆ’ πœ‡)

π›₯π‘‘πœ

𝐺 𝜏 + π›₯𝑑,𝛺, π‘₯

β‡’ βˆ’πœ•πΊπœ•πœ

β‰ˆ1 βˆ’ πœ‡πœ

π‘₯ 𝐺2 βˆ’ 𝐺

𝜏 𝜏 + Δ𝑑

Ξ© ≫ 𝜏 ≫ Δ𝑑 when

Branching processes solution of Simon’s model

Page 12: NetSci14 invited talk: Competing for attention

β‡’ 𝐺 𝜏,Ξ©, π‘₯ =𝜏Ω

1βˆ’πœ‡

1 βˆ’ π‘₯ 1 βˆ’ 𝜏Ω

1βˆ’πœ‡

βˆ’πœ•πΊπœ•πœ

=1 βˆ’ πœ‡πœ

π‘₯ 𝐺2 βˆ’ 𝐺

Using 𝐻 = π‘₯ 𝐺, the corresponding popularity distribution is

π‘žπ‘› 𝜏,Ξ© =𝜏Ω

1βˆ’πœ‡1 βˆ’

𝜏Ω

1βˆ’πœ‡ π‘›βˆ’1

Mean (expected) popularity:

π‘š 𝜏,Ξ© = �𝑛 π‘žπ‘›(𝜏,Ξ©)∞

𝑛=1

=πœ•π»πœ•π‘₯

𝜏,Ω, 1 =Ω𝜏

1βˆ’πœ‡

β€œEarly-mover advantage”

Branching processes solution of Simon’s model

Page 13: NetSci14 invited talk: Competing for attention

πœ‡ = 0.02

β€’ Simulation results at age π‘Ž = 25000: set Ξ© = 𝜏 + 25000

β€’ Early-mover advantage; fixed-age distributions have exponential tails

[Simkin and Roychowdyury, 2007]

Branching processes solution of Simon’s model

Page 14: NetSci14 invited talk: Competing for attention

Note 𝛼 β‰₯ 2

β€’ Power-law distributions arise only after averaging over seed times:

π‘žπ‘› Ξ© ≑ οΏ½ π‘žπ‘› 𝜏,Ξ©1Ξ©

π‘‘πœΞ©

0

= 1

1 βˆ’ πœ‡ 𝐡 𝑛,

2 βˆ’ πœ‡1 βˆ’ πœ‡

∼ π‘›βˆ’π›Ό as 𝑛 β†’ ∞, with 𝛼 = 2βˆ’πœ‡1βˆ’πœ‡

Branching processes solution of Simon’s model

Page 15: NetSci14 invited talk: Competing for attention

A generalization of Simon’s model

Probability that a copying event at time 𝑑 chooses the word from time 𝜏 𝜏 𝑑

πœ™ 𝜏, 𝑑 Δ𝑑

Simon’s model: πœ™ 𝜏, 𝑑 = 1𝑑

Copying with memory models: (e.g. Cattuto et al. 2007, Bentley et al. 2011)

πœ™ 𝜏, 𝑑 = Ξ¦(𝑑 βˆ’ 𝜏)

𝐺 𝜏,Ξ©, π‘₯ β‰ˆ exp (1 βˆ’ πœ‡)οΏ½ πœ™ 𝜏, 𝑑 π‘₯ 𝐺 𝑑,Ξ©, π‘₯ βˆ’ 1 𝑑𝑑Ω

𝜏

𝐺 Ξ©,Ξ©, π‘₯ = 1 with Ξ© ≫ 𝜏 ≫ Δ𝑑, when

Page 16: NetSci14 invited talk: Competing for attention

A generalization of Simon’s model

Probability that a copying event at time 𝑑 chooses the word from time 𝜏 𝜏 𝑑

𝐺 𝜏,Ξ©, π‘₯ = exp (1 βˆ’ πœ‡)οΏ½ πœ™ 𝜏, 𝑑 π‘₯ 𝐺 𝑑,Ξ©, π‘₯ βˆ’ 1 𝑑𝑑Ω

𝜏

Age of seed at observation time is π‘Ž = Ξ© βˆ’ 𝜏

For πœ™ 𝜏, 𝑑 = Ξ¦(𝑑 βˆ’ 𝜏), let 𝐺 𝜏,Ξ©, π‘₯ = 𝐺�(Ξ© βˆ’ 𝜏, π‘₯)

β‡’ 𝐺� π‘Ž, π‘₯ = exp (1 βˆ’ πœ‡)οΏ½ Ξ¦(𝑠) π‘₯ 𝐺� π‘Ž βˆ’ 𝑠, π‘₯ βˆ’ 1 π‘‘π‘ π‘Ž

0

β€’ In this case, popularity distributions depend only on the age of the seed; there is no early-mover advantage

πœ™ 𝜏, 𝑑 Δ𝑑

Page 17: NetSci14 invited talk: Competing for attention

β€’ Simulation results at age π‘Ž = 25000: set Ξ© = 𝜏 + 25000

β€’ Memory-time distribution: πœ™ 𝜏, 𝑑 = Ξ¦ 𝑑 βˆ’ 𝜏 = 1π‘‡π‘’βˆ’(π‘‘βˆ’πœ)/𝑇, with 𝑇 = 500

A generalization of Simon’s model

β€’ In this case, popularity distributions depend only on the age of the seed; there is no early-mover advantage

πœ‡ = 0.02

𝛼 = 1.5

Page 18: NetSci14 invited talk: Competing for attention

Competition-induced criticality

Simon’s original model, and the copying-with-memory model both have the following features:

β€’ One word is added in each time step

β€’ Words β€œcompete” for user attention in order to become popular β€’ The words have equal β€œfitness” – a type of β€œneutral model” [Pinto and

MuΓ±oz 2011, Bentley et al. 2004 ]

β€’ … except for the early-mover advantage in Simon’s model…

but only the copying-with-memory model gives critical branching processes.

β€’ Gleeson JP, Cellai D, Onnela J-P, Porter MA, Reed-Tsochas F, β€œA simple generative model of collective online behaviour” arXiv :1305.7440v2

Page 19: NetSci14 invited talk: Competing for attention

Branching processes for meme popularity models Part 3

Memory

Network Competition

Page 20: NetSci14 invited talk: Competing for attention

β€’ Each node (of 𝑁) has a memory screen, which holds the meme of current interest to that node. Each screen has capacity for only one meme.

β€’ During each time step (Δ𝑑 = 1/𝑁), one node is chosen at random. β€’ With probability πœ‡, the selected node innovates, i.e., generates a brand-new

meme, that appears on its screen, and is tweeted (broadcast) to all the node's followers.

β€’ Otherwise (with probability 1 βˆ’ πœ‡), the selected node (re)tweets the meme currently on its screen (if there is one) to all its followers, and the screen is unchanged. If there is no meme on the node's screen, nothing happens.

β€’ When a meme π‘š is tweeted, the popularity π‘›π‘š of meme π‘š is incremented by 1 and the memes currently on the followers' screens are overwritten by meme π‘š.

The Markovian Twitter model

Page 21: NetSci14 invited talk: Competing for attention

β€’ Network structure: a node has π‘˜ followers (out-degree π‘˜) with probability π‘π‘˜.

β€’ In-degree distribution (number of followings) has a Poisson distribution. β€’ Mean degree 𝑧 = βˆ‘ π‘˜π‘π‘˜π‘˜ .

β€’ A simplified version of the model of Weng, Flammini, Vespignani, Menczer,

Scientific Reports 2, 335 (2012). β€’ Related to the random-copying β€œneutral” (Moran-type) models of Bentley

et al. 2004 [Bentley et al. I’ll Have What She’s Having: Mapping Social Behavior, MIT Press, 2011], where the distribution of popularity increments can be obtained analytically [Evans and Plato, 2007].

β€’ Our focus is on the distributions of popularity accumulated over long timescales: when a meme π‘š is tweeted, the popularity π‘›π‘š of meme π‘š is incremented by 1.

The Markovian Twitter model

Page 22: NetSci14 invited talk: Competing for attention

β€’ When all screens are non-empty, memes compete for the limited resource of user attention

β€’ Random fluctuations lead to some memes becoming very popular, while others languish in obscurity

The Markovian Twitter model

Page 23: NetSci14 invited talk: Competing for attention

β€’ Random fluctuations lead to some memes becoming very popular, while others languish in obscurity

β€’ The popularity distributions depend on the structure of the network, through the out-degree distribution π‘π‘˜

πœ‡ = 0

π‘π‘˜ = π›Ώπ‘˜,10

The Markovian Twitter model

Page 24: NetSci14 invited talk: Competing for attention

β€’ Random fluctuations lead to some memes becoming very popular, while others languish in obscurity

β€’ The popularity distributions depend on the structure of the network, through the out-degree distribution π‘π‘˜

πœ‡ = 0.01

π‘π‘˜ ∝ π‘˜βˆ’π›Ύ; 𝛾 = 2.5

The Markovian Twitter model

Page 25: NetSci14 invited talk: Competing for attention

overwritten 𝑧 Δ𝑑

𝑑 𝑑 + Δ𝑑

Branching processes solution of Twitter model

Define 𝐺(π‘Ž, π‘₯) as the PGF for the excess popularity distribution at age π‘Ž of memes that originate from a single randomly-chosen screen (the root screen)

π‘Ž π‘Ž βˆ’ Δ𝑑

Outcome for screen 𝑆1 Probability

πœ•πΊπœ•π‘Ž

= 𝑧 + πœ‡ βˆ’ 𝑧 + 1 𝐺 + 1 βˆ’ πœ‡ π‘₯𝐺𝑓(𝐺) 𝑓 π‘₯ = οΏ½π‘π‘˜π‘₯π‘˜βˆž

π‘˜=0

𝐺 0, π‘₯ = 1

selected, innovates πœ‡ Δ𝑑

selected, retweets (1 βˆ’ πœ‡) Δ𝑑

not selected, survives 1 βˆ’ (𝑧 + 1) Δ𝑑

Page 26: NetSci14 invited talk: Competing for attention

πœ•πΊπœ•π‘Ž

= 𝑧 + πœ‡ βˆ’ 𝑧 + 1 𝐺 + 1 βˆ’ πœ‡ π‘₯𝐺𝑓(𝐺)

𝐻 π‘Ž, π‘₯ = οΏ½π‘žπ‘› π‘Ž π‘₯𝑛 = π‘₯𝐺 π‘Ž, π‘₯ 𝑓(𝐺 π‘Ž, π‘₯ )∞

𝑛=0

Analysis of the branching process equation

Mean popularity of age-π‘Ž memes:

π‘š π‘Ž = οΏ½π‘›π‘žπ‘›(π‘Ž)∞

𝑛=1

=πœ•π»πœ•π‘₯

π‘Ž, 1 = 1 + (𝑧 + 1)πœ•πΊπœ•π‘₯

π‘Ž, 1

So: π‘‘π‘šπ‘‘π‘Ž

= (𝑧 + 1)(1 βˆ’ πœ‡ π‘š)

with π‘š 0 = 1

Page 27: NetSci14 invited talk: Competing for attention

πœ•πΊπœ•π‘Ž

= 𝑧 + πœ‡ βˆ’ 𝑧 + 1 𝐺 + 1 βˆ’ πœ‡ π‘₯𝐺𝑓(𝐺)

𝐻 π‘Ž, π‘₯ = οΏ½π‘žπ‘› π‘Ž π‘₯𝑛 = π‘₯𝐺 π‘Ž, π‘₯ 𝑓(𝐺 π‘Ž, π‘₯ )∞

𝑛=0

Analysis of the branching process equation

Mean popularity of age-π‘Ž memes:

π‘š π‘Ž = οΏ½ 1 + 𝑧 + 1 π‘Ž if πœ‡ = 01πœ‡βˆ’

1 βˆ’ πœ‡πœ‡

π‘’βˆ’πœ‡ 𝑧+1 π‘Ž if πœ‡ > 0

Page 28: NetSci14 invited talk: Competing for attention

Analysis of the branching process equation

Mean popularity of age-π‘Ž memes:

π‘š π‘Ž = οΏ½ 1 + 𝑧 + 1 π‘Ž if πœ‡ = 01πœ‡βˆ’

1 βˆ’ πœ‡πœ‡

π‘’βˆ’πœ‡ 𝑧+1 π‘Ž if πœ‡ > 0

Page 29: NetSci14 invited talk: Competing for attention

Long-time (old-age) asymptotics

β€’ If 𝑓′′ 1 < ∞ (finite second moment of π‘π‘˜),

π‘žπ‘› ∞ ∼ 𝐴 π‘’βˆ’π‘›πœ… π‘›βˆ’

32 as 𝑛 β†’ ∞

with πœ… = 2πœ‡2

𝑓′′ 1 +2𝑧𝑧+1 2

β€’ If π‘π‘˜ ∝ π‘˜βˆ’π›Ύ for large π‘˜ with 2 < 𝛾 < 3,

π‘žπ‘› ∞ ∼ �𝐡 π‘›βˆ’π›Ύ

π›Ύβˆ’1 if πœ‡ = 0𝐢 π‘›βˆ’π›Ύ if πœ‡ > 0

as 𝑛 β†’ ∞

πœ•πΊπœ•π‘Ž

= 0

cf. sandpile SOC on networks [Goh et al. 2003]

Page 30: NetSci14 invited talk: Competing for attention

Comparing branching process theory with simulations

π‘π‘˜ = π›Ώπ‘˜,10

π‘π‘˜ ∝ π‘˜βˆ’π›Ύ 𝛾 = 2.5

πœ‡ = 0.01

πœ‡ = 0

Page 31: NetSci14 invited talk: Competing for attention

Branching processes for meme popularity models Part 4

Memory

Network Competition

Page 32: NetSci14 invited talk: Competing for attention

Twitter model with memory

Ξ¦

β€’ During each time step (with time increment Δ𝑑 = 1/𝑁), one node is chosen at random.

β€’ The selected node may innovate (with probability πœ‡), or it may retweet a meme from its memory using the memory distribution Ξ¦(𝑑 βˆ’ 𝜏).

Page 33: NetSci14 invited talk: Competing for attention

β€’ Define 𝐺(π‘Ž, π‘₯) as the PGF for the excess popularity distribution at age π‘Ž of memes that originate from a single randomly-chosen seed (the root)

β€’ The mean popularity π‘š(π‘Ž) of age-π‘Ž memes has Laplace transform:

Branching process analysis

𝐺 π‘Ž, π‘₯ = οΏ½π‘π‘˜ οΏ½ 𝑑𝑑 (𝑧 + πœ‡)π‘’βˆ’ 𝑧+πœ‡ 𝑑 Γ—βˆž

0 π‘˜

Γ— exp βˆ’ 1 βˆ’ πœ‡ οΏ½ 𝑑𝑑min 𝒕,π‘Ž

0

οΏ½ π‘‘πœπ‘Žβˆ’π‘Ÿ

0 Ξ¦ π‘Ž βˆ’ 𝑑 βˆ’ 𝜏 1 βˆ’ π‘₯ 𝐺 𝜏, π‘₯ π‘˜

π‘šοΏ½ 𝑠 = 𝑧 + πœ‡ + 𝑠 + 1 βˆ’ πœ‡ Ξ¦οΏ½(𝑠)

𝑠 𝑧 + πœ‡ + 𝑠 βˆ’ 1 βˆ’ πœ‡ 𝑧 Ξ¦οΏ½(𝑠)

Page 34: NetSci14 invited talk: Competing for attention

Ξ¦

Memory

Network Competition π‘π‘˜

Comparing the model to data

𝛾 β‰ˆ 2.13

Page 35: NetSci14 invited talk: Competing for attention

Ξ¦ 𝜏 = Gamma(π‘˜,πœƒ)

=1

Ξ“ π‘˜ πœƒπ‘˜ πœπ‘˜βˆ’1π‘’βˆ’πœ/πœƒ

π‘˜ = 0.2; πœƒ = 355

Ξ¦

π‘šοΏ½ 𝑠 = 𝑧 + πœ‡ + 𝑠 + 1 βˆ’ πœ‡ Ξ¦οΏ½(𝑠)

𝑠 𝑧 + πœ‡ + 𝑠 βˆ’ 1 βˆ’ πœ‡ 𝑧 Ξ¦οΏ½(𝑠)

Comparing the model to data

πœ‡ = 0.02

Page 36: NetSci14 invited talk: Competing for attention

Comparing the model to data

Data Model

Page 37: NetSci14 invited talk: Competing for attention

Conclusions: Branching processes for meme popularity models

Ξ¦

Memory

Network Competition π‘π‘˜

Page 38: NetSci14 invited talk: Competing for attention

β€’ Competition between memes for the limited resource of user attention induces criticality in this model in the πœ‡ β†’ 0 limit

β€’ Criticality gives power-law popularity distributions and epochs of linear-in-time popularity growth, even for (cf. Weng et al. 2012) – homogeneous out-degree distributions – homogeneous user activity levels

β€’ Despite its simplicity, the model matches the empirical popularity

distribution of real memes (hastags on Twitter) remarkably well

β€’ Generalizations of the model are possible, and remain analytically tractable

Conclusions: Competition-induced criticality

β‡’ a useful null model to understand how memory, network structure and competition affect popularity distributions

Page 39: NetSci14 invited talk: Competing for attention

Davide Cellai, UL Mason Porter, Oxford J-P Onnela, Harvard Felix Reed-Tsochas, Oxford

Jonathan Ward, Leeds Kevin O’Sullivan, UL William Lee, UL

Yamir Moreno, Zaragoza Raquel A BaΓ±os, Zaragoza Kristina Lerman, USC

Science Foundation Ireland FP7 FET Proactive PLEXMATH SFI/HEA Irish Centre for High-End

Computing (ICHEC)

Collaborators, funding, references

β€’ β€œA simple generative model of collective online behaviour” arXiv :1305.7440v2 β€’ Physical Review Letters, 112, 048701 (2014); arXiv:1305.4328

Page 40: NetSci14 invited talk: Competing for attention

Branching processes for meme popularity models

Ξ¦

Memory

Network Competition π‘π‘˜

#branching www.ul.ie/gleesonj [email protected] @gleesonj