Academy of Management Montreal , 6 August 2010 Empirical Exploration of Complexity in Human Systems: Data Collection & Interpretation Techniques Power law statistics and Pareto Science Pierpaolo Andriani Durham Business School, University of Durham, UK
33
Embed
Academy of Management Montreal, 6 August 2010 Empirical Exploration of Complexity in Human Systems: Data Collection & Interpretation Techniques Power law.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Academy of Management
Montreal , 6 August 2010
Empirical Exploration of
Complexity in Human Systems:
Data Collection & Interpretation Techniques
Power law statistics and Pareto Science
Pierpaolo Andriani
Durham Business School, University of Durham, UK
sampling & inferenceLIKELIHOOD distribution:
PROB (data given population)
INFERENTIAL distribution:
PROB (population given data)
sample
sample mean
sample variance
etc.
sample size
population
mean
variance
etc.
size is infinite
infer a value
take a sampleStatistical inference: Drawing conclusions
about the whole population on the basis of
a sample
Precondition for statistical inference:
A sample is randomly selected from the
population (=probability sample)
Representative agent links
population to sample level and
allows reduction of population
complexity to single agent
complexity
From Starbuck: The production of knowledge (2006)
• Consensus favoring use of null-hypothesis significance tests affords a clear example of paradigm stability. Although methodologists have been trying to discourage the use of these tests since the 1950s, the tests have remained very prevalent, and there is no sign that social scientists are shifting to other criteria. …. Hubbard and Ryan (2000: 678) concluded: ‘It seems inconceivable to admit that a methodology as bereft of value as SST (statistical significance tests) has survived, as the centerpiece of inductive inference no less, more than four decades of criticism in the psychological literature’.
p. 77
Starbucks: The production of knowledge (2006)
Starbucks: The production of knowledge (2006)
• Choosing two variables utterly at random, a researcher has 2-to-1 odds of finding a significant correlation on the first try, and 24-to-1 odds of finding a significant correlation within three tries. … the main inference I drew from these statistics was that the social sciences are drowning in statistically significant but meaningless noise. Because the differences and correlations that social scientists test have distributions quite different from those assumed in hypothesis test, social scientists are using tests that assign statistical significance to confounding background relationships. Because social scientists equate statistical significance with meaningful relationships, they often mistake confounding background relationships for theoretically important information. One result is that social science research creates a cloud of statistically significant differences and correlations that not only have no real meaning but also impede scientific progress by obscuring the truly meaningful relationships.
p. 49
Starbucks: The production of knowledge (2006)
• I began to think of statistical tests as arcane rituals that demonstrate membership in an esoteric subculture
Rationality, stock market and the butterfly effect
Growth-related power laws - ratio imbalances
1Surface /
volume Law
Organisms; villages: In organisms, surfaces absorbing energy grow by the square but the organism grows by the volume, resulting in an imbalance (Galileo 1638, Carneiro 1987); fractals emerge to bring surface/volume back into balance. West and Brown (1997) show that several phenomena in biology such as metabolic rate, height of trees, life span, etc. are described by allometric power law whose exponent is a multiple of ±¼. The cause is a fractal distribution of resources. Allometric power laws hold across 27 orders of magnitude (of mass).
2Least effort
Language; transition: Word frequency is a function of ease of usage by both speaker/writer and listener/reader; this gives rise to Zipf’s (power) Law (1949); now found to apply to language, firms, and economies in transition (Ferrer i Cancho & Solé, 2003; Dahui et al., 2005; Ishikawa, 2005; Podobnik et al., 2006).
3Hierarchical modularity
Growth unit connectivity: As cell fission occurs by the square, connectivity increases by n(n–1)/2, producing an imbalance between the gains from fission vs. the cost of maintaining connectivity; consequently organisms form modules so as to reduce the cost of connectivity; Simon argued that adaptive advantage goes to “nearly decomposable” systems (Simon, 1962; Bykoski, 2003). Complex adaptive systems: Heterogeneous agents seeking out other agents to copy/learn from so as to improve fitness generate networks; there is some probability of positive feedback such that some networks become groups, some groups form larger groups & hierarchies (Kauffman, 1969, 1993; Holland, 1995).
Combinations
4Interactive Breakage
theory
Wealth; mass extinctions/explosions: A few independent elements having multiplicative effects produce lognormals; if the elements become interactive with positive feedback loops materializing, a power law results; based on Kolmogorov’s “breakage theory” of wealth creation (1941).
5Combination
theory
# of exponentials; complexity: Multiple exponential or lognormal distributions or increased complexity of components (subtasks, processes) sets up, which results in a power law distribution (Mandelbrot, 1963; West & Deering, 1995; Newman, 2005).
6Interacting
fractals
Food web; firm & industry size, heartbeats: The fractal structure of a species is based on the food web (Pimm, 1982), which is a function of the fractal structure of predators and niche resources (Preston 1950; Halloy, 1998; Solé & Alonso, 1998; Camacho & Solé, 2001; Kostylev & Erlandsson, 2001, West, 2006).
Positive feedback loops
7Preferential attachment
Nodes; gravitational attraction: Given newly arriving agents into a system, larger nodes with an enhanced propensity to attract agents will become disproportionately even larger, resulting in the power law signature (Yule, 1925; Young, 1928; Arthur, 1988; Barabási, 2000).
8Irregularity generated gradients
Coral growth; blockages: Starting with a random, insignificant irregularity, coupled with positive feedback, the initial irregularity increases its effect. This explains the growth of coral reefs, blockages changing the course of rivers, (Juarrero, 1999; Turner, 2000; Barabási, 2005). Diffusion limited accretion (DLA). See also “niche constructionism” in biology (Odling-Smee, 2003)
Contextual effects
9Phase
transitions
Turbulent flows: Exogenous energy impositions cause autocatalytic, interaction effects and percolation transitions at a specific energy level—the 1st critical value—such that new interaction groupings form with a Pareto distribution (Bénard, 1901; Prigogine, 1955; Stauffer, 1985; Newman, 2005).
10Self-
organized criticality
Sandpiles; forests; heartbeats: Under constant tension of some kind (gravity, ecological balance, delivery of oxygen), some systems reach a critical state where they maintain stasis by preservative behaviors—such as sand avalanches, forest fires, changing heartbeat rate—which vary in size of effect according to a power law (Bak et al., 1987; Drossel & Schwabl, 1992; Bak, 1996).
11Niche
proliferation
Markets: When production, distribution, and search become cheap and easily available, markets develop a long tail of proliferating niches containing fewer customers; they become Paretian with mass-market products at one end and a long tail of niches at the other (Anderson, 2006).
Gaussian – heights of individualsTallest man (Robert Pershing Wadlow) 272 cm