The Probability Sampling Tradition in a period of crisis Q2010 Keynote speech Carl-Erik Särndal Université de Montréal

Dec 14, 2015

Page 1:

The Probability Sampling Tradition in a period of crisis

Q2010 Keynote speech

Carl-Erik Särndal

Université de Montréal

Page 2:

The Probability Sampling Tradition

has governed surveys at National Statistical Institutes (NSI:s) for decades

Breaking a tradition : Not easy …

Page 3:

Background

The merits of probability sampling, also known as scientific sampling, are put in question by severe imperfections : non-sampling errors, economic pressures etc.

The problem not new – but more and more compelling

Page 4:

Background

The probability sampling process

• is expensive (through follow-ups);

• its theoretical merits are compromised (by nonresponse, etc.)

• “a few extra %” amount to very little

• alternative data collection methods exist

Yet probability sampling continues to be practiced. Wasteful ? Can we do without probability sampling?

Page 5:

My view

is a (Canadian) theoretician’s view

on (official) statistics production

To what extent guided by (statistical science) theory ?

Page 6:

Something we admire: Being able to predict facts about the world we live in by theoretical arguments and deduction

This is the predictive power of science

In statistics: Want precise statements, backed by convincing theory, of level of unemployment, of industrial production, and so on

Theory as a basis for science (knowledge)

Page 7:

Theory as a basis for science

Gérard Jorland : How is it possible that one can predict, merely by theoretical deductions, the existence of a new planet, or a new chemical element, or a new elementary particle?

Based only on a calculus, on a set of mathematical equations ... remarkable achievement of the human mind.

Page 8:

Famous example: Planet Neptune was “found” by mathematical prediction by Le Verrier in 1846, then empirically observed by Galle at the position given by Le Verrier

Many other examples come from physics, astronomy, chemistry

Page 9:

A hypothesis to test:

The sciences are predictive to the extent that they are mathematically formulated.

But that hypothesis is rejected : Today, Economics is highly mathematical and theoretical, but such arguments did not predict the current economic crisis, for example.

Page 10:

The contrast

Physics: Predictive power of formal theory very high

Economics: Predictive power of formal theory low

Page 11:

So “science formulated mathematically” does not guarantee “predictive power of theory”

Why then are Physics and Economics different? Both are theoretical (mathematical) .

Page 12:

Contrasts

Physics : the objects (planets, elementary particles, and so on) are inanimate ; predictive power very high

Economics : the objects and the participants (human beings) are unpredictable, relationships highly complex; predictive power very low

Page 13:

Theory as a guide in statistics production

Our ambition : Create knowledge (predictions) about our world through statistical surveys .

To what extent is this activity supported by theory ? To what extent scientific ?

Legitimate questions !

Some NSI:s take pride in “scientific principles”.

Page 14:

Sampling = Limiting attention to a small subset

To what extent scientific ?

We accept without hesitation that observing only n = 1,000 (or a few thousand) is enough - but provided the sample is “scientific”
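Why n = 1,000 is felt to be "enough" can be illustrated with a small calculation. This is a minimal sketch, assuming simple random sampling, a proportion as the target, and a 95% normal approximation; the function name and the chosen values are illustrative and not from the talk. It covers sampling error only.

```python
import math


def margin_of_error(p: float, n: int, z: float = 1.96) -> float:
    """Approximate margin of error for an estimated proportion p.

    Assumes simple random sampling from a large population (no finite
    population correction) and no nonsampling errors of any kind.
    """
    return z * math.sqrt(p * (1.0 - p) / n)


if __name__ == "__main__":
    # Worst case p = 0.5 for a few sample sizes, in percentage points.
    for n in (1000, 2000, 4000):
        print(n, round(100 * margin_of_error(0.5, n), 1))
```

Doubling the sample size does not halve the margin; it shrinks only with the square root of n, which is one reason a few thousand observations are usually accepted as sufficient when the sample is a probability sample.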

Page 15:

What is a scientific sample ? The Roper Center, Univ. of Connecticut, says :

A scientific sample is a process in which respondents are chosen randomly by one of several methods. The key component in the scientific sample is that everyone within the designated group (sample frame) has a chance of being selected.

We may add : Such a sample is also known as a probability sample. It is not necessarily a representative sample in the sense “all have the same probability”.

Page 16:

scientific sample

probability sample

representative sample

Around these terms, unfortunate ambiguity and confusion reign in the literature and in conversation

Ask, and you get a variety of responses

Page 17:

Sampling = Limiting attention to a small subset

Two contrasting examples:

Sampling trees in a forest - to predict volume

Sampling human beings in a country - to predict (assess) unemployment, or health conditions, or expenditures

Page 18:

Estimating volume of wood on a sample of trees

With classical probability sampling theory, we get not only a figure for the total volume of wood in the forest, but also a statement of its margin of error, free of any assumptions.

We can determine exactly the accuracy we want.
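The forest case can be sketched in a few lines. The talk does not name a design, so this example assumes simple random sampling without replacement of n trees from a forest of N trees, the expansion estimator for the total, and a 95% normal-approximation margin of error. The function names and the synthetic forest are hypothetical.

```python
import math
import random
import statistics


def estimate_total(sample, N, z=1.96):
    """Design-based estimate of a population total and its margin of error.

    Assumes simple random sampling without replacement of len(sample)
    units out of N, and complete, error-free measurement of each unit.
    """
    n = len(sample)
    mean = statistics.fmean(sample)
    s2 = statistics.variance(sample)           # sample variance (n - 1 divisor)
    total_hat = N * mean                       # expansion estimator
    var_hat = N ** 2 * (1 - n / N) * s2 / n    # estimated variance of the total
    return total_hat, z * math.sqrt(var_hat)


def required_sample_size(s2, N, target_moe, z=1.96):
    """Smallest n whose margin of error for the total is at most target_moe."""
    n0 = (z * N) ** 2 * s2 / target_moe ** 2   # size needed ignoring the fpc
    return math.ceil(n0 / (1 + n0 / N))        # finite population correction


if __name__ == "__main__":
    rng = random.Random(2010)
    # Synthetic forest: volume per tree in cubic metres (made-up values).
    forest = [rng.uniform(0.2, 1.5) for _ in range(5000)]
    sample = rng.sample(forest, 200)
    total_hat, moe = estimate_total(sample, len(forest))
    print(f"estimated total: {total_hat:.0f} +/- {moe:.0f} (true: {sum(forest):.0f})")
```

The second function is the sense in which "we can determine the accuracy we want": given a guess at the variance, the required n follows from the target margin. Both steps rely on every selected tree being measured, which is exactly what fails with human respondents.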

Page 19:

Estimating unemployed on a sample of people

From the Labour Force Survey (LFS) we get a figure, but we cannot quantify its margin of error. There is no objective declaration of numerical quality

because the following are unmeasured : nonresponse error, measurement error, frame error, recording and data handling error, and so on

Page 20:

The contrast

Trees are inanimate objects, like planets

Human beings, they are precisely that, human, inconsistent, emotional, prone to error

Page 21:

The contrast

Trees : Predictive power of probability sampling theory very high – objects do not “cause trouble”

People : Predictive power of sampling theory very low - the survey is complex; human beings are involved

Page 22:

A large scale statistical investigation (survey) :

“Unpredictable people are involved at so many points of this incredibly complex process”

so we will never have a theory that will allow precise measurement of total survey error

(Stanley McCarthy 2001)

Producing numbers is (relatively) easy ; by comparison, stating their accuracy is difficult

Page 23:

Article by Platek and Särndal : Can a statistician deliver ?

J. Official Statistics vol. 17 (2001), pp. 1 – 127

with 16 discussions

and a rejoinder by the authors

Can a statistician fulfill the promise (to society) ?

Upon rereading : Have we advanced any, in 10 years ?

Page 24:

The title : Can a statistician deliver ?

“Statistician” may denote

the head of a National Statistical Institute (NSI)

or

a person expert in the subject (labour market, or health issues, or manufacturing industry, etc.)

or

a person trained in statistical science (methodologist)

Page 25:

As expected, feelings conveyed were of two kinds:

high ranking NSI officials: “Keep the ship sailing”, despite difficult times

academics and researchers: Regret the absence of a more solid (theoretical) base for (national) statistics production

Page 26:

Three themes are prominent in the 16 discussions (summarized in the authors’ rejoinder) :

The role of theory

The scientific and professional credo of the NSI

The concept of quality in regard to the NSI’s activity

Page 27:

The uncertain future of the NSI

I. Fellegi (Statistics Canada) on survival of the NSI. “Survival beyond quality” depends on

• Respect for respondents, and

• Credibility of information; Accuracy is an important part, but so are Relevance, Transparency & others

Page 28:

The uncertain future of the NSI

I. Fellegi : A life and death question for the NSI is

credibility :

Information that is not believed will not be used, and the NSI has no function any more.

Can the NSI count on future high co-operation and truthful response ? -

More and more doubtful.

Page 29:

Believing numerical information

We have no objective measures of “margin of error”

But what about the Total Survey Error model ? (US Bureau of the Census, around 1950)

It recognizes total error as a sum of a number of components.

Can we not use these equations, this theory ?
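The idea of "total error as a sum of components" can be written schematically. This is a sketch only: the component labels are illustrative, taken from the error sources listed earlier in the talk, and are not the notation of the original Census Bureau model.

```latex
% Total error of an estimator \hat{Y} of the true value Y, split into components
\hat{Y} - Y \;=\; e_{\text{sampling}} + e_{\text{nonresponse}} + e_{\text{measurement}}
              + e_{\text{frame}} + e_{\text{processing}}

% Accuracy summarised as mean squared error: variance plus squared bias
\mathrm{MSE}(\hat{Y}) \;=\; E\big[(\hat{Y} - Y)^2\big]
                      \;=\; \mathrm{Var}(\hat{Y}) + \big[E(\hat{Y}) - Y\big]^2
```

Under probability sampling with full response, only the sampling term has a known distribution; the remaining terms would each need their own probability model before the equations yield a number.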

Page 30:

Believing information

The Total Survey Error model

• helped us to focus on specific components of total error

• disappointed us by failing to provide routine measures for the numerical quality of published statistics.

Page 31:

Believing information

Discussants of Can a statistician deliver ? deliver “a death sentence” on the TSE model :

“Unattainable and unrealistic ideal”

“Utopian project”

“Unrealistic utopian dream”

Theory is there, but it does not work

Some say: We choose not to use it

In question are the notions of “probability” and “probable error”

Page 32:

Statistics Canada Quality Guidelines (1998)

describes Survey Methodology as : “A collection of practices, backed by some theory and empirical evaluation, among which practitioners have to make sensible choices in the context of a particular application”

A patchwork of theories, one for questionnaire design, one for motivating response, one for data handling and editing, one for imputation, one for estimation in small areas, and so on

Fragmentation …

Page 33:

European Statistics Code of Practice (2005)

Sound methodology must underpin quality statistics. This requires adequate tools, procedures and expertise. The overall methodological framework of the statistical authority follows European and other international standards, guidelines, and good practices ... Survey designs, sample selections, and sample weights are well based and regularly reviewed, revised or updated …

(Emphasis is mine.) A “be-good” encouragement; what about “scientific underpinnings” ?

Page 34:

The stark reality

“Good practice” is the guide, not theory .

Numerical quality is not assured .

Large errors probably not infrequent; most go undetected .

So what ? - Other important professions are also guided by a bunch of “good practices”

Page 35:

The NSI’s situation

Its work is guided by “a collection of practices supported by some theory” plus requirement to keep response burden low

With this frail and fragmented base, the NSI must produce reliable Official Statistics, for the good of the nation, a solid basis for policy decisions

Not an enviable situation and a threat to NSI’s existence…

Page 36:

The Probability Sampling Tradition (born in 1930’s)

created the concept of Nonresponse Rate :

“the selected objects” (the probability sample) as compared with

“the data delivering objects” (the respondents)

We measure, steadfastly, sometimes misguidedly, the size ratio of those two sets

Page 37:

Our obsession with the Nonresponse Rate

When NR rate was 2%, nobody worried

When NR rate is now around 50%, we worry

• Intuitively because the non-responding may be systematically related to target variable values

• Probabilistically because “making the observation” (getting the response) has an unknown probability; the theory capsizes
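Both worries can be made concrete with a small simulation. It is a sketch under invented assumptions: a hypothetical population with 8% unemployed, and response probabilities (0.35 for the unemployed, 0.55 for others) that in a real survey would be unknown. All names and numbers here are illustrative.

```python
import random
import statistics


def simulate(n_sample=20_000, seed=1):
    """Compare the full probability sample with the responding subset."""
    rng = random.Random(seed)
    # Hypothetical population: y = 1 if unemployed, 0 otherwise.
    population = [1 if rng.random() < 0.08 else 0 for _ in range(100_000)]
    # Step 1: a probability sample (simple random sampling without replacement).
    sample = rng.sample(population, n_sample)
    # Step 2: response depends on y, with probabilities the survey never sees.
    respondents = [y for y in sample if rng.random() < (0.35 if y else 0.55)]
    return {
        "true_rate": statistics.fmean(population),
        "full_sample_rate": statistics.fmean(sample),
        "respondent_rate": statistics.fmean(respondents),
        "nonresponse_rate": 1 - len(respondents) / len(sample),
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.3f}")
```

The full-sample estimate stays close to the population value because its selection probabilities are known. The respondent-based estimate is pulled downward, and a larger sample would not repair it, since the second selection step has no known probabilities to weight by.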

Page 38:

The believers in Probability Sampling regret that the theory cannot cope

The non-believers : Why worry about the NR rate ? Just collect some reasonably good data from a reasonably representative set of objects.

Page 39:

Our obsession with Nonresponse Rates

Why not (in the manner of some private survey institutes) just get data from “a reasonably representative set of co-operative objects”, and not bother with this stifling concept of the Nonresponse Rate ?

It is time that NSI:s deliver a strong endorsement of the Probability Sampling Tradition – if this is what they really believe in; otherwise, act accordingly

Page 40:

Our obsession with Nonresponse Rates

NR rate itself is a poor indicator of NR bias,

of “accuracy of estimates”

See, for example, Groves (2006), Schouten (2009), Särndal and Lundström (2008)
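One way to see why the rate alone says little is a standard approximation from the nonresponse literature, under a random-response model. The notation is assumed here and not taken from the talk: each unit k has an unknown response probability θ_k, and the estimator is the unweighted respondent mean.

```latex
% Approximate bias of the respondent mean \bar{y}_r
\operatorname{Bias}(\bar{y}_r) \;\approx\; \frac{\operatorname{Cov}(\theta, y)}{\bar{\theta}}
  \;=\; \frac{R_{\theta y}\, S_{\theta}\, S_{y}}{\bar{\theta}}

% \bar{\theta}  : mean response probability (expected response rate)
% R_{\theta y}  : population correlation between \theta_k and y_k
% S_\theta, S_y : population standard deviations of \theta_k and y_k
```

The nonresponse rate enters only through the mean response probability in the denominator. If response probability is unrelated to the target variable, the bias is near zero even at a 50% rate; if the two are strongly related, a much lower rate can still carry a substantial bias.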

Page 41:

Conclusions

What options remain for the NSI today, to show their superior capacity to produce “serious numbers” amidst a deluge of “junk information” ?

The underpinnings may be just “a collection of practices”, but still, the NSI is the model of statistical competence in the nation - and it must demonstrate this !

Media criticism of the NSI sometimes harsh.

Page 42:

Conclusions

The NSI’s delicate balancing act

vis-à-vis

• The national government : fulfill the mandate

• The world of theory and learning : show “scientific credibility”

• The other (private) producers of statistics : tough competition

• The supra-agency (EuroStat) : dictates

Page 43:

Conclusions

A fact is that the quality component accuracy cannot be measured (probabilistically).

Yet this is what users want desperately to have measured.

When important numbers are proven wrong (by users), trust in the NSI suffers

Other numbers may be wrong, but go unnoticed - and may not matter much .

Page 44:

Conclusions

The Probability Sampling (Scientific Sampling) tradition is a reflection of an idyllic past - it is now 2010, not 1950

On what grounds is it still defensible, in our time?

It is a challenge to the NSI, and to the academics (the theoreticians), to provide the answers

Page 45:

Conclusions

The NSI vis-à-vis the scientific world : a sometimes hesitant relationship:

Most NSI:s have a scientific (academic) advisory board

NSI:s look to the learned world for support and acceptance

NSI:s own investment in research may (understandably) be limited.

Implementing new theory into the NSI's production has met with obstacles

Page 46:

Conclusions

Relationship of the NSI to the world of learning; an empirical investigation, see

Risto Lehtonen and Carl-Erik Särndal : Research and Development in Official Statistics and Scientific Co-operation with Universities: A Follow-Up Study, J. Official Statistics (2010)

Page 47:

Conclusions

Debate article :

S. Lundström and C.E. Särndal (2010): The devastating consequences of nonresponse : Probability sampling in question at Statistics Sweden . (In Swedish; internal report).

Credit goes to Statistics Sweden for their courage to debate a sensitive issue.