Q2014 – Special Session Big Data Vienna, 4 June 2014 Quality Approaches to Big Data Peter Struijs and Piet Daas.

Post on 31-Dec-2015

215 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

Transcript

Q2014 – Special Session Big Data Vienna, 4 June 2014

Quality Approaches to Big DataPeter Struijs and Piet Daas

2

Limitations of the established quality frameworks and methodology

Options

What to doin the changing context of making statistics

Approaches and data sources

Surveys / questionnaires

e.g. sampling theory

Administrative data sources

Where does Big Data fit in? 3

Two levels of quality

Quality as related to methodology

General quality criteria as defined in Code of Practice:

‐ Relevance‐ Accuracy and reliability‐ Timeliness and punctuality‐ Coherence and comparability‐ Accessibility and clarity

4

5

Limitations of the established quality frameworks and methodology

Small, medium-sized & large vehicles

22

7

Figure 1. Development of daily, weekly and monthly aggregates of social media sentiment from June 2010 until November 2013, in green, red and black, respectively. In the insert the development of consumer confidence is shown for the identical period.

Daytime population based on mobile phone data

The top three issues

9

Population not known

Unbalanced

coverage

Relevance of data not

clear

10

Options

Population not known

11

Derive background information

Relate population at meso- or macro-level to other information

Unbalanced coverage

12

Use modeling approaches

Relevance of data not clear

13

Calibration / fitting

Study correlations

Use Big Data for “stand alone”

information

14

What to doin the changing context of making statistics

15

Strategic aspects

Others start producing statistics• there may be quality issues• but they are extremely rapid• and there is obviously demand

Need for good, impartial informationwill remain• without a monopoly for NSIs

NSIs must validate information produced by others

16

The way forward

Get to know Big Data

Use Big Data for efficiency and response burden reduction

Use Big Data for early indicators

Start with Big Data, not with the desired outcome

Create the right environment

17

18

19

top related