METADATA OF NATIONAL STATISTICAL OFFICES BELARUS, RUSSIA AND KAZAKHSTAN Miroslava Brchanova, Moscow, October, 2014
Dec 28, 2015
Introduction
National statistical offices (NSOs) all over the world focus on presenting reference metadata on their websites.
This is a logical step that reflects the need to explain the content of the published data.
The use of structural metadata throughout the whole statistical data processing cycle is less common.
NATIONAL STATISTICAL COMMITTEE OF THE REPUBLIC OF BELARUS
The NSO plans to use GSBPM 5.0 to describe its existing statistical production processes. It started a pilot survey description in February 2014.
However, the metadata presented on the NSO's website are so far of a passive nature:
► Information about the methodology of various statistics
► Official classifications
► Dates of data collection
► Statistical questionnaires.
RUSSIAN FEDERATION FEDERAL STATE STATISTICS SERVICE (ROSSTAT) 1
ROSSTAT has published on its website a concept (for the years 2011-2017) for creating an integrated processing system based on a unified metadata system.
The system should include:
► Catalogue of statistical indicators
► Uniform code lists and classifications.
Indicators and code lists should be applied uniformly throughout the processing of statistical data.
The system should be based on the SDMX methodology and standards.
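A minimal sketch of how a catalogue of indicators and a uniform code list could be applied in the same way at every processing stage. All names and values here (`CATALOGUE`, `REGION_CODES`, `check_observation`, the indicator `POP_TOTAL`) are hypothetical illustrations, not taken from the ROSSTAT concept or from SDMX.

```python
from dataclasses import dataclass

# Hypothetical code list: region codes and labels (illustrative values only).
REGION_CODES = {"R01": "Region A", "R02": "Region B"}


@dataclass(frozen=True)
class Indicator:
    """One entry of a hypothetical catalogue of statistical indicators."""
    code: str
    name: str
    unit: str
    code_list: dict  # code list used to classify observations of this indicator


CATALOGUE = {
    "POP_TOTAL": Indicator("POP_TOTAL", "Total population", "persons", REGION_CODES),
}


def check_observation(indicator_code: str, region: str, value: float) -> dict:
    """Accept an observation only if it uses catalogue and code-list entries."""
    indicator = CATALOGUE.get(indicator_code)
    if indicator is None:
        raise KeyError(f"unknown indicator: {indicator_code}")
    if region not in indicator.code_list:
        raise ValueError(f"unknown region code: {region}")
    return {"indicator": indicator.code, "region": region,
            "value": value, "unit": indicator.unit}


# The same check can be reused at collection, processing and dissemination.
record = check_observation("POP_TOTAL", "R01", 1000.0)
```

Because every stage calls the same check against the same catalogue, an indicator or code that is not registered is rejected wherever it first appears.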
RUSSIAN FEDERATION FEDERAL STATE STATISTICS SERVICE 2
So far, the ROSSTAT website offers an electronic catalogue of metadata. It contains descriptive information on all input and output indicators of the Federal State Statistics Service.
In addition, the site contains mainly passive metadata:
► Information about the methodology of various statistics
► Official classifications
► Dates of data collection
► Statistical questionnaires.
The website also describes the SDDS standard methodology for transmitting data to the IMF.
[Diagram: “Metadata driven architecture?” with the labels Inputs, Outputs, Collect, Process, Analyse and Disseminate]
METADATA DRIVEN ARCHITECTURE
Statistical processes are driven by metadata.
Necessary preconditions:
► High level of automation
► Metadata must be ready from the first stage of statistical data processing.
Such an architecture ensures “active”, up-to-date metadata.
In such an architecture, metadata have to cover all statistical variables/indicators, not just outputs.
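The idea of metadata driving the processes can be sketched as follows, assuming a deliberately simplified setting: the variable descriptions in `VARIABLES` exist before collection starts, and both the collection and the dissemination step read them instead of hard-coding variable names. The variables, flags and function names are hypothetical.

```python
# Hypothetical metadata: each variable is described once, before collection starts.
# The metadata cover all variables, including those that are never published.
VARIABLES = {
    "employees": {"type": int, "label": "Number of employees", "published": True},
    "turnover": {"type": float, "label": "Turnover", "published": True},
    "respondent_id": {"type": str, "label": "Respondent identifier", "published": False},
}


def collect(raw: dict) -> dict:
    """Keep only variables defined in the metadata and cast them to the declared type."""
    return {name: meta["type"](raw[name])
            for name, meta in VARIABLES.items() if name in raw}


def disseminate(record: dict) -> dict:
    """Publish only variables flagged as outputs, using the metadata labels."""
    return {VARIABLES[name]["label"]: value
            for name, value in record.items() if VARIABLES[name]["published"]}


raw_answer = {"employees": "12", "turnover": "340.5",
              "respondent_id": "A-7", "comment": "n/a"}
collected = collect(raw_answer)
output = disseminate(collected)
```

Changing a label, a type or the publication flag in `VARIABLES` changes the behaviour of both steps without touching their code, which is what makes the metadata “active” in this sketch.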
GENERIC STATISTICAL BUSINESS PROCESS MODEL
[Figure: Generic Statistical Business Process Model. Source: http://www.unece.org]
The Republic of Kazakhstan - Achievements
The NSO has completed several important implementations that may be considered best practice for other statistical offices.
The achievements are:
► System of dissemination of statistical data and metadata TALDAU
► System of statistical metadata (classifications and nomenclatures)
► Model of integrated data processing, storage and dissemination (currently at an advanced stage of implementation).
SYSTEM OF THE COMMITTEE ON STATISTICS OF THE REPUBLIC OF KAZAKHSTAN – IS “Metadata”
The goal of IS “Metadata” is to provide technological support for the processes, extension, adjustment, and internal and external integration of the components of IIS “e-statistics”.
The tasks for the development of IS “Metadata” are:
► To unify the recording of metadata descriptions
► To unify IS management
► To unify data and metadata exchange with users.
[Diagram: IS “Metadata” within IIS “e-statistics”. Columns: Collect Phase, Process Phase, Disseminate Phase, Service. Meta-blocks: Description of Forms, Data Collection, Register, Classifiers, Data Processing, Distribution, Service Metadata.]
[Diagram: GSBPM with the Collect, Process and Disseminate phases marked; Taldau is labelled and the remaining positions are shown as “?”]
METADATA DRIVEN ARCHITECTURE - INPUT
The following processes of the Design and Build phases are not, or not fully, covered by IS “Metadata” in the Kazakh NSO:
► Methodological preparation of survey
► Timetables
► Structure and design of questionnaires
► Sample frame and samples
► Metadata definition of data validation rules
► Metadata definition of self-corrections
► Programming specifications
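To illustrate what a metadata definition of data validation rules could look like in general, here is a minimal sketch in which rules are stored as data and interpreted by one generic function. It is an assumption made for illustration only; the rule format, the variable names and the `validate` helper are hypothetical and do not describe IS “Metadata”.

```python
import operator

# Comparison operators that a rule may refer to by symbol.
OPERATORS = {">=": operator.ge, "<=": operator.le, "==": operator.eq}

# Hypothetical validation rules stored as metadata rather than as program code.
RULES = [
    {"id": "V1", "variable": "employees", "op": ">=", "value": 0},
    {"id": "V2", "variable": "share_pct", "op": "<=", "value": 100},
]


def validate(record: dict, rules: list) -> list:
    """Return the ids of the rules that the record violates."""
    failed = []
    for rule in rules:
        compare = OPERATORS[rule["op"]]
        if not compare(record[rule["variable"]], rule["value"]):
            failed.append(rule["id"])
    return failed


errors = validate({"employees": -3, "share_pct": 40}, RULES)
```

Adding or changing a rule then means editing an entry in `RULES`, not reprogramming the validation step.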
System of the Republic of Kazakhstan – Goals 1
To add subsystems to IS “Metadata”:
► Users' requests
► Quality.
The Users' requests application should be a place where:
► Internal and external users can describe their needs
► Methodologists and domain experts evaluate requests
► Reports for management are produced.
The sub-system should keep records of:
► Decision
► Reasoning
► Further steps.
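One possible shape for a record in the Users' requests sub-system is sketched below, assuming it stores the decision, the reasoning and the further steps next to the described need. The class `UserRequest`, its fields and the `management_report` helper are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class UserRequest:
    """Hypothetical record of one request from an internal or external user."""
    requester: str
    need: str
    decision: Optional[str] = None       # filled in after evaluation
    reasoning: Optional[str] = None
    further_steps: list = field(default_factory=list)

    def evaluate(self, decision: str, reasoning: str, further_steps: list) -> None:
        """Store the outcome of the evaluation by methodologists and domain experts."""
        self.decision = decision
        self.reasoning = reasoning
        self.further_steps = list(further_steps)


def management_report(requests: list) -> dict:
    """Count requests by decision; requests not yet evaluated count as 'pending'."""
    return dict(Counter(r.decision or "pending" for r in requests))


requests = [
    UserRequest("external user", "Monthly regional breakdown of an indicator"),
    UserRequest("internal user", "New classification version in outputs"),
]
requests[0].evaluate("accepted", "Data are already collected monthly",
                     ["Define output table"])
report = management_report(requests)
```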
System of the Republic of Kazakhstan – Goals 2
► To improve cooperation between methodologists and metadata experts, subject-matter statisticians and the computation centre.
► Methodologists should be trained on the topic of metadata. Round tables should be organized.
► Conditions for sampling must be developed.
► To implement a modular approach to questionnaires in order:
   ► To reduce the administrative burden on respondents
   ► To improve the quality of statistical indicators
   ► To improve the consistency of statistical data
   ► To publish statistical data for the general population.
System of the Republic of Kazakhstan – Goals 3
To create a kind of quality map, it is necessary to build a “Quality” application that makes it possible to monitor and evaluate all stages of data processing.
The main stages of interest from this point of view are:
► Specification of requirements
► Design
► Data collection
► The processing steps.
CONCLUSIONS
► To make metadata active to the greatest extent possible from the very beginning. Active metadata are metadata that drive other processes and actions.
► To reuse metadata where possible from the very beginning, for statistical integration as well as efficiency reasons.
► To preserve history (old versions) of metadata from the very beginning.
► To make metadata-related work an integral part of business processes across the organisation from the very beginning.
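Preserving the history of metadata can be sketched as a store that never overwrites an old version and can answer which definition was valid on a given day. This is a minimal illustration under that assumption; the class `VersionedDefinition` and the example texts are hypothetical.

```python
from datetime import date


class VersionedDefinition:
    """Hypothetical store that keeps every version of one metadata item."""

    def __init__(self):
        self._versions = []  # list of (valid_from, text), kept in date order

    def add_version(self, valid_from: date, text: str) -> None:
        """Add a new version without discarding the earlier ones."""
        self._versions.append((valid_from, text))
        self._versions.sort(key=lambda version: version[0])

    def as_of(self, day: date):
        """Return the definition valid on the given day, or None if none existed yet."""
        current = None
        for valid_from, text in self._versions:
            if valid_from <= day:
                current = text
        return current


definition = VersionedDefinition()
definition.add_version(date(2010, 1, 1), "Employees: persons on the payroll")
definition.add_version(date(2014, 1, 1), "Employees: persons on the payroll, full-time equivalent")

old_text = definition.as_of(date(2012, 6, 30))
new_text = definition.as_of(date(2014, 10, 1))
```

Keeping old versions in this way lets published time series be interpreted with the definition that applied when the data were produced.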
Any questions?
Abbreviations:
SDMX – Statistical Data and Metadata Exchange
SDDS – Special Data Dissemination Standard
TALDAU – Information-analytical system
IS – Information System
IIS – Integrated Information System