Introduction to Data Modeling Statistics Division, September 2019

Aug 28, 2020


Introduction to Data Modeling

Statistics Division, September 2019

Data Structure Definitions

• DSDs are a description of the data model

• SDMX provides a framework to build DSDs

• To design a DSD, we first need to find concepts that identify and describe our data

• Each concept describes something about the data

Building DSDs

Concepts

• Dimensions

• Primary Measure (Obs. Value)

• Attributes
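
The three roles above can be sketched in Python. This is only an illustration of the idea, not SDMX syntax: the class names, the `TOY_DSD` identifier and the concept IDs are assumptions made for the example.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class Role(Enum):
    """The role a concept plays in a DSD."""
    DIMENSION = "dimension"
    PRIMARY_MEASURE = "primary_measure"
    ATTRIBUTE = "attribute"


@dataclass(frozen=True)
class Component:
    concept_id: str  # hypothetical concept identifier
    role: Role


@dataclass
class DataStructureDefinition:
    dsd_id: str
    components: List[Component] = field(default_factory=list)

    def by_role(self, role: Role) -> List[str]:
        """Return the concept IDs that play the given role, in order."""
        return [c.concept_id for c in self.components if c.role is role]


# A toy DSD; every identifier here is illustrative.
toy_dsd = DataStructureDefinition(
    dsd_id="TOY_DSD",
    components=[
        Component("FREQ", Role.DIMENSION),
        Component("REF_AREA", Role.DIMENSION),
        Component("SEX", Role.DIMENSION),
        Component("TIME_PERIOD", Role.DIMENSION),
        Component("OBS_VALUE", Role.PRIMARY_MEASURE),
        Component("FOOTNOTE", Role.ATTRIBUTE),
    ],
)

print(toy_dsd.by_role(Role.DIMENSION))
print(toy_dsd.by_role(Role.ATTRIBUTE))
```

Keeping the role next to each concept makes it easy to ask which concepts identify an observation and which only describe it.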

Dimension or Attribute?

• Concepts that identify data should be made dimensions

• Concepts that provide additional information about data should be made attributes

If a concept is a dimension, it is possible to have time series that differ only in the value of this concept
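
A minimal sketch of this rule, assuming three identifying dimensions (FREQ, REF_AREA, SEX) and one attribute (FOOTNOTE). The concept IDs and all data values are made up for illustration.

```python
from collections import defaultdict

# Concepts assumed to identify a time series (illustrative choice).
DIMENSIONS = ("FREQ", "REF_AREA", "SEX")

# Made-up observations; FOOTNOTE is an attribute, not part of the key.
observations = [
    {"FREQ": "A", "REF_AREA": "KH", "SEX": "F",
     "TIME_PERIOD": "2018", "OBS_VALUE": 10.2, "FOOTNOTE": "Estimate"},
    {"FREQ": "A", "REF_AREA": "KH", "SEX": "F",
     "TIME_PERIOD": "2019", "OBS_VALUE": 10.9, "FOOTNOTE": ""},
    {"FREQ": "A", "REF_AREA": "KH", "SEX": "M",
     "TIME_PERIOD": "2018", "OBS_VALUE": 12.1, "FOOTNOTE": ""},
]


def series_key(obs):
    """Build the series key from the dimension values only."""
    return tuple(obs[d] for d in DIMENSIONS)


def group_into_series(rows):
    """Group observations into time series keyed by their dimensions."""
    series = defaultdict(dict)
    for row in rows:
        series[series_key(row)][row["TIME_PERIOD"]] = row["OBS_VALUE"]
    return dict(series)


series = group_into_series(observations)
for key, points in series.items():
    print(key, points)
```

The female and male rows land in two separate series because SEX is a dimension. The differing FOOTNOTE values do not split the first series, because an attribute only describes the data.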

Time Dimension

• The TIME dimension provides the observation time

• The FREQUENCY dimension describes the interval between observations (yearly, quarterly, daily, etc.)
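
One way to see how the two dimensions work together is to check that an observation time is written in a form that fits its frequency. The sketch below assumes the frequency codes A, Q, M and D and the period formats shown in the comments. These are common conventions, but they are assumptions here and should be checked against the code list and time format actually in use.

```python
import re

# Assumed frequency codes and matching period formats (illustrative).
PERIOD_PATTERNS = {
    "A": re.compile(r"\d{4}"),                          # e.g. 2019
    "Q": re.compile(r"\d{4}-Q[1-4]"),                   # e.g. 2019-Q3
    "M": re.compile(r"\d{4}-(0[1-9]|1[0-2])"),          # e.g. 2019-09
    "D": re.compile(
        r"\d{4}-(0[1-9]|1[0-2])-(0[1-9]|[12]\d|3[01])"  # e.g. 2019-09-30
    ),
}


def period_matches_frequency(freq, period):
    """Return True if the period string has the shape expected for freq."""
    pattern = PERIOD_PATTERNS.get(freq)
    return bool(pattern and pattern.fullmatch(period))


print(period_matches_frequency("A", "2019"))
print(period_matches_frequency("Q", "2019"))
```

The check is purely about the shape of the string. It does not verify that a date exists on the calendar.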

Representation

Coded

• Based on code lists

• For example: Country, Sex, Indicator

Un-coded with format

• Format specified

• For example: postal code with 5 digits

Un-coded free text

• Any text is valid

• For example: footnote
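
The three kinds of representation can be sketched as three small validators. The helper names and the sample code list below are assumptions for illustration, not part of SDMX.

```python
import re


def coded(codes):
    """Coded: the value must be one of the codes in a code list."""
    allowed = frozenset(codes)
    return lambda value: value in allowed


def formatted(pattern):
    """Un-coded with format: the value must match the given pattern."""
    compiled = re.compile(pattern)
    return lambda value: compiled.fullmatch(value) is not None


def free_text():
    """Un-coded free text: any text is valid."""
    return lambda value: isinstance(value, str)


# Illustrative representations for three concepts.
is_valid_sex = coded({"F", "M", "_T"})           # hypothetical code list
is_valid_postal_code = formatted(r"\d{5}")       # postal code with 5 digits
is_valid_footnote = free_text()

print(is_valid_sex("F"), is_valid_sex("female"))
print(is_valid_postal_code("10200"), is_valid_postal_code("1020"))
print(is_valid_footnote("Provisional figure"))
```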

Examples of Code Lists

Representation of Concepts

• Dimensions must always be coded

• Attributes can be coded or un-coded
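
The rule above can be expressed as a simple check over a list of concepts. This is a sketch under assumed names: the `Concept` class, the role strings and the code list ID `CL_SEX` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Concept:
    concept_id: str
    role: str                        # "dimension" or "attribute"
    code_list: Optional[str] = None  # None means un-coded


def uncoded_dimensions(concepts: List[Concept]) -> List[str]:
    """Return the IDs of dimensions that break the 'must be coded' rule."""
    return [
        c.concept_id
        for c in concepts
        if c.role == "dimension" and c.code_list is None
    ]


concepts = [
    Concept("REF_AREA", "dimension", code_list="CL_AREA"),
    Concept("SEX", "dimension", code_list="CL_SEX"),
    Concept("CITY_NAME", "dimension"),   # un-coded dimension: flagged
    Concept("FOOTNOTE", "attribute"),    # un-coded attribute: allowed
]

print(uncoded_dimensions(concepts))
```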

SDG Data Structure

• Developed by the IAEG-SDG Working Group on SDMX

• Version 1.0 was released in June 2019

SDG DSD – Dimensions

Frequency

Reporting Type

Series

Reference Area

Sex

Age

Degree of urbanization

Income or wealth quantile

Education level

Occupation

Disability status

Economic activity

Product type

Custom breakdown

Composite breakdown

Time period

SDG DSD – Dimensions

• Reporting type: ‘N’ for national, ‘G’ for global

• Sex: when a series refers to the female population, use ‘F’; for example, Number of seats held by women in national parliaments

• Custom breakdown: for non-standard, ad hoc breakdowns

• Composite breakdown: a combination of a few seldom-used breakdowns

SDG DSD – Code Lists

• Dimensions with a breakdown have a code _T to denote the total or no breakdown. For example, CL_URBANISATION:

Code    Description

_T      Total

U       Urban

R       Rural

• Code list CL_AREA has countries and groupings. For country-level use, the code list needs to be expanded with regions, cities, provinces, etc.
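
A short sketch of the _T convention: any breakdown dimension that a series does not use is set to _T. Only three breakdown dimensions are shown, and their IDs (SEX, AGE, URBANISATION) are assumptions for the example. The real DSD has more dimensions and may name them differently.

```python
# Subset of breakdown dimensions; the IDs are assumed for illustration.
BREAKDOWN_DIMENSIONS = ("SEX", "AGE", "URBANISATION")
TOTAL = "_T"  # code for "total or no breakdown"


def complete_key(partial):
    """Fill every breakdown dimension that is not given with _T."""
    return {dim: partial.get(dim, TOTAL) for dim in BREAKDOWN_DIMENSIONS}


# A series broken down by urbanisation only.
urban_only = complete_key({"URBANISATION": "U"})
print(urban_only)

# A series that refers to the female population.
women_only = complete_key({"SEX": "F"})
print(women_only)
```

Filling in _T explicitly means every observation carries a value for every dimension, so totals and breakdowns can sit side by side in the same dataset.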

THANK YOU

WWW.UNESCAP.ORG

UNESCAP

UNITEDNATIONSESCAP
