United Nations Economic Commission for Europe Statistical Division United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE
United Nations Economic Commission for EuropeStatistical DivisionUnited Nations Economic Commission for EuropeStatistical Division
The Importance of Databases in the Dissemination Process
Steven Vale, UNECE
Steven Vale - UNECE Statistical Division Slide 220 April 2023
Contents
How are data currently disseminated? Advantages and disadvantages of
different approaches Introduction to data cubes Good practices
Steven Vale - UNECE Statistical Division Slide 320 April 2023
Dissemination Practices
Web sites of statistical agencies for all 56 UNECE member countries checked during spring 2008.
Data dissemination systems and formats recorded.
Not possible to check all national language versions of websites.
Steven Vale - UNECE Statistical Division Slide 420 April 2023
Results
Internet Dissemination ToolsNumber of Countries
%
Static html / pdf / word pages 29 51.8%
Excel spreadsheets 12 21.4%
National database software 17 30.4%
PC-Axis 12 21.4%
Statbank / PC-Axis 3 5.4%
SuperWEB 2 3.6%
Steven Vale - UNECE Statistical Division Slide 520 April 2023
Static html / pdf / word Pages
Steven Vale - UNECE Statistical Division Slide 620 April 2023
Static html / pdf / word Pages
Advantages• Quick, easy and cheap to prepare• Data at a glance• Possible to combine tables, graphics and text• Html and pdf viewers are free
Disadvantages• Only a picture - users can not easily download
or manipulate data• Manual updates
Steven Vale - UNECE Statistical Division Slide 720 April 2023
Excel Spreadsheets
Steven Vale - UNECE Statistical Division Slide 820 April 2023
Excel Spreadsheets
Advantages• Users can download and customize data• Most common format for basic data analysis
Disadvantages• Excel software is not cheap!• Manual updates• User has to download the whole file
Steven Vale - UNECE Statistical Division Slide 920 April 2023
Output Databases
Steven Vale - UNECE Statistical Division Slide 1020 April 2023
Output Databases
Advantages• Interactive with flexible outputs• User friendly (usually!)• Can be tailored to national requirements• Some generic systems available
Disadvantages• Can be expensive to develop and maintain,
particularly if you develop your own system
Steven Vale - UNECE Statistical Division Slide 1120 April 2023
What Do Users Want?
Depends on the type of user Quick access to key figures Options to select and manipulate data Easy export to own analysis packages Graphic visualizations (maps, charts, ..) Appropriate metadata Multiple languages
Steven Vale - UNECE Statistical Division Slide 1220 April 2023
What is a Data Cube?
A multi-dimensional structure containing data points that represent unique combinations of several classifications
A flexible way of storing and disseminating data
Steven Vale - UNECE Statistical Division Slide 1320 April 2023
Two-dimensional Cube
Year
Country 2000 2001 2002 2003
AAA 123 456 124 567 125 678 126 789
BBB 987 654 988 654 989 654 999 654
CCC 35 789 36 789 37 789 38 789
Steven Vale - UNECE Statistical Division Slide 1420 April 2023
Three-dimensional Cube
Steven Vale - UNECE Statistical Division Slide 1520 April 2023
More dimensions are possible,
but not easy to display!
Steven Vale - UNECE Statistical Division Slide 1620 April 2023
Why Data Cubes are Important
Many statistical data management models and systems are based on cubes
Users can select just those data that are of interest
Cubes can easily be expanded, e.g. for extra years, countries, or other categories
At least in theory, cubes can have an infinite number of dimensions
Steven Vale - UNECE Statistical Division Slide 1720 April 2023
Good Practices
Static tables can be useful for key figures For detailed or large datasets, allow users
to create and manipulate their own tables Store data as multi-dimensional cubes Offer graphic visualizations Allow users to download data in a range
of formats (including SDMX)
Steven Vale - UNECE Statistical Division Slide 1820 April 2023
Good Practices (2)
Link data and metadata Share development in an open-source
environment or network, with an electronic forum for discussions and questions
Don’t try to re-invent the wheel!
Steven Vale - UNECE Statistical Division Slide 1920 April 2023
Thank you for listening
Questions?