Top Banner
Trials and Tribulations of creating DDI Codebooks at the University of Guelph A.Michelle Edwards and Carol Perry, Data Resource Centre, University of Guelph Guelph, Ontario
41

Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Jan 02, 2016

Download

Documents

Trials and Tribulations of creating DDI Codebooks at the University of Guelph. A.Michelle Edwards and Carol Perry, Data Resource Centre, University of Guelph Guelph, Ontario. Current Search Function. Search Results. Current Documentation. Identifying Variables. Rationale for Change. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Trials and Tribulations of creating DDI Codebooks at

the University of Guelph

A.Michelle Edwards and Carol Perry,

Data Resource Centre,

University of Guelph

Guelph, Ontario

Page 2: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Current Search Function

Page 3: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Search Results

Page 4: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Current Documentation

Page 5: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Identifying Variables

Page 6: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Rationale for Change

522 datasets to date.

No comprehensive metadata search function.

No current variable search within dataset.

Limits researcher’s autonomy.

Page 7: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

XML tags

Started with approx. 30 or so tags…

As of June 5, 2002 101 tags59 are filled Information contained inside tags

Page 8: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Codebook Templates

Used Maddie to develop initial template.

Edited the template to add tags as required.

Filled in fields common to all codebooks.

Page 9: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Codebook Templates

Statistics Canada data

ICPSR data

B2020 data format

Page 10: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Statistics Canada Codebook

Page 11: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Differences between Codebook Templates

Authoring entity

Distributor (DLI vs. ICPSR)

Licenses

Other material – ICPSR abstract link

B2020No direct link to databaseNo variables

Page 12: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

How do we move our information from an HTML

readme file to an XML file???

Page 13: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Readme to XML

Document Description

Study Description

Data Files Description

Page 14: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Readme to XML

Currently – copy and paste information from the Readme (html) file into the XML Codebook.

Script extracts metadata from html and places into XML.

Same amount of time.

Page 15: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Variable Information

Page 16: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Variable Information

Sources of Variable information

Variable names, labels, and position from the SAS program.

Frequencies for each variable value from SAS output.

Page 17: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Variable Information

Sources of Variable information

Literal questions from questionnaires if available.

Page 18: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Variable Information

Script:

Looks into the SAS program – pulls out the variable names, labels and positions.

Looks into a SAS output file for frequencies and variable value labels.

Page 19: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Variable Information

Script:

If questionnaire is available – seeks out questions and matches with variables.

Page 20: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Variable Information

Problems with Script:

SAS programs must be consistent in their format.

SAS output and questionnaires – matching variables.

Page 21: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

SAS to XML

SAS 8.2 - XML engine and ODS XML.

Can create XML SAS output.Variable names, labels, value labels, and

frequencies.

Variable positions with the input statement and Proc Print XML.

Page 22: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

SAS to XML Frequency Output

Page 23: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

SAS to XMLProc print output

Page 24: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

SAS to XML

Page 25: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

SAS to XML

Advantages:

SAS programs do not need to be consistent.

Use one program from start to finish – SAS.

Still in development.

Page 26: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

XML to Viewable Document

Saxon – to render our XML documents to HTML using XSL Stylesheets.

XSL – pull out info from XML document and display with HTML tags.

Page 27: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

XSL Templates

Set for each:Statistics Canada ICPSRB2020

Initial templates from University of Virginia samples.

Page 28: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

XSL Templates

Abstract

Study Info

Methodology & File Dimensions

Questions

Variables & Frequencies

Other Documents

Page 29: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

XSL Stylesheets

Page 30: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

Search

Uses SAS IntrNet to call and run the UNIX SGREP search.

Creates an XML file with results.

Calls Saxon to render the file with the Variable XSL Stylesheet.

Page 31: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Frames to put it all together.

Links to each component (abstract, etc.).

Returns the rendered HTML on the fly.

Page 32: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Page 33: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Sun Exposure Survey 1996

http://tdr.uoguelph.ca/DATA/WWWDOCS/XML/SES2/ses96cbk.html

Page 34: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Finished Product”

522 datasets to date.

35 Completed DDI-compliant codebooks.

Fall completion ???

Page 35: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Page 36: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Page 37: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Page 38: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Page 39: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Page 40: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”

Page 41: Trials and Tribulations of creating DDI Codebooks at the University of Guelph

“Final Product”