Slide Template forModule 2: Types, Formats,
and Stages of Data
Learner Objectives
Module 2: Data Types, Stages & Formats
1. Explain what research data is and the range of data types
2. Identify stages of research data3. Identify common potential storage formats for data 4. Identify relevant quality control techniques/technical
standards 5. Identify methods of recording data that are specific to
researchers’ disciplines and research interests
Research Data Associated with Most Disciplines
• Images
• Video
• Mapping/GIS data
• Numerical measurements
Module 2: Data Types, Stages & Formats
Research Data Associated with Social Sciences
• survey responses
• focus group and individual interviews
• economic indicators
• demographics
• opinion polling
Module 2: Data Types, Stages & Formats
Research Data Associated with Hard Sciences
• measurements generated by sensors/laboratory instruments
• computer modeling
• simulations
• observations and/or field studies
• specimen
Module 2: Data Types, Stages & Formats
Stages of Data Related to Research Data Life Cycle
• Raw Data
• Processed Data
• Analyzed Data
• Finalized/Published Data
• Existing Data across Different Sources
Module 2: Data Types, Stages & Formats
Stages of Data Related to Research Data Life Cycle
Sample hypothesis:
Water temperatures in Lake Superior are now significantly warmer than in previous years. The evidence lends support to global warming.
Module 2: Data Types, Stages & Formats
Using our sample hypothesisWater temperatures in Lake Superior are now significantly warmer than in previous years. This evidence lends support to global warming.
• Raw Data = daily lake temperatures
• Processed Data = ‘cleaned’ temp. data in spreadsheet
• Analyzed Data = average temps., graphing changes
• Finalized Data = does data support the hypothesis?
Module 2: Data Types, Stages & Formats
Preferable Format Types for Long-Term Access to Data
Data formats that offer the best chance for long-term access are both:
• Non-proprietary (also known as open), and
• Unencrypted and uncompressed
Module 2: Data Types, Stages & Formats
Preferred FormatsExamples of preferred formats for various data types
include:
Moving Images: MOV, MPEG
Audio: WAVE, MP3
Numbers/statistics: ASCII, SAS
Images: TIFF, JPEG 2000
Text: PDF/A, ASCII
Module 2: Data Types, Stages & Formats
Converting to Preferable Formats
Information can be lost when converting file formats.
To mitigate the risk of lost information:
• Note conversion steps taken
• If possible, keep the original file as well as the converted one
Module 2: Data Types, Stages & Formats
Describing Data, Documenting Reliability & Collection
TechniquesData documentation explains the:
• Who• What• Where• When• And why of data.
Module 2: Data Types, Stages & Formats
Describing Data, Documenting Reliability & Collection
TechniquesWho:
• Who collected this data?
• Who or what were the subjects under study?
Module 2: Data Types, Stages & Formats
Describing Data, Documenting Reliability & Collection
TechniquesWhat:
• What data was collected, and for what purpose?
• What is the content and structure of the data?
Module 2: Data Types, Stages & Formats
Describing Data, Documenting Reliability & Collection
TechniquesWhere:
• Where was this data collected?
• What were the experimental conditions that produced it?
Module 2: Data Types, Stages & Formats
Describing Data, Documenting Reliability & Collection
TechniquesWhen:
• When was the data collected?
• Is the data part of a series, or ongoing experiment?
Module 2: Data Types, Stages & Formats
Describing Data, Documenting Reliability & Collection
TechniquesWhy:
• Why was this experiment performed?
• How does it relate to your research question?
Module 2: Data Types, Stages & Formats
Cross Discipline ConcernsNo matter what, you need to
have:
• File naming conventions
• Version control
Module 2: Data Types, Stages & Formats
Why Do I Need to Worry about That?
Consider this:
If you unexpectedly have to leave your research project for a few months, could a colleague easily make sense of your data files?
Module 2: Data Types, Stages & Formats
Why Use File Naming Conventions?
Naming conventions make life easier!
• Help you find your data• Help others find your data• Help track which version of a file
is most current
Module 2: Data Types, Stages & Formats
What File Naming Convention Should I Use?
Has your research group established a convention?
If not, general guidelines include:
• Meaningful file names that aren’t too long• Avoid certain characters• Dates can help with sorting and version control
Module 2: Data Types, Stages & Formats