EDU 8603 Day 6
Jan 02, 2016
EDU 8603 Day 6
What do the following numbers mean?85 92 45 90 95 68 97 75 88 85
Educational MeasurementMeasurement: assignment of numbers to
differentiate values of a variablePurpose of measurement for research
Provide a standard format for recording observations, performances, or other responses of subjects and summarizing results
GOOD RESEARCH MUST HAVE SOUND MEASUREMENT!!
Descriptive StatisticsStatistics: procedures that summarize and
analyze quantitative dataDescriptive statistics: statistical procedures
that summarize a set of numbers in terms of central tendency or variation
Important for understanding what the data tells the researcher
Descriptive StatisticsStatistics: procedures that summarize and
analyze quantitative dataDescriptive statistics
Statistical procedures that summarize a set of numbers in terms of central tendency or variation
Foundational for inferential statisticsImportant for understanding what the data
tells the researcher
Measures of central tendencyMean (µ)MedianMode
Thought QuestionConsider the following scores on a test
Marco 90 Adriane 85 Adriane 85 Linda 75 Christy 99Chantelle 88 Jay 45 Remi 68 Marcus
97Chi Bo 92 Donnie 85
Which measure of central tendency would Adriane use when telling her parents about her performance?
Thought QuestionIf Jay scored an 85 instead of a 45, what
changes?
Descriptive StatisticsFrequency distributions (see Figure 6.2)
Normal - scores equally distributed around middle
Positively skewed - large number of low scores and a small number of high scores; mean being pulled to the positive
Negatively skewed - large number of high scores and a small number of low scores; mean being pulled to the negative
Normal Distribution
An Extreme ExampleConsider the salaries of 10 peopleGroup A – All are teachers.
Salaries: $45,000 $45,000 $45,000$50,000 $50,000 $50,000$50,000 $55,000 $55,000$55,000
An Extreme ExampleConsider the salaries of 10 peopleGroup B – All are teachers; 1 won the lottery.
Salaries: $45,000 $45,000 $45,000$50,000 $50,000 $50,000$50,000 $55,000 $55,000$6,300,000
An Extreme ExampleWhat happens to the mean and median in
these 2 examples? Does it change? What happens to the normal distribution?
Positive Skew
Negative Skew
Descriptive StatisticsVariability
How different are the scores?Types
Range: the difference between the highest and lowest scores
Standard deviation The average distance of the scores from the mean The relationship to the normal distribution
±1 SD = 68% of all scores in a distribution ±2 SD = 95% of all scores in a distribution
Variability
Standard Deviation
VariabilityWhy does variability matter?
Descriptive StatisticsRelationship
How two sets of scores relate to one another
Correlation (positive)Low .10 - .39Moderate .40 - .69High > .70
Example of Correlation
Validity and Reliability
What’s all the fuss about?
Validity/Reliability and TrustworthinessWhy do we need validity and reliability
in quantitative studies and “trustworthiness” in qualitative studies?
We can’t trust the results if we can’t trust the
methods!
Thought QuestionOn the ACT and SAT assessments, there is a
definitive script that test administrators are required to follow exactly. What measurement issue are the test makers addressing?
Reliability of MeasurementReliability - The extent to which measures are
free from error
Error is measured by consistency
Reliability of MeasurementSources of error
Test construction and administrationAmbiguous questions, confusing
directions, changes in scoring, interrupted testing, etc.
Subject’s characteristicsTest anxiety, lack of motivation, fatigue,
guessing, etc.
Reliability of MeasurementReliability
Measurement0.00 indicates no reliability or
consistency1.00 indicates total reliability or
consistency< .60 = weak reliability> .80 = sufficient reliability
Reliability of MeasurementTypes of reliability evidence
Stability (i.e. test-retest) Testing the same subject using the same test on
two occasions Limitation - carryover effects from the first to
second administration of the testEquivalence (i.e. parallel form)
Testing the same subject with two parallel (i.e. equal) forms of the same test taken at the same time
Limitation - difficulty in creating parallel forms
Reliability of MeasurementEquivalence and stability
Testing the same subject with two forms of the same test taken at different times
Limitation - difficulty in creating parallel forms
Reliability of MeasurementInternal consistency
Testing the same subject with one test and “artificially” splitting the test into two halves
Limitations - must have a minimum of ten (10) questions
Often see “Chronbach’s alpha” for reliability coefficient (ex – Learning styles)
Reliability of MeasurementAgreement/ Inter-rater reliability
Observational measuresMultiple observers coding similarly
Reliability of MeasurementEnhancing reliability
Standardized administration procedures (e.g. directions, conditions, etc.)
Appropriate reading levelReasonable length of the testing periodCounterbalancing the order of testing if
several tests are being given
Validity of MeasurementValidity: the extent to which inferences are
appropriate, meaningful, and usefulCurrent example – content tests and teacher
licensure
Validity of MeasurementFor research results to have any
value, validity of the measurement of a variable must existUse of established and “new” instruments
and the implications for establishing validity
Importance of establishing validity prior to data collection (e.g. pilot tests)
ValidityContentPredictive (criterion-related)ConcurrentConstruct
Thought QuestionCriticisms of standardized tests like the SAT
claim that they discriminate against particular groups of students (especially minorities) and do not represent a broad enough domain of knowledge to adequately assess a student’s academic potential. What issue of validity is operating in these arguments?
Thought QuestionOther arguments against the SAT state that the
tests do not adequately estimate an individual’s ability to succeed in college. What issue of validity is operating here?
Reader’s Digest version…Reliability
The extent to which scores are free from errorError is measured by consistency
ValidityThe extent to which inferences are
appropriate, meaningful, and useful“Does the instrument measure what it is
supposed to measure??”
Reliability & Validity of MeasurementWhat is the relationship of reliability to
validity? If a watch consistently gives the time at 1:10
when actually it is 1:00, it is ____ but not ____.______ is necessary but not sufficient condition
for _______. To be _____ , an instrument must be ______, but
a ____ instrument is not necessarily _____.
Midterm3 parts
Multiple Choice (50%) – terms and application
Short Answer (25%) – applicationEssay (25%) – evaluate a research article.
This part is take home.
Take Home Portion of ExamSchlosser Article Based on topics we have discussed in class and you
have read about, critique the article based on the following:
Introduction and research problem, including the researcher’s background and involvement
Review of literature/ theoretical frameworkMethods of data collection (including participants)
and data analysisResults and conclusions including issues of
trustworthiness. Be sure to address whether we should trust the claims that the authors have made and why we should or should not trust the claims.