Top Banner
1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example
27

1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

Jan 04, 2016

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

1

Detection of Differential Item/Test Functioning

(DIF/DTF) Using IRT

Implication and example

Page 2: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

2

Why Study DIF/DTF Using IRT

comparing cultural, ethnic, or gender groups.

Meaningful comparisons require that measurement equivalence holds.

Classical test theory methods confound “bias” with true mean differences; IRT does not.

In IRT terminology, item/test bias is referred (relative) to DIF/DTF

Page 3: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

3

Defining DIF and DTF

DIF

DBF

DTF

BIARS and IMPACT

Page 4: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

4

Examples of DIFUniform DIF Against Focal Group

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

-3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 2.5 3Theta

Pro

b.

of

Po

siti

ve R

esp

on

se

Reference

Focal

Nonuniform (Crossing) DIF

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

-3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 2.5 3Theta

Pro

b.

of

Po

siti

ve R

esp

on

se

Reference

Focal

Reference group favored at all levels

Focal favored at low theta

Reference favored at high theta

Page 5: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

5

Example of DTF for 50-Item Test

DTF Against Reference Group

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

-3.0 -2.5 -2.0 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0

Theta

Pro

po

rtio

n C

orr

ec

t T

rue

Sc

ore

Focal

Reference

Most focal group members expected to score about 3 points higher

Page 6: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

6

Procedures for Detecting DIF/DTF

DIF Parametric

Lord’s Chi-Square Likelihood Ratio Test Signed and Unsigned Area Methods

Nonparametric SIBTEST Mantel-Haenszel

DTF Parametric

Raju’s DFIT Method Nonparametric

SIBTEST

Page 7: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

7

Detecting DIF Using Lord's Chi-Square

i1

i2i vv

vi is a vector of the differences in the estimated item parameters for the ith item between the focal and reference groups

i is the variance-covariance matrix for

the differences in item parameter estimates

Lord’s Chi-Square is sensitive to both uniform and nonuniform DIF.

Page 8: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

8

Detecting DIF Using Lord's Chi-Square

1. Estimate item parameters and covariances for focal and reference groups separately.

2. Obtain linking constants, A and K, for putting the focal and reference parameters on a common metric.

3. Compute Lord’s chi-square to identify DIF items using the reference and transformed focal group parameters and their covariances.

4. Once the DIF items have been identified, reequate the focal and reference group metrics using only the non-DIF items.

5. Repeat steps 2 through 4 until the same items are identified on consecutive trials.

This procedure is implemented in the program ITERLINK.

Page 9: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

9

Detecting DIF Using Mantel-Haenszel(1959)

Detect Dif of Dichotomous items Statistics S*2 *2

0<=aMH< 正无穷 (0 为无 DIF , >1有利于参照组, <1 有利于目标组 )

美国 ETS 公司做了改进。

)/)(/()/)((MH 1001 kfkrkkfkrk nffnffa

Page 10: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

10

Detecting DIF Using Mantel-aenszel(1959)

)ln(35.2)ln(1.7

4MH aMHaMH

ETS 公司根据 MH 方法计算的结果,把项目分成三种, A ( negligible ) B(Moderate) C(Large)

Page 11: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

11

Detecting DIF Using LOG Model

假设:如果 focal group 及 Reference group 能力相等,则有:

DIF LOG Model:

) | ( P ) | ( Pf y y r

uey

1

1

)*( 1131211 XGbXbGbu

Page 12: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

12

Detecting DIF/DTF Using SIBTEST

Nonparametric method that can be used to examine individual items or groups of items Assumes only monotonicity Requires only item response data Works well with fairly small samples (250+)

Several variations exist Original SIBTEST: Uniform DIF Crossing SIBTEST: Nonuniform DIF PolySIB: Uniform DIF, polytomous data MultiSIB: Uniform DIF, multiple dimensions

Page 13: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

13

Using SIBTEST

SIBTEST consists of two executable files: SIBIN.EXE : interactive, creates input file SIBTEST.EXE : performs DIF/DTF analyses

Choose “E” for either, “R” for reference, or “F” for focal group

Detailed discussion of running SIBIN and SIBTEST is presented on the web

Page 14: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

14

LOG Model (SPSS)

在一个测验中,某道题男女同学 ( 以“ 1” 表示男,以“ 0” 表示女 ) 答对的情况 ( 以“ 1” 表示答对,以“ 0”表示答错 ) 及其总分如下表。问该题对于性别来说是否存在 DIF( 以男生为参照组,以女性为目标组 ) 。

Page 15: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

15

答对情况 (Y) 总分 (X) 性别 (G)

1010101010

6.06.07.23.57.02.95.00.87.13.3

0100111100

Page 16: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

16

Example (LOG Model)

Page 17: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

17

Example (LOG Model)

Page 18: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

18

Example (LOG Model)

Page 19: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

19

Explore DIF Using EZDIFEZDIF measures DIF with the Mantel- Haensael and the Logistic Regression procedures.

Output files:

the common odds ratio a

the M-Hx2

the X2 significance level

the Holland & Thayer (1988) MH D-DIF statistic

the standard error of MH D-DIF

an ETS effect size code indicating A/B/C

empirical ICC, Logistic Regression output

Page 20: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

20

EZDI RESULT

Mantel-Haenszel and Logistic Regression Analysis of DIFFERENTIAL ITEM FUNCTIONING

Programmed by Niels G. Waller

RFDATA

Reference Group: F:\JANE\rdata Focal Group: F:\JANE\fdata Number of Cases in Reference Group: 1000 Number of Cases in Focal Group: 1000

Conditioning Levels 0 5 9 13 16 19 21 24 27 30 33 36 4 8 12 15 18 20 23 26 29 32 35 40

Note: Alpha > 1.00 favors Reference Group; Alpha < 1.00 favors Focal Group D-DIF < 0.00 favors Reference Group, D-DIF > 0.00 favors Focal Group

Page 21: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

21

EZDIF RESULT CONT.

Results for Pass Number: 1 WARNING: Insufficient Data Found in Level: 0 - 4 SE ITEM Alpha X^2 P-Value MH D-DIF (MH D-DIF) CR 1*** 9.734 312.890 0.000 -5.348 0.332 CR 2*** 10.182 345.163 0.000 -5.453 0.323 CR 3*** 8.411 225.039 0.000 -5.004 0.363 CR 4*** 10.235 237.853 0.000 -5.466 0.400 CR 5*** 10.368 54.170 0.000 -5.496 0.871 A 6*** 0.697 8.351 0.004 0.848 0.288 A 7* 0.750 6.431 0.011 0.675 0.260 A 8 0.858 0.393 0.531 0.361 0.492 A 9 0.874 0.380 0.538 0.318 3.344 A 10* 0.747 6.584 0.010 0.686 0.262 A 11*** 0.715 8.161 0.004 0.788 0.271 B 12** 0.638 7.256 0.007 1.056 0.381 A 13 0.799 2.580 0.108 0.526 0.315 A 14*** 0.689 8.497 0.004 0.874 0.294 A 15** 0.737 6.668 0.010 0.718 0.272 A 16** 0.664 7.446 0.006 0.963 0.345 A 17 0.789 2.795 0.095 0.558 0.320

Page 22: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

22

EZDIF RESULT cont.

Number of Items Purged in Pass 1: 5 Item Numbers: 1 2 3 4 5

Results for Pass Number: 2 WARNING: Insufficient Data Found in Level: 36 - 40 SE ITEM Alpha X^2 P-Value MH D-DIF (MH D-DIF) CR 1*** 11.796 357.628 0.000 -5.799 0.342 CR 2*** 12.610 394.589 0.000 -5.956 0.337 CR 3*** 11.565 280.040 0.000 -5.753 0.385 CR 4*** 14.334 288.049 0.000 -6.257 0.429 CR 5*** 14.755 74.687 0.000 -6.325 0.897 A 6 1.000 0.004 0.951 0.000 0.287 A 7 1.000 0.003 0.956 0.000 0.259 A 8 1.000 0.010 0.919 0.000 0.478 A 9 1.000 0.257 0.612 0.000 2.412

Page 23: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

23

EZDIF RESULT cont.

Page 24: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

24

EZDIF RESULT cont.

Empirical Item Characteristic Curves

Item 1 ETS Code = C R R 0.00 0.10 0.31 0.52 0.78 0.82 0.85 0.95 0.96 1.00 1.00 9.00 F 0.00 0.00 0.02 0.13 0.21 0.32 0.37 0.52 0.70 0.84 1.00 9.00 L 0 5 9 13 16 19 21 24 27 30 33 36 U 4 8 12 15 18 20 23 26 29 32 35 40

Item 2 ETS Code = C R R 0.00 0.12 0.25 0.46 0.55 0.64 0.79 0.87 0.96 0.96 1.00 9.00 F 0.00 0.00 0.00 0.06 0.06 0.22 0.24 0.36 0.58 0.74 0.93 9.00 L 0 5 9 13 16 19 21 24 27 30 33 36 U 4 8 12 15 18 20 23 26 29 32 35 40

Item 3 ETS Code = C R R 0.00 0.02 0.03 0.15 0.12 0.23 0.40 0.52 0.74 0.84 0.98 9.00 F 0.00 0.00 0.00 0.01 0.00 0.01 0.06 0.12 0.15 0.37 0.76 9.00 L 0 5 9 13 16 19 21 24 27 30 33 36 U 4 8 12 15 18 20 23 26 29 32 35 40

Page 25: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

25

Page 26: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

26

Page 27: 1 Detection of Differential Item/Test Functioning (DIF/DTF) Using IRT Implication and example.

27