Methodology Review: Statistical Approaches for Assessing Measurement Bias
Roger E. Millsap
Baruch College, City University of New York
Howard T. Everson
The College Board
Statistical methods developed over the last decade for detecting measurement bias in psychological and educational tests are reviewed. Earlier methods for assessing measurement bias generally have been replaced by more sophisticated statistical techniques, such as the Mantel-Haenszel procedure, the standardization approach, logistic regression models, and item response theory approaches. The review employs a conceptual framework that distinguishes methods of detecting measurement bias based on either observed or unobserved conditional invariance models. Although progress has been made in the development of statistical methods for detecting measurement bias, issues related to the choice of matching variable, the nonuniform nature of measurement bias, the suitability of current approaches for new and emerging performance assessment methods, and insights into the causes of measurement bias remain elusive. Clearly, psychometric solutions to the problems of measurement bias will further understanding of the more central issue of construct validity. The continuing development of statistical methods for detecting and understanding the causes of measurement bias will continue to be an important scientific challenge.
Index terms: bias detection, differential item functioning, item bias, measurement bias, test bias.
Applied Psychological Measurement, Vol. 17, No. 4, 297-334 (1993)
DOI: 10.1177/014662169301700401

This article has been cited by other articles:

C. M. Woods. Testing for Differential Item Functioning With Measures of Partial Association. Applied Psychological Measurement, October 1, 2009; 33(7): 538-554.
C. M. Woods. Empirical Selection of Anchors for Tests of Differential Item Functioning. Applied Psychological Measurement, January 1, 2009; 33(1): 42-57.
A. M. Fidalgo and J. M. Madeira. Generalized Mantel-Haenszel Methods for Differential Item Functioning Detection. Educational and Psychological Measurement, December 1, 2008; 68(6): 940-958.
C. M. Woods. Likelihood-Ratio DIF Testing: Effects of Nonnormality. Applied Psychological Measurement, October 1, 2008; 32(7): 511-526.
W. H. Finch and B. F. French. Anomalous Type I Error Rates for Identifying One Type of Differential Item Functioning in the Presence of the Other. Educational and Psychological Measurement, October 1, 2008; 68(5): 742-759.
C. A. Scherbaum and H. W. Goldstein. Examining the Relationship Between Race-Based Differential Item Functioning and Item Difficulty. Educational and Psychological Measurement, August 1, 2008; 68(4): 537-553.
N. G. Waller. Commingled Samples: A Neglected Source of Bias in Reliability Analysis. Applied Psychological Measurement, May 1, 2008; 32(3): 211-223.
B. F. French and S. J. Maller. Iterative Purification and Effect Size Use With Logistic Regression for Differential Item Functioning Detection. Educational and Psychological Measurement, June 1, 2007; 67(3): 373-393.
P. O. Monahan, C. A. McHorney, T. E. Stump, and A. J. Perkins. Odds Ratio, Delta, ETS Classification, and Standardization Measures of DIF Magnitude for Binary Logistic Regression. Journal of Educational and Behavioral Statistics, March 1, 2007; 32(1): 92-109.
R. Sheppard, K. Han, S. M. Colarelli, G. Dai, and D. W. King. Differential Item Functioning by Sex and Race in the Hogan Personality Inventory. Assessment, December 1, 2006; 13(4): 442-453.
E. A. Hahn, R. K. Bode, H. Du, and D. Cella. Evaluating linguistic equivalence of patient-reported outcomes in a cancer clinical trial. Clinical Trials, June 1, 2006; 3(3): 280-290.
E. A. Hahn, B. Holzner, G. Kemmler, B. Sperner-Unterweger, S. A. Hudgens, and D. Cella. Cross-Cultural Evaluation of Health Status Using Item Response Theory: FACT-B Comparisons Between Austrian and U.S. Patients With Breast Cancer. Eval Health Prof, June 1, 2005; 28(2): 233-259.
W. Van den Noortgate and P. De Boeck. Assessing and Explaining Differential Item Functioning Using Logistic Mixed Models. Journal of Educational and Behavioral Statistics, January 1, 2005; 30(4): 443-464.
S. P. Reise and J. M. Henson. Computerization and Adaptive Administration of the NEO PI-R. Assessment, December 1, 2000; 7(4): 347-364.
B. A. Hanson. Uniform DIF and DIF Defined by Differences in Item Response Functions. Journal of Educational and Behavioral Statistics, January 1, 1998; 23(3): 244-253.
P. Narayanan and H. Swaminathan. Identification of Items that Show Nonuniform DIF. Applied Psychological Measurement, September 1, 1996; 20(3): 257-274.
T. A. Ackerman and J. A. Evans. The Influence of Conditioning Scores In Performing DIF Analyses. Applied Psychological Measurement, December 1, 1994; 18(4): 329-342.
S.-H. Kim, A. S. Cohen, and H.-O. Kim. An Investigation of Lord's Procedure for the Detection of Differential Item Functioning. Applied Psychological Measurement, September 1, 1994; 18(3): 217-228.