Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here for more information on Research and Evaluation in Education and Psychology, 3e

CiteULike is a free service for managing and discovering scholarly references - click here to get started.

Sign In to gain access to subscriptions and/or personal tools.
Applied Psychological Measurement
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Millsap, R. E.
Right arrow Articles by Everson, H. T.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Reviews

Methodology Review: Statistical Approaches for Assessing Measurement Bias

Roger E. Millsap

Baruch College, City University of New York

Howard T. Everson

The College Board

Statistical methods developed over the last decade for detecting measurement bias in psycho logical and educational tests are reviewed. Earlier methods for assessing measurement bias generally have been replaced by more sophisticated statistical techniques, such as the Mantel-Haenszel procedure, the standardization approach, logistic regression models, and item response theory approaches. The review employs a conceptual framework that distin guishes methods of detecting measurement bias based on either observed or unobserved conditional invariance models. Although progress has been made in the development of statistical methods for detecting measurement bias, issues related to the choice of matching variable, the nonuniform nature of measurement bias, the suitability of cur rent approaches for new and emerging perform ance assessment methods, and insights into the causes of measurement bias remain elusive. Clearly, psychometric solutions to the problems of measurement bias will further understanding of the more central issue of construct validity. The con tinuing development of statistical methods for detecting and understanding the causes of mea surement bias will continue to be an important scientific challenge.

Key Words: Index terms: bias detection, differential item functioning • item bias • measurement bias • test bias.

Applied Psychological Measurement, Vol. 17, No. 4, 297-334 (1993)
DOI: 10.1177/014662169301700401


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Applied Psychological MeasurementHome page
C. M. Woods
Testing for Differential Item Functioning With Measures of Partial Association
Applied Psychological Measurement, October 1, 2009; 33(7): 538 - 554.
[Abstract] [PDF]


Home page
Applied Psychological MeasurementHome page
C. M. Woods
Empirical Selection of Anchors for Tests of Differential Item Functioning
Applied Psychological Measurement, January 1, 2009; 33(1): 42 - 57.
[Abstract] [PDF]


Home page
Educational and Psychological MeasurementHome page
A. M. Fidalgo and J. M. Madeira
Generalized Mantel-Haenszel Methods for Differential Item Functioning Detection
Educational and Psychological Measurement, December 1, 2008; 68(6): 940 - 958.
[Abstract] [PDF]


Home page
Applied Psychological MeasurementHome page
C. M. Woods
Likelihood-Ratio DIF Testing: Effects of Nonnormality
Applied Psychological Measurement, October 1, 2008; 32(7): 511 - 526.
[Abstract] [PDF]


Home page
Educational and Psychological MeasurementHome page
W. H. Finch and B. F. French
Anomalous Type I Error Rates for Identifying One Type of Differential Item Functioning in the Presence of the Other
Educational and Psychological Measurement, October 1, 2008; 68(5): 742 - 759.
[Abstract] [PDF]


Home page
Educational and Psychological MeasurementHome page
C. A. Scherbaum and H. W. Goldstein
Examining the Relationship Between Race-Based Differential Item Functioning and Item Difficulty
Educational and Psychological Measurement, August 1, 2008; 68(4): 537 - 553.
[Abstract] [PDF]


Home page
Applied Psychological MeasurementHome page
N. G. Waller
Commingled Samples: A Neglected Source of Bias in Reliability Analysis
Applied Psychological Measurement, May 1, 2008; 32(3): 211 - 223.
[Abstract] [PDF]


Home page
Educational and Psychological MeasurementHome page
B. F. French and S. J. Maller
Iterative Purification and Effect Size Use With Logistic Regression for Differential Item Functioning Detection
Educational and Psychological Measurement, June 1, 2007; 67(3): 373 - 393.
[Abstract] [PDF]


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
P. O. Monahan, C. A. McHorney, T. E. Stump, and A. J. Perkins
Odds Ratio, Delta, ETS Classification, and Standardization Measures of DIF Magnitude for Binary Logistic Regression
Journal of Educational and Behavioral Statistics, March 1, 2007; 32(1): 92 - 109.
[Abstract] [Full Text] [PDF]


Home page
AssessmentHome page
R. Sheppard, K. Han, S. M. Colarelli, G. Dai, and D. W. King
Differential Item Functioning by Sex and Race in the Hogan Personality Inventory
Assessment, December 1, 2006; 13(4): 442 - 453.
[Abstract] [PDF]


Home page
Clin TrialsHome page
E. A Hahn, R. K Bode, H. Du, and D. Cella
Evaluating linguistic equivalence of patient-reported outcomes in a cancer clinical trial
Clinical Trials, June 1, 2006; 3(3): 280 - 290.
[Abstract] [PDF]


Home page
Eval Health ProfHome page
E. A. Hahn, B. Holzner, G. Kemmler, B. Sperner-Unterweger, S. A. Hudgens, and D. Cella
Cross-Cultural Evaluation of Health Status Using Item Response Theory: FACT-B Comparisons Between Austrian and U.S. Patients With Breast Cancer
Eval Health Prof, June 1, 2005; 28(2): 233 - 259.
[Abstract] [PDF]


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
W. Van den Noortgate and P. De Boeck
Assessing and Explaining Differential Item Functioning Using Logistic Mixed Models
Journal of Educational and Behavioral Statistics, January 1, 2005; 30(4): 443 - 464.
[Abstract] [PDF]


Home page
AssessmentHome page
S. P. Reise and J. M. Henson
Computerization and Adaptive Administration of the NEO PI-R
Assessment, December 1, 2000; 7(4): 347 - 364.
[Abstract] [PDF]


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
B. A. Hanson
Uniform DIF and DIF Defined by Differences in Item Response Functions
Journal of Educational and Behavioral Statistics, January 1, 1998; 23(3): 244 - 253.
[Abstract] [PDF]


Home page
Applied Psychological MeasurementHome page
P. Narayanon and H. Swaminathan
Identification of Items that Show Nonuniform DIF
Applied Psychological Measurement, September 1, 1996; 20(3): 257 - 274.
[Abstract] [PDF]


Home page
Applied Psychological MeasurementHome page
T. A. Ackerman and J. A. Evans
The Influence of Conditioning Scores In Performing DIF Analyses
Applied Psychological Measurement, December 1, 1994; 18(4): 329 - 342.
[Abstract] [PDF]


Home page
Applied Psychological MeasurementHome page
S.-H. Kim, A. S. Cohen, and H.-O. Kim
An Investigation of Lord's Procedure for the Detection of Differential Item Functioning
Applied Psychological Measurement, September 1, 1994; 18(3): 217 - 228.
[Abstract] [PDF]