|
Sign In to gain access to subscriptions and/or personal tools.
|
Applied Psychological Measurement, Vol. 32, No. 4,
289-310 (2008)
DOI: 10.1177/0146621607300047
© 2008 SAGE Publications
Consistent Estimation of Rasch Item Parameters and Their Standard Errors Under Complex Sample Designs
Jon Cohen
American Institutes for Research, jcohen{at}air.org
Tsze Chan
American Institutes for Research
Tao Jiang
American Institutes for Research
Mary Seburn
American Institutes for Research
U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on ad hoc adjustments to obtain good approximations. Furthermore, psychometricians have paid little attention to the estimation of effective standard errors for Rasch models, especially under complex sample designs. This article presents a computationally efficient, statistically consistent estimator for Rasch models, based on a nonparametric marginal maximum likelihood approach, along with complete, design-consistent estimators of the standard error, based on the full information matrix and including covariance terms among items, covariances between items, and parameters of the distribution of the latent trait. Simulations support the consistency of the estimators in both simple random samples and more realistic multistage samples.
Key Words: Rasch model standard error nonparametric marginal maximum likelihood equating item response theory
References
- Adams, R.J., Wilson M.R., & Wang W.C. (1997). The multidimensional random coefficients multinomial logit model. Applied Psychological Measurement, 21, 1-23.[Abstract/Free Full Text]
- Allen, N.L., Carlson, J.E., & Zelenak, C.A. (1999). The NAEP 1996 Technical Report (NCES 1999-452). Washington, DC: National Center for Education Statistics.
- Allen, N.L., Donoghue, J.R., Schoeps, T.L., U.S. Department of Education, Office of Educational Research and Improvement, & National Center for Education Statistics. (2001). The NAEP 1998 Technical Report (NCES 2001-509). Washington, DC: National Center for Education Statistics.
- Andersen, E.B. (1973). Conditional inference and models for measuring. Cophenhagen, Denmark: Mentalhygiejnisk Forlag.
- Association of Test Publishers. (2001). Test publisher 8.2 [Computer software]. Washington, DC: Author.
- Binder, D.A. (1983). On the variances of asymptotically normal estimators from complex surveys. International Statistical Review, 51, 279-292.[ISI]
- Bock, R.D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443-459.[CrossRef][ISI]
- Cohen, J., & the American Institutes for Research. (2002). AM statistical software (Beta version 0.06.00) [Computer software]. Washington DC: American Institutes for Research. Available from http://am.air.org
- Cressie, N., & Holland, P.W. (1983). Characterizing the manifest probabilities of latent trait models. Psychometrika, 48, 129-141.[CrossRef][ISI]
- De Leeuw, J., & Verhelst, N. (1986). Maximum likelihood estimation in generalized Rasch models. Journal of Educational Statistics, 11, 183-196.[CrossRef][ISI]
- Dempster, A.P., Laird, N.M., & Rubin, D.B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society, 39(B), 1-38.
- Fox, J.P., & Glas, C.A.W. (2003). Modeling measurement error in structural multilevel models. In G. A. Marcoulides & I. Moustaki (Eds.), Latent variable and latent structure models (pp. 245-269). London: Lawrence Erlbaum.
- Glas, C.A.W. (2004). Structural IRT models. In K. Kempf-Leonard (Ed.), Encyclopedia of social measurement (pp. 697-704). Oxford, UK: Elsevier.
- Godambe, V.P., & Thompson, M.E. (1986). Parameters of super populations and survey population: Their relationship and estimation. International Statistical Review, 54, 37-59.
- Hambleton, R.K., & Swaminathan, H. (1985). Item response theory: Principles and applications. Boston: Kluwer-Nijhoff Publishing.
- Kamata, A. (1998). Some generalizations of the Rasch model: An application of the hierarchical generalized linear model. Unpublished doctoral dissertation, Michigan State University, East Lansing.
- Kamata, A. (2001). Item analysis by the hierarchical generalized linear model. Journal of Educational Measurement, 38(1), 79-93.[CrossRef][ISI]
- Linacre, J.M. (2005). Rasch-model computer programs: Program manual [Computer manual]. Available from http://www.winsteps.com/winman/index.htm
- Masters, G.N. (1982). A Rasch model for partial credit scoring. Pscychometrika, 47, 149-174.[CrossRef]
- Mislevy, R.J. (1984). Estimating latent distributions. Psychometrika, 49, 359-381.[CrossRef][ISI]
- Rabe-Hesketh, S., Skrondal, A., & Pickles, A. (2004). Generalized multilevel structural equation modeling. Psychometrika, 69, 167-190.[CrossRef][ISI]
- Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Psychology, 4, 321-333.
- Sarndal, C.E., Swensson, B., & Wretman, J. (1992). Model assisted survey sampling. New York: Springer-Verlag.
- Skrondal, A., & Rabe-Hesketh, S. (2003). Multilevel logistic regression for polytomous data and rankings. Psychometrika, 68, 267-287.[CrossRef][ISI]
- Verhelst, N.D., & Glas, C.A.W. (1995). The one parameter logistic model. In G. H. Fischer & I. W. Molenaar (Eds.), Rasch models: Foundations, recent developments, and applications. New York: Springer-Verlag.
- Verhelst, N.D., Glas, C.A.W., & Van der Sluis, A. (1984). Estimation problems in the Rasch model: The basic symmetric functions. Computational Statistics Quarterly, 1(3), 245-262.
- Verhelst, N.D., Glas, C.A.W., & Verstralen, H.H.F.M. (1995). OPLM-One-parameter logistic model [Computer software]. Arnhem, Netherlands: CITO.
- Wright, B.D., & Douglas, G.A. (1977). Best procedures for sample-free item analysis. Applied Psychological Measurement, 1, 281-294.[Abstract]

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
|