Applied Psychological Measurement

 

Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here for more information

Sign In to gain access to subscriptions and/or personal tools.
This Article
Right arrow Free Full Text (Free PDF) Free
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wilson, M.
Right arrow Articles by Wang, W.-c.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Applied Psychological Measurement, Vol. 19, No. 1, 51-71 (1995)
DOI: 10.1177/014662169501900107
© 1995 SAGE Publications

Complex Composites: Issues That Arise in Combining Different Modes of Assessment

Mark Wilson

University of California, Berkeley

Wen-chung Wang

National Taiwan University

Data from the California Learning Assessment System are used to examine certain characteristics of tests designed as the composites of items of different modes. The characteristics include rater severity, test information, and definition of the latent variable. Three different assessment modes-multiple-choice, open-ended, and investigation items (the latter two are referred to as performance-based modes)-were combined in a test across three different test forms. Rater severity was investigated by incorporating a rater parameter for each rater in an item response model that then was used to analyze the data. Some rater severities were found to be quite extreme, and the impact of this variation in rater severities on both total scores and trait level estimates was examined. Within-rater variation in rater severity also was examined and was found to have significant variation. The information contribution of the three modes was compared. Performance-based items provided more information than multiple-choice items and also provided greatest precision for higher levels of the latent variable. A projection-like method was applied to investigate the effects of assessment mode on the definition of the latent variable. The multiple-choice items added information to the performance-based variable. The results of the analysis also showed that the projection-like method did not practically differ from the results when the latent trait was defined jointly by both the multiple-choice and the performance-based items. Index terms: equating, linking, multiple assessment modes, polytomous item response models, rater effects.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
R. J. Patz, B. W. Junker, M. S. Johnson, and L. T. Mariano
The Hierarchical Rater Model for Rated Test Items and its Application to Large-Scale Educational Assessment Data
Journal of Educational and Behavioral Statistics, January 1, 2002; 27(4): 341 - 384.
[Abstract] [PDF]


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
M. Wilson and M. Hoskens
The Rater Bundle Model
Journal of Educational and Behavioral Statistics, January 1, 2001; 26(3): 283 - 306.
[Abstract] [PDF]


Home page
Review of Public Personnel AdministrationHome page
D. Southworth
Using Statistical-Based Rating Sheets to Measure Oral Test Inter-Rater Reliability
Review of Public Personnel Administration, July 1, 2000; 20(3): 43 - 57.
[PDF]


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
R. J. Patz and B. W. Junker
Applications and Extensions of MCMC in IRT: Multiple Item Types, Missing Data, and Rated Responses
Journal of Educational and Behavioral Statistics, January 1, 1999; 24(4): 342 - 366.
[Abstract] [PDF]