|
Sign In to gain access to subscriptions and/or personal tools.
|
Applied Psychological Measurement, Vol. 32, No. 1,
62-80 (2008)
DOI: 10.1177/0146621607311579
Invariance of Equating Functions Across Different Subgroups of Examinees Taking a Science Achievement Test
Qing Yi
19500 Bulverde Road, San Antonio, TX 78259, Qing_Yi{at}harcourt.com
Harcourt Assessment
ACT
Deborah J. Harris
ACT
Xiaohong Gao
ACT
This study investigated the group invariance of equating results using a science achievement test. Examinees were divided into different subgroups based on the average composite score for test centers, whether they had taken a physics course, and self-reported science grade point average. The reason for dividing examinees into subgroups using such variables is that those variables are more related to performance on a science achievement test than, say, gender. Results indicated that the conversions obtained from different subgroups were similar to the conversions obtained by using the total group, except when the groups were divided based on whether a student had taken a physics course. Where there were differences, the differences were generally 1 equated raw score point.
Key Words: Index terms: group invariance score equating equating conversions IRT true score observed-score equating methods
References
- Angoff, W.H. (1971). Scales, norms and equivalent scores. In R. L. Thorndike (Ed.), Educational measurement (2nd ed., pp. 508-600). Washington, DC: American Council on Education.
- Angoff, W.H., & Cowell, W.R. (1986). An examination of the assumption that the equating of parallel forms is population-independent. Journal of Educational Measurement, 23, 327-345.[CrossRef][ISI]
- Cook, L.L., & Petersen, N.S. (1987). Problems related to the use of conventional and item response theory equating methods in less than optimal circumstances. Applied Psychological Measurement, 11, 225-244.[Abstract]
- Dorans, N.J., & Feigenbaum, M.D. (1994). Equating issues engendered by changes to the SAT and PSAT/NMSQT. In I. M. Lawrence, N. J. Dorans, M. D. Feigenbaum, N. J. Feryok, A. P. Schmitt, & N. K. Wright (Eds.), Technical issues related to the introduction of the new SAT and PSAT/NMSQT (ETS Research Memorandum No. RM-94-10). Princeton, NJ: Educational Testing Service.
- Dorans, N.J., & Holland, P.W. (2000). Population invariance and equatability of tests: Basic theory and the linear case. Journal of Educational Measurement, 37, 281-306.[CrossRef]
- Dorans, N.J., Holland, P.W., Thayer, D.T., & Tateneni, K. (2002, April). Invariance of score linking across gender groups for three Advanced Placement Program exams. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans, LA.
- Harris, D.J., & Kolen, M.J. (1986). Effect of examinee group on equating relationships. Applied Psychological Measurement, 10, 35-43.[Abstract]
- Kolen, M.J., & Brennan, R.L. (2004). Test equating, scaling, and linking: Methods and practices (2nd ed.). New York: Springer-Verlag.
- Liu, M. & Holland, P.W. (2008). Exploring Population Sensitivity of Linking Functions Across Three Law School Admission Test Administrations. Applied Psychological Measurement, 32, 27-44.[Abstract/Free Full Text]
- Lord, F.M. (1980). Application of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum.
- Lord, F.M., & Wingersky, M.S. (1984). Comparison of IRT true-score and equipercentile observed-score ``equatings.'' Applied Psychological Measurement, 8, 452-461.
- von Davier, A.A., & Wilson, C. (2008). Investigating the Population Sensitivity Assumption of Item Response Theory True-Score Equating Across Two Subgroups of Examinees and Two Test Formats. Applied Psychological Measurement, 32, 11-26.[Abstract/Free Full Text]
- Yang, W.-L., Dorans, N.J., & Tateneni, K. (2002, April). Sample selection effect on AP multiple-choice score to composite score scaling. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans, LA.
- Yang, W.-L., & Gao, R. (2008). Invariance of Score Linkings Across Gender Groups for Forms of a Testlet-Based College-Level Examination Program Examination. Applied Psychological Measurement, 32, 45-61.[Abstract/Free Full Text]
- Yi, Q., Harris, H., & Gao, X. (2004, April). Invariance of IRT equating across different sub-populations taking a science achievement test. Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego, CA.
- Zimowski, M.F., Muraki, E., Mislevy, R.J., & Bock, R.D. (1996). BILOG-MG [Computer software]. Lincolnwood, IL: Scientific Software International.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
N. J. Dorans, Jinghua Liu, and S. Hammond
Anchor Test Type and Population Invariance: An Exploration Across Subpopulations and Test Administrations
Applied Psychological Measurement,
January 1, 2008;
32(1):
81 - 97.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
N. S. Petersen
A Discussion of Population Invariance of Equating
Applied Psychological Measurement,
January 1, 2008;
32(1):
98 - 101.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
R. L. Brennan
A Discussion of Population Invariance
Applied Psychological Measurement,
January 1, 2008;
32(1):
102 - 114.
[PDF]
|
 |
|
|