|
Sign In to gain access to subscriptions and/or personal tools.
|
Recovery of Two- and Three-Parameter Logistic Item Characteristic Curves: A Monte Carlo Study
Charles L. Hulin
University of Illinois
Robin I. Lissak
University of Illinois
Fritz Drasgow
University of Illinois
This monte carlo study assessed the accuracy of simultaneous estimation of item and person parameters in item response theory. Item responses were simulated using the two- and three-parameter logistic models. Samples of 200, 500, 1,000, and 2,000 simulated examinees and tests of 15, 30, and 60 items were generated. Item and person parameters were then estimated using the appropriate model. The root mean squared error between recovered and actual item characteristic curves served as the principal measure of estimation accuracy for items. The accuracy of estimates of ability was assessed by both correlation and root mean squared error. The results indicate that minimum sample sizes and tests lengths depend upon the response model and the purposes of an investigation. With item responses generated by the two-parameter model, tests of 30 items and samples of 500 appear adequate for some purposes. Estimates of ability and item parameters were less accurate in small sample sizes when item responses were generated by the three-parameter logistic model. Here samples of 1,000 examinees with tests of 60 items seem to be required for highly accurate estimation. Tradeoffs between sample size and test length are apparent, however.
Applied Psychological Measurement, Vol. 6, No. 3,
249-260 (1982)
DOI: 10.1177/014662168200600301

CiteULike Complore Connotea Del.icio.us Digg Reddit Technorati Twitter What's this?
This article has been cited by other articles:

|
 |

|
 |
 
P.-W. Lei, Q. Wu, J. C. DiPerna, and P. L. Morgan
Developing Short Forms of the EARLI Numeracy Measures: Comparison of Item Selection Methods
Educational and Psychological Measurement,
October 1, 2009;
69(5):
825 - 842.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. M. Woods
Ramsay-Curve Item Response Theory for the Three-Parameter Logistic Item Response Model
Applied Psychological Measurement,
September 1, 2008;
32(6):
447 - 465.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. M. Woods
Consequences of Ignoring Guessing When Estimating the Latent Density in Item Response Theory
Applied Psychological Measurement,
July 1, 2008;
32(5):
371 - 384.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. van Barneveld
The Effect of Examinee Motivation on Test Construction Within an IRT Framework
Applied Psychological Measurement,
January 1, 2007;
31(1):
31 - 46.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
J. S. Roberts, J. R. Donoghue, and J. E. Laughlin
Characteristics of MML/EAP Parameter Estimates in the Generalized Graded Unfolding Model
Applied Psychological Measurement,
June 1, 2002;
26(2):
192 - 207.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
S.-H. Kim
An Evaluation of a Markov Chain Monte Carlo Method for the Rasch Model
Applied Psychological Measurement,
June 1, 2001;
25(2):
163 - 176.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
F. B. Baker
An Investigation of the Item Parameter Recovery Characteristics of a Gibbs Sampling Procedure
Applied Psychological Measurement,
June 1, 1998;
22(2):
153 - 169.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
J. R. Donoghue and S. P. Isham
A Comparison of Procedures to Detect Item Parameter Drift
Applied Psychological Measurement,
March 1, 1998;
22(1):
33 - 51.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
F. B. Baker
Empirical Sampling Distributions of Equating Coefficients for Graded and Nominal Response Instruments
Applied Psychological Measurement,
June 1, 1997;
21(2):
157 - 172.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
J. S. Roberts and J. E. Laughlin
A Unidimensional Item Response Model for Unfolding Responses From a Graded Disagree-Agree Response Scale
Applied Psychological Measurement,
September 1, 1996;
20(3):
231 - 255.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Harwell, C. A. Stone, T.-C. Hsu, and L. Kirisci
Monte Carlo Studies in Item Response Theory
Applied Psychological Measurement,
June 1, 1996;
20(2):
101 - 125.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
F. B. Baker
An Investigation of the Sampling Distributions of Equating Coefficients
Applied Psychological Measurement,
March 1, 1996;
20(1):
45 - 57.
[Abstract]
|
 |
|

|
 |

|
 |
 
A. Maydeu-Olivares, F. Drasgow, and A. D. Mead
Distinguishing Among Paranletric item Response Models for Polychotomous Ordered Data
Applied Psychological Measurement,
September 1, 1994;
18(3):
245 - 256.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
M. D. Miller and T.C. Oshima
Effect of Sample Size, Number of Biased Items, and Magnitude of Bias on a Two-Stage Item Bias Estimation Method
Applied Psychological Measurement,
December 1, 1992;
16(4):
381 - 388.
[PDF]
|
 |
|

|
 |

|
 |
 
C. A. Stone
Recovery of Marginal Maximum Likelihood Estimates in the Two-Parameter Logistic Response Model: An Evaluation of MULTILOG
Applied Psychological Measurement,
March 1, 1992;
16(1):
1 - 16.
[Abstract]
|
 |
|

|
 |

|
 |
 
T. Hudson
Relationships among IRT item discrimination and item fit indices in criterion-referenced language testing
Language Testing,
December 1, 1991;
8(2):
160 - 181.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Skaggs and J. Stevenson
A Comparison of Pseudo-Bayesian and Joint Maximum Likelihood Procedures for Estimating Item Parameters in the Three-Parameter IRT Model
Applied Psychological Measurement,
December 1, 1989;
13(4):
391 - 402.
[Abstract]
|
 |
|

|
 |

|
 |
 
F. Drasgow
An Evaluation of Marginal Maximum Likelihood Estimation for the Two-Parameter Logistic Model
Applied Psychological Measurement,
March 1, 1989;
13(1):
77 - 90.
[Abstract]
|
 |
|

|
 |

|
 |
 
F. B. Baker
The Item Log-Likelihood Surface for Two- and Three-Parameter Item Characteristic Curve Models
Applied Psychological Measurement,
December 1, 1988;
12(4):
387 - 395.
[Abstract]
|
 |
|

|
 |

|
 |
 
G. L. Candell and F. Drasgow
An Iterative Procedure for Linking Metrics and Assessing Item Bias in Item Response Theory
Applied Psychological Measurement,
September 1, 1988;
12(3):
253 - 260.
[Abstract]
|
 |
|

|
 |

|
 |
 
G. Skaggs and R. W. Lissitz
Effect of Examinee Ability on Test Equating Invariance
Applied Psychological Measurement,
March 1, 1988;
12(1):
69 - 82.
[Abstract]
|
 |
|

|
 |

|
 |
 
N. L. Strandmark and R. L. Linn
A Generalized Logistic Item Response Model Parameterizing Test Score Inappropriateness
Applied Psychological Measurement,
December 1, 1987;
11(4):
355 - 370.
[Abstract]
|
 |
|

|
 |

|
 |
 
F. B. Baker
Methodology Review: Item Parameter Estimation Under the One-, Two-, and Three-Parameter Logistic Models
Applied Psychological Measurement,
June 1, 1987;
11(2):
111 - 141.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
G. L. Candell and C. L. Hulin
Cross-Language and Cross-Cultural Comparisons in Scale Translations: Independent Sources of Information about Item Nonequivalence
Journal of Cross-Cultural Psychology,
December 1, 1986;
17(4):
417 - 440.
[Abstract]
|
 |
|

|
 |

|
 |
 
D. N. M. de Gruijter
The Use of Item Statistics in the Calibration of an Item Bank
Applied Psychological Measurement,
September 1, 1986;
10(3):
231 - 237.
[Abstract]
|
 |
|

|
 |

|
 |
 
G. Skaggs and R. W. Lissitz
An Exploration of the Robustness of Four Test Equating Models
Applied Psychological Measurement,
September 1, 1986;
10(3):
303 - 317.
[Abstract]
|
 |
|

|
 |

|
 |
 
F. J. R. van de Vijver
The Robustness of Rasch Estimates
Applied Psychological Measurement,
March 1, 1986;
10(1):
45 - 57.
[Abstract]
|
 |
|

|
 |

|
 |
 
S. H. Goldman and N. S. Raju
Recovery of One- and Two-Parameter Logistic Item Parameters: An Empirical Study
Educational and Psychological Measurement,
March 1, 1986;
46(1):
11 - 21.
[Abstract]
|
 |
|

|
 |

|
 |
 
D. A. Harrison
Robustness of Irt Parameter Estimation to Violations of The Unidimensionality Assumption
Journal of Educational and Behavioral Statistics,
January 1, 1986;
11(2):
91 - 115.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. H. Hui and H. C. Triandis
Measurement in Cross-Cultural Psychology: A Review and Comparison of Strategies
Journal of Cross-Cultural Psychology,
June 1, 1985;
16(2):
131 - 152.
[Abstract]
|
 |
|

|
 |

|
 |
 
C. H. Hui, F. Drasgow, and B.-H. Chang
Analysis of the Modernity Scale: An Item Response Theory Approach
Journal of Cross-Cultural Psychology,
September 1, 1983;
14(3):
259 - 278.
[Abstract]
|
 |
|

|
 |

|
 |
 
S.E. Phillips
Comparison of Equipercentile and Item Response Theory Equating When the Scaling Test Method Is Applied to a Multilevel Achievement Battery
Applied Psychological Measurement,
June 1, 1983;
7(3):
267 - 281.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
F. Drasgow and C. K. Parsons
Application of Unidimensional Item Response Theory Models to Multidimensional Data
Applied Psychological Measurement,
April 1, 1983;
7(2):
189 - 199.
[Abstract]
|
 |
|
|
|