Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here for FREE ACCESS to this landmark database

Click here for more information on Research and Evaluation in Education and Psychology, 3e

Sign In to gain access to subscriptions and/or personal tools.
Applied Psychological Measurement
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Humphreys, L. G.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Further Comments on Reliability and Power of Significance Tests

Lloyd G. Humphreys

University of Illinois, Urbana-Champaign

The controversy about the relationship between reliability and the power of significance tests exists because statisticians obtain numerical solutions by varying independently the parameters of the power of statistical tests. In contrast, researchers have empirical limitations placed on them in varying the same parameters. Reliability and power can legiti mately be decoupled by selection of the population from which to sample (Zimmerman & Williams, 1986), but this is an undependable way to increase power (Humphreys, 1991). Reducing population var iance by selection of the sample can be considered a special case of (and a crude approximation to) the analysis of covariance, which is also a more effective way of controlling individual differences in true scores than the use of difference scores. Both the regressed differences and the raw differences are less reliable within treatments than their components, but can have more power in statistical tests. As the reliability of derived scores increases, however, power increases.

Key Words: Index terms: difference scores • error of measurement • planning experiments • power • reliability, significance tests • t tests • true scores.

Applied Psychological Measurement, Vol. 17, No. 1, 11-14 (1993)
DOI: 10.1177/014662169301700102


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Organizational Research MethodsHome page
J. T. Austin, K. A. Boyle, and J. C. Lualhati
Statistical Conclusion Validity for Organizational Science Researchers: A Review
Organizational Research Methods, April 1, 1998; 1(2): 164 - 208.
[Abstract]


Home page
Applied Psychological MeasurementHome page
R. H. Williams and D. W. Zimmerman
Are Simple Gain Scores Obsolete?
Applied Psychological Measurement, March 1, 1996; 20(1): 59 - 69.
[Abstract]


Home page
Applied Psychological MeasurementHome page
D. W. Zimmerman, R. H. Williams, and B. D. Zumbo
Reliability, Power, Functions, and Relations: A Reply to Humphreys
Applied Psychological Measurement, March 1, 1993; 17(1): 15 - 16.