A comparison of methods for estimating internal consistency reliability of tests with dichotomously-scored items

Date
1995
Authors
Lupien, Alfred E.
Abstract
The use of Kuder and Richardson's (1937) formula 20 (KR20) to estimate reliability has been controversial. The purpose of this investigation was to review internal consistency reliability estimation techniques for unidimensional tests with dichotomously-scored items. Eleven methods were compared using a series of 98 simulated item-by-person response patterns with positive off-diagonal covariances, including patterns known to reflect perfect reliability by Loevinger's index of homogeneity (1947) and KR20. The upper limit of 1.0 was achieved in both perfect patterns only by the methods described by Cliff (1984), Horst (1953), Loevinger, and Raju (1982). Lower limits of reliability were projected through linear regression, with the ratio of off-diagonal covariance to test variance as the independent variable. Zero was included in the 95% confidence interval for Y-intercepts with Cliff's, Horst's, and Kuder-Richardson's techniques. Negative Y-intercepts were computed for the techniques of Cliff, Huck (1978), Loevinger, and Winer (1971). Positive Y-intercepts were computed for the techniques of Ayabe (1994), Guttman (L1 and L2) (1945), Raju, and ten Berge and Zegers (1978). Between the upper and lower limits, reliability estimates generally increased as the ratio of off-diagonal covariance to total variance increased. It was concluded that the majority of estimation techniques do not meet minimum criteria for interpretation. Only the methods of Cliff, Horst, and Raju generally met the requirements for reliability estimation techniques. Compared to KR20, the mean increases in reliability estimated with these three methods were .12 with Raju's ratio of actual to maximal KR20, .04 with Horst's method, and .00 with Cliff's γ-reliability technique.
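For readers unfamiliar with the baseline statistic, the following is a minimal sketch of the standard KR20 computation on a persons-by-items matrix of 0/1 scores. It is not taken from the thesis: the function names `kr20` and `covariance_ratio`, the use of population (divide-by-N) variances, and the two small response patterns are assumptions made for illustration. The second function computes the share of total-score variance due to off-diagonal item covariances, which is one plausible reading of the independent variable named in the abstract.

```python
from typing import List, Tuple


def _variance_parts(responses: List[List[int]]) -> Tuple[int, float, float]:
    """Return (number of items, sum of item variances, total-score variance)."""
    n_persons = len(responses)
    n_items = len(responses[0])
    # p_i is the proportion of persons scoring 1 on item i;
    # the population variance of a 0/1 item is p_i * (1 - p_i).
    p = [sum(row[i] for row in responses) / n_persons for i in range(n_items)]
    sum_item_var = sum(pi * (1 - pi) for pi in p)
    # Population variance of the total scores (assumption: divide by N, not N - 1).
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n_persons
    total_var = sum((t - mean_total) ** 2 for t in totals) / n_persons
    if n_items < 2 or total_var == 0:
        raise ValueError("need at least two items and nonzero total-score variance")
    return n_items, sum_item_var, total_var


def covariance_ratio(responses: List[List[int]]) -> float:
    """Share of total-score variance due to off-diagonal item covariances."""
    _, sum_item_var, total_var = _variance_parts(responses)
    # Total variance = sum of item variances + sum of off-diagonal covariances.
    return (total_var - sum_item_var) / total_var


def kr20(responses: List[List[int]]) -> float:
    """KR20 = k / (k - 1) * (1 - sum(p_i * q_i) / total-score variance)."""
    k, sum_item_var, total_var = _variance_parts(responses)
    return (k / (k - 1)) * (1 - sum_item_var / total_var)


# Hypothetical patterns (rows are persons, columns are items).
identical_items = [[0, 0, 0], [1, 1, 1]]
cumulative_items = [[0, 0, 0], [1, 0, 0], [1, 1, 0], [1, 1, 1]]

print(kr20(identical_items))
print(kr20(cumulative_items), covariance_ratio(cumulative_items))
```

Under these assumptions KR20 equals k/(k - 1) times the covariance ratio, so the two quantities rise together. In the made-up data, the pattern with identical items gives a KR20 of essentially 1.0, while the cumulative pattern gives 0.75, which illustrates why an estimator may fail to reach its upper limit when item difficulties differ.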
Description
Thesis (Ph. D.)--University of Hawaii at Manoa, 1995.
Includes bibliographical references (leaves 154-160).
Microfiche.
xiv, 160 leaves, bound ill. 29 cm
Keywords
Educational tests and measurements -- Evaluation
Related To
Theses for the degree of Doctor of Philosophy (University of Hawaii at Manoa). Educational Psychology; no. 3145
Rights
All UHM dissertations and theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission from the copyright owner.