Russell Carl M, Williamson David F, Bartko John J, Bradley Edwin L
Office of Biostatistics and Department of Oral Diagnosis & Patient Services, Medical College of Georgia, Augusta, Georgia, 30912-4900.
Division of Nutrition, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, Georgia 30341-3724.
Am J Hum Biol. 1994;6(3):311-320. doi: 10.1002/ajhb.1310060306.
Reliability is a subject of continuing discussion in biomedial specialty areas, including physical anthropology and nutritional epidemiology. The purpose of this study was to explore techniques of detecting differences between two evaluators or methods. A field study in which anthropometric dimensions would be taken by two independent evaluators on each participant in a study group was simulated. A panel of reliability indicators was applied across a broad range of parameters using simulation, and then the panel was applied to field anthropometric data. The panel consisted of the intraclass correlation coefficient (ICC), paired t-test, a simultaneous test of evaluator means and variances, technical error of measurement, mean absolute difference, and mean difference. The simultaneous test for equal evaluator means and variances uses regression to model paired differences versus paired sums. The simulation demonstrated general properties of the reliability indicators across many conditions of population variance, measurer bias, and measurer error variance. High values of ICC often exist in cases in which the measurers are different. The simultaneous test is thus a powerful method for detecting measurer differences, especially when combined with the paired t-test. However, a single reliability indicator that is sufficient to determine all measurer inconsistencies was not identified. The field study and the simulation permitted the development of a logical approach to determining the source and magnitude of measurer differences using the panel of reliability indicators. © 1994 Wiley-Liss, Inc.
在生物医学专业领域,包括体质人类学和营养流行病学,可靠性一直是持续讨论的主题。本研究的目的是探索检测两位评估者或两种方法之间差异的技术。模拟了一项实地研究,其中研究组中的每位参与者的人体测量维度将由两位独立评估者进行测量。使用模拟方法,在广泛的参数范围内应用了一组可靠性指标,然后将该指标组应用于实地人体测量数据。该指标组包括组内相关系数(ICC)、配对t检验、评估者均值和方差的同时检验、测量技术误差、平均绝对差和平均差。评估者均值和方差相等的同时检验使用回归模型来模拟配对差异与配对和。模拟展示了可靠性指标在总体方差、测量者偏差和测量者误差方差的多种条件下的一般特性。在测量者不同的情况下,ICC值往往较高。因此,同时检验是检测测量者差异的有力方法,特别是与配对t检验结合使用时。然而,尚未确定一个足以确定所有测量者不一致情况的单一可靠性指标。实地研究和模拟允许开发一种逻辑方法,使用可靠性指标组来确定测量者差异的来源和大小。© 1994威利 - 利斯公司。