Kukull W A, Larson E B, Reifler B V, Lampe T H, Yerby M, Hughes J
Department of Epidemiology, University of Washington, Seattle 98195.
Neurology. 1990 Feb;40(2):257-60. doi: 10.1212/wnl.40.2.257.
To determine the interrater reliability of dementia diagnosis, 4 physicians experienced in the evaluation of dementia patients applied 3 sets of diagnostic criteria to each of 62 patients, based on a standardized set of medical record information. All patients had undergone similar examinations and follow-up to establish the initial clinical diagnosis (76% underwent autopsy). Raters were blinded to the diagnosis and to follow-up information obtained after the initial evaluation period. This paper presents interrater agreement (kappa values) for a diagnosis of Alzheimer's disease using the American Psychiatric Association diagnostic criteria from the Diagnostic and Statistical Manual (DSM-III), the National Institute of Neurological and Communicative Disorders and Stroke (NINCDS) criteria for the clinical diagnosis of Alzheimer's disease, and the Eisdorfer and Cohen Research Diagnostic Criteria (ECRDC) for primary neuronal degeneration. The NINCDS criteria showed somewhat higher average interrater reliability (kappa = 0.64) than the DSM-III criteria (kappa = 0.55) and considerably higher interrater reliability than the ECRDC (kappa = 0.37). One rater displayed conspicuously lower levels of interrater reliability than the other 3, especially with the DSM-III and ECRDC. This study indicates that the interrater reliability of the DSM-III and NINCDS criteria is comparable. Documentation of interrater reliability and, if necessary, training to improve it are important considerations in research where different observers diagnose dementing illnesses.
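For readers unfamiliar with the statistic, kappa corrects observed agreement for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e), where p_o is the proportion of patients on whom two raters agree and p_e is the agreement expected from each rater's marginal frequencies. The abstract does not state which kappa variant was used or how the "average" was formed, so the following is only a minimal sketch, assuming Cohen's kappa for each pair of raters averaged over all pairs. The function names and the ratings are hypothetical and are not data from the study.

```python
from itertools import combinations


def cohen_kappa(a, b):
    """Cohen's kappa for two raters' labels over the same patients."""
    if len(a) != len(b) or not a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(a)
    categories = set(a) | set(b)
    # Observed agreement: share of patients given the same label by both raters.
    p_o = sum(x == y for x, y in zip(a, b)) / n
    # Chance agreement: product of the two raters' marginal proportions,
    # summed over all categories.
    p_e = sum((a.count(c) / n) * (b.count(c) / n) for c in categories)
    if p_e == 1.0:
        # Both raters used a single identical category throughout.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


def mean_pairwise_kappa(ratings):
    """Average Cohen's kappa over every pair of raters (assumed averaging scheme)."""
    pairs = list(combinations(ratings, 2))
    return sum(cohen_kappa(a, b) for a, b in pairs) / len(pairs)


# Hypothetical ratings: 1 = Alzheimer's disease diagnosed, 0 = not diagnosed.
# Four raters, eight patients; invented for illustration only.
example_ratings = [
    [1, 1, 0, 1, 0, 0, 1, 1],
    [1, 1, 0, 1, 0, 1, 1, 1],
    [1, 0, 0, 1, 0, 0, 1, 1],
    [0, 1, 1, 1, 0, 0, 0, 1],
]

if __name__ == "__main__":
    print(f"mean pairwise kappa: {mean_pairwise_kappa(example_ratings):.2f}")
```

Computing the pairwise values separately, rather than only their mean, is what makes it possible to spot a single rater who agrees poorly with the others, as the abstract reports.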