Yamane T, Ikari Y, Nishio T, Ishii K, Ishii K, Kato T, Ito K, Silverman D H S, Senda M, Asada T, Arai H, Sugishita M, Iwatsubo T
From the Division of Molecular Imaging (T.Y., Y.I., T.N., M. Senda), Institute of Biomedical Research and Innovation, Kobe, Japan.
AJNR Am J Neuroradiol. 2014 Feb;35(2):244-9. doi: 10.3174/ajnr.A3665. Epub 2013 Aug 1.
The role of (18)F-FDG-PET in the diagnosis of Alzheimer disease is increasing and should be validated. The aim of this study was to assess the inter-rater variability in the interpretation of (18)F-FDG-PET images obtained in the Japanese Alzheimer's Disease Neuroimaging Initiative, a multicenter clinical research project.
This study analyzed 274 (18)F-FDG-PET scans (67 mild Alzheimer disease, 100 mild cognitive impairment, and 107 normal cognitive) as baseline scans for the Japanese Alzheimer's Disease Neuroimaging Initiative, which were acquired with various types of PET or PET/CT scanners in 23 facilities. Three independent raters interpreted all PET images by using a combined visual-statistical method. The images were classified into 7 (FDG-7) patterns by the criteria of Silverman et al and further into 2 (FDG-2) patterns.
Agreement among the 7 visual-statistical categories by at least 2 of the 3 readers occurred in >94% of cases for all groups: Alzheimer disease, mild cognitive impairment, and normal cognitive. Perfect matches by all 3 raters were observed for 62% of the cases by FDG-7 and 76 by FDG-2. Inter-rater concordance was moderate by FDG-7 (κ = 0.57) and substantial in FDG-2 (κ = 0.67) on average. The FDG-PET score, an automated quantitative index developed by Herholz et al, increased as the number of raters who voted for the AD pattern increased (ρ = 0.59, P < .0001), and the FDG-PET score decreased as those for normal pattern increased (ρ = -0.64, P < .0001).
Inter-rater agreement was moderate to substantial for the combined visual-statistical interpretation of (18)F-FDG-PET and was also significantly associated with automated quantitative assessment.
(18)F-FDG-PET在阿尔茨海默病诊断中的作用日益增强,需进行验证。本研究旨在评估在日本阿尔茨海默病神经影像学倡议(一个多中心临床研究项目)中获取的(18)F-FDG-PET图像解读的评分者间变异性。
本研究分析了274例(18)F-FDG-PET扫描(67例轻度阿尔茨海默病、100例轻度认知障碍和107例认知正常者),作为日本阿尔茨海默病神经影像学倡议的基线扫描,这些扫描是在23个机构使用各种类型的PET或PET/CT扫描仪获取的。三名独立评分者采用视觉-统计联合方法解读所有PET图像。图像根据Silverman等人的标准分为7种(FDG-7)模式,并进一步分为2种(FDG-2)模式。
在所有组(阿尔茨海默病、轻度认知障碍和认知正常)中,>94%的病例在3名读者中至少有2名对7种视觉-统计类别达成一致。FDG-7在62%的病例中观察到所有3名评分者完全匹配,FDG-2在76%的病例中观察到完全匹配。FDG-7的评分者间一致性为中等(κ = 0.57),FDG-2的评分者间一致性平均较高(κ = 0.67)。由Herholz等人开发的自动定量指标FDG-PET评分随着投票支持AD模式的评分者数量增加而升高(ρ = 0.59,P <.0001),随着投票支持正常模式的评分者数量增加而降低(ρ = -0.64,P <.0001)。
对于(18)F-FDG-PET的视觉-统计联合解读,评分者间一致性为中等至较高,且与自动定量评估也显著相关。