Ford Jeremy N, Sweeney Elizabeth M, Skafida Myrto, Glynn Shannon, Amoashiy Michael, Lange Dale J, Lin Eaton, Chiang Gloria C, Osborne Joseph R, Pahlajani Silky, de Leon Mony J, Ivanidze Jana
Department of Radiology, Massachusetts General Hospital Boston, MA, United States.
Department of Population Health Sciences, Division of Biostatistics and Epidemiology, Weill Cornell Medical College New York, NY, United States.
Am J Nucl Med Mol Imaging. 2021 Aug 15;11(4):313-326. eCollection 2021.
Distinguishing frontotemporal lobar degeneration (FTLD) and Alzheimer Disease (AD) on FDG-PET based on qualitative review alone can pose a diagnostic challenge. SPM has been shown to improve diagnostic performance in research settings, but translation to clinical practice has been lacking. Our purpose was to create a heuristic scoring method based on statistical parametric mapping z-scores. We aimed to compare the performance of the scoring method to the initial qualitative read and a machine learning (ML)-based method as benchmarks. FDG-PET/CT or PET/MRI of 65 patients with suspected dementia were processed using SPM software, yielding z-scores from either whole brain (W) or cerebellar (C) normalization relative to a healthy cohort. A non-ML, heuristic scoring system was applied using region counts below a preset z-score cutoff. W z-scores, C z-scores, or WC z-scores (z-scores from both W and C normalization) served as features to build random forest models. The neurological diagnosis was used as the gold standard. The sensitivity of the non-ML scoring system and the random forest models to detect AD was higher than the initial qualitative read of the standard FDG-PET [0.89-1.00 vs. 0.22 (95% CI, 0-0.33)]. A categorical random forest model to distinguish AD, FTLD, and normal cases had similar accuracy than the non-ML scoring model (0.63 vs. 0.61). Our non-ML-based scoring system of SPM z-scores approximated the diagnostic performance of a ML-based method and demonstrated higher sensitivity in the detection of AD compared to qualitative reads. This approach may improve the diagnostic performance.
仅基于定性评估在氟代脱氧葡萄糖正电子发射断层扫描(FDG-PET)上区分额颞叶痴呆(FTLD)和阿尔茨海默病(AD)可能会带来诊断挑战。统计参数映射(SPM)已被证明在研究环境中可提高诊断性能,但缺乏向临床实践的转化。我们的目的是基于统计参数映射z分数创建一种启发式评分方法。我们旨在将该评分方法的性能与初始定性解读以及基于机器学习(ML)的方法作为基准进行比较。使用SPM软件对65例疑似痴呆患者的FDG-PET/CT或PET/MRI进行处理,相对于健康队列得出全脑(W)或小脑(C)标准化的z分数。使用低于预设z分数临界值的区域计数应用非ML启发式评分系统。W z分数、C z分数或WC z分数(来自W和C标准化的z分数)用作构建随机森林模型的特征。神经学诊断用作金标准。非ML评分系统和随机森林模型检测AD的敏感性高于标准FDG-PET的初始定性解读[0.89 - 1.00对0.22(95%CI,0 - 0.33)]。区分AD、FTLD和正常病例的分类随机森林模型与非ML评分模型具有相似的准确性(0.63对0.61)。我们基于SPM z分数的非ML评分系统接近基于ML方法的诊断性能,并且与定性解读相比在检测AD方面表现出更高的敏感性。这种方法可能会提高诊断性能。