Rudie Jeffrey D, Rauschecker Andreas M, Xie Long, Wang Jiancong, Duong Michael Tran, Botzolakis Emmanuel J, Kovalovich Asha, Egan John M, Cook Tessa, Bryan R Nick, Nasrallah Ilya M, Mohan Suyash, Gee James C
Department of Radiology, Hospital of the University of Pennsylvania, 3400 Spruce St, Philadelphia, PA 19104 (J.D.R., L.X., A.K., J.M.E., T.C., I.M.N., S.M., J.C.G.); Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, Calif (J.D.R., A.M.R.); Penn Image Computing and Science Laboratory, University of Pennsylvania, Philadelphia, Pa (X.L., J.W.); University of Pennsylvania Perelman School of Medicine, Philadelphia, Pa (M.T.D.); Mecklenburg Radiology Associates, Charlotte, NC (E.J.B.); Department of Radiology, University of Texas, Austin, Tex (R.N.B.); and Division of Nuclear Medicine and Clinical Molecular Imaging, Department of Radiology, University of Pennsylvania, Philadelphia, Pa (I.M.N.).
Radiol Artif Intell. 2020 Sep 23;2(5):e190146. doi: 10.1148/ryai.2020190146. eCollection 2020 Sep.
To develop and validate a system that could perform automated diagnosis of common and rare neurologic diseases involving deep gray matter on clinical brain MRI studies.
In this retrospective study, multimodal brain MRI scans from 212 patients (mean age, 55 years ± 17 [standard deviation]; 113 women) with 35 neurologic diseases and normal brain MRI scans obtained between January 2008 and January 2018 were included (110 patients in the training set, 102 patients in the test set). MRI scans from 178 patients (mean age, 48 years ± 17; 106 women) were used to supplement training of the neural networks. Three-dimensional convolutional neural networks and atlas-based image processing were used for extraction of 11 imaging features. Expert-derived Bayesian networks incorporating domain knowledge were used for differential diagnosis generation. The performance of the artificial intelligence (AI) system was assessed by comparing diagnostic accuracy with that of radiologists of varying levels of specialization by using the generalized estimating equation with robust variance estimator for the top three differential diagnoses (T3DDx) and the correct top diagnosis (TDx), as well as with receiver operating characteristic analyses.
In the held-out test set, the imaging pipeline detected 11 key features on brain MRI scans with 89% accuracy (sensitivity, 81%; specificity, 95%) relative to academic neuroradiologists. The Bayesian network, integrating imaging features with clinical information, had an accuracy of 85% for T3DDx and 64% for TDx, which was better than that of radiology residents ( = 4; 56% for T3DDx, 36% for TDx; < .001 for both) and general radiologists ( = 2; 53% for T3DDx, 31% for TDx; < .001 for both). The accuracy of the Bayesian network was better than that of neuroradiology fellows ( = 2) for T3DDx (72%; = .003) but not for TDx (59%; = .19) and was not different from that of academic neuroradiologists ( = 2; 84% T3DDx, 65% TDx; > .09 for both).
A hybrid AI system was developed that simultaneously provides a quantitative assessment of disease burden, explainable intermediate imaging features, and a probabilistic differential diagnosis that performed at the level of academic neuroradiologists. This type of approach has the potential to improve clinical decision making for common and rare diseases.© RSNA, 2020.
开发并验证一种能够对临床脑MRI研究中涉及深部灰质的常见和罕见神经系统疾病进行自动诊断的系统。
在这项回顾性研究中,纳入了2008年1月至2018年1月期间212例患有35种神经系统疾病的患者(平均年龄55岁±17[标准差];113名女性)的多模态脑MRI扫描以及正常脑MRI扫描(训练集110例患者,测试集102例患者)。178例患者(平均年龄48岁±17;106名女性)的MRI扫描用于补充神经网络的训练。使用三维卷积神经网络和基于图谱的图像处理来提取11种影像特征。结合领域知识的专家衍生贝叶斯网络用于生成鉴别诊断。通过使用具有稳健方差估计器的广义估计方程,将人工智能(AI)系统的诊断准确性与不同专业水平的放射科医生进行比较,以评估人工智能(AI)系统的性能,用于前三位鉴别诊断(T3DDx)和正确的首位诊断(TDx),以及进行受试者操作特征分析。
在保留测试集中,成像流程在脑MRI扫描上检测到11个关键特征,相对于学术神经放射科医生,其准确率为89%(敏感性81%;特异性95%)。将影像特征与临床信息相结合的贝叶斯网络,T3DDx的准确率为85%,TDx的准确率为64%,优于放射科住院医师(n = 4;T'3DDx为56%,TDx为36%;两者均P <.001)和普通放射科医生(n = 2;T3DDx为53%,TDx为31%;两者均P <.001)。贝叶斯网络对于T3DDx的准确率优于神经放射科进修医师(n = 2)(72%;P = 0.003),但对于TDx则不然(59%;P = 0.19),并且与学术神经放射科医生(n = 2;T3DDx为84%,TDx为65%;两者均P > 0.09)没有差异。
开发了一种混合AI系统,该系统同时提供疾病负担的定量评估、可解释的中间影像特征以及概率性鉴别诊断,其性能与学术神经放射科医生相当。这种方法有可能改善常见和罕见疾病的临床决策。©RSNA,2020年。