Stathopoulos Ioannis, Stoklasa Roman, Kouri Maria Anthi, Velonakis Georgios, Karavasilis Efstratios, Efstathopoulos Efstathios, Serio Luigi
2nd Department of Radiology, Medical School, Attikon University Hospital, National and Kapodistrian University of Athens, 11527 Athens, Greece.
Technology Department, CERN, 1211 Geneva, Switzerland.
J Imaging. 2024 Dec 31;11(1):6. doi: 10.3390/jimaging11010006.
Detection and segmentation of brain abnormalities using Magnetic Resonance Imaging (MRI) is an important task that, nowadays, the role of AI algorithms as supporting tools is well established both at the research and clinical-production level. While the performance of the state-of-the-art models is increasing, reaching radiologists and other experts' accuracy levels in many cases, there is still a lot of research needed on the direction of in-depth and transparent evaluation of the correct results and failures, especially in relation to important aspects of the radiological practice: abnormality position, intensity level, and volume. In this work, we focus on the analysis of the segmentation results of a pre-trained U-net model trained and validated on brain MRI examinations containing four different pathologies: Tumors, Strokes, Multiple Sclerosis (MS), and White Matter Hyperintensities (WMH). We present the segmentation results for both the whole abnormal volume and for each abnormal component inside the examinations of the validation set. In the first case, a dice score coefficient (DSC), sensitivity, and precision of 0.76, 0.78, and 0.82, respectively, were found, while in the second case the model detected and segmented correct (True positives) the 48.8% (DSC ≥ 0.5) of abnormal components, partially correct the 27.1% (0.05 > DSC > 0.5), and missed (False Negatives) the 24.1%, while it produced 25.1% False Positives. Finally, we present an extended analysis between the True positives, False Negatives, and False positives versus their position inside the brain, their intensity at three MRI modalities (FLAIR, T2, and T1ce) and their volume.
使用磁共振成像(MRI)检测和分割脑异常是一项重要任务,如今,人工智能算法作为辅助工具在研究和临床生产层面的作用已得到充分确立。虽然最先进模型的性能在不断提高,在许多情况下已达到放射科医生和其他专家的准确率水平,但在对正确结果和失败进行深入且透明评估的方向上仍有大量研究需要开展,特别是在放射实践的重要方面:异常位置、强度水平和体积。在这项工作中,我们专注于分析一个预训练的U-net模型的分割结果,该模型在包含四种不同病理情况的脑MRI检查上进行了训练和验证:肿瘤、中风、多发性硬化(MS)和白质高信号(WMH)。我们展示了验证集检查中整个异常体积以及每个异常成分的分割结果。在第一种情况下,分别发现骰子分数系数(DSC)、灵敏度和精度为0.76、0.78和0.82,而在第二种情况下,模型检测并分割出了48.8%(DSC≥0.5)的异常成分(真阳性),部分正确分割出了27.1%(0.05>DSC>0.5),遗漏了24.1%(假阴性),同时产生了25.1%的假阳性。最后,我们展示了真阳性、假阴性和假阳性之间关于它们在脑内的位置、在三种MRI模态(液体衰减反转恢复序列(FLAIR)、T2加权像和T1加权增强像(T1ce))下的强度以及它们的体积的扩展分析。