Shenouda Mena, Gudmundsson Eyjólfur, Li Feng, Straus Christopher M, Kindler Hedy L, Dudek Arkadiusz Z, Stinchcombe Thomas, Wang Xiaofei, Starkey Adam, Armato Iii Samuel G
Department of Radiology, The University of Chicago, Chicago, IL, 60637, USA.
Icelandic Radiation Safety Authority, Reykjavik, Iceland.
J Imaging Inform Med. 2025 Apr;38(2):967-978. doi: 10.1007/s10278-024-01092-z. Epub 2024 Sep 12.
The purpose of this study was to evaluate the impact of probability map threshold on pleural mesothelioma (PM) tumor delineations generated using a convolutional neural network (CNN). One hundred eighty-six CT scans from 48 PM patients were segmented by a VGG16/U-Net CNN. A radiologist modified the contours generated at a 0.5 probability threshold. Percent difference of tumor volume and overlap using the Dice Similarity Coefficient (DSC) were compared between the reference standard provided by the radiologist and CNN outputs for thresholds ranging from 0.001 to 0.9. CNN-derived contours consistently yielded smaller tumor volumes than radiologist contours. Reducing the probability threshold from 0.5 to 0.01 decreased the absolute percent volume difference, on average, from 42.93% to 26.60%. Median and mean DSC ranged from 0.57 to 0.59, with a peak at a threshold of 0.2; no distinct threshold was found for percent volume difference. The CNN exhibited deficiencies with specific disease presentations, such as severe pleural effusion or disease in the pleural fissure. No single output threshold in the CNN probability maps was optimal for both tumor volume and DSC. This study emphasized the importance of considering both figures of merit when evaluating deep learning-based tumor segmentations across probability thresholds. This work underscores the need to simultaneously assess tumor volume and spatial overlap when evaluating CNN performance. While automated segmentations may yield comparable tumor volumes to that of the reference standard, the spatial region delineated by the CNN at a specific threshold is equally important.
本研究的目的是评估概率图阈值对使用卷积神经网络(CNN)生成的胸膜间皮瘤(PM)肿瘤轮廓的影响。对48例PM患者的186份CT扫描进行了VGG16/U-Net CNN分割。一位放射科医生修改了在0.5概率阈值下生成的轮廓。比较了放射科医生提供的参考标准与CNN在0.001至0.9阈值下输出的肿瘤体积百分比差异和使用骰子相似系数(DSC)的重叠情况。CNN得出的轮廓始终比放射科医生得出的轮廓产生更小的肿瘤体积。将概率阈值从0.5降低到0.01,平均绝对体积百分比差异从42.93%降至26.60%。中位数和平均DSC范围为0.57至0.59,在阈值为0.2时达到峰值;未发现体积百分比差异的明显阈值。CNN在特定疾病表现方面存在不足,如严重胸腔积液或胸膜裂中的疾病。CNN概率图中没有一个单一的输出阈值对于肿瘤体积和DSC都是最优的。本研究强调了在评估跨概率阈值的基于深度学习的肿瘤分割时同时考虑这两个品质因数的重要性。这项工作强调了在评估CNN性能时同时评估肿瘤体积和空间重叠的必要性。虽然自动分割可能产生与参考标准相当的肿瘤体积,但CNN在特定阈值下划定的空间区域同样重要。
J Med Imaging (Bellingham). 2018-7
Int J Comput Assist Radiol Surg. 2016-12-27
J Med Imaging (Bellingham). 2024-11
Radiol Artif Intell. 2021-7-14