Bioengineering College, Chongqing University, Chongqing, China.
The department of radiology, Southwest Hospital, Chongqing, China.
J Digit Imaging. 2023 Oct;36(5):2088-2099. doi: 10.1007/s10278-023-00865-2. Epub 2023 Jun 20.
Segmentation is a crucial step in extracting the medical image features for clinical diagnosis. Though multiple metrics have been proposed to evaluate the segmentation performance, there is no clear study on how or to what extent the segmentation errors will affect the diagnostic related features used in clinical practice. Therefore, we proposed a segmentation robustness plot (SRP) to build the link between segmentation errors and clinical acceptance, where relative area under the curve (R-AUC) was designed to help clinicians to identify the robust diagnostic related image features. In experiments, we first selected representative radiological series from time series (cardiac first-pass perfusion) and spatial series (T2 weighted images on brain tumors) of magnetic resonance images, respectively. Then, dice similarity coefficient (DSC) and Hausdorff distance (HD), as the widely used evaluation metrics, were used to systematically control the degree of the segmentation errors. Finally, the differences between diagnostic related image features extracted from the ground truth and the derived segmentation were analyzed, using the statistical method large sample size T-test to calculate the corresponding p values. The results are denoted in the SRP, where the x-axis indicates the segmentation performance using the aforementioned evaluation metric, and the y-axis shows the severity of the corresponding feature changes, which are expressed in either the p values for a single case or the proportion of patients without significant change. The experimental results in SRP show that when DSC is above 0.95 and HD is below 3 mm, the segmentation errors will not change the features significantly in most cases. However, when segmentation gets worse, additional metrics are required for further analysis. In this way, the proposed SRP indicates the impact of the segmentation errors on the severity of the corresponding feature changes. By using SRP, one could easily define the acceptable segmentation errors in a challenge. Additionally, the R-AUC calculated from SRP provides an objective reference to help the selection of reliable features in image analysis.
分割是提取医学图像特征进行临床诊断的关键步骤。尽管已经提出了多种指标来评估分割性能,但对于分割误差将如何影响临床实践中使用的诊断相关特征,以及影响程度如何,尚无明确的研究。因此,我们提出了一种分割稳健性图(SRP),以建立分割误差与临床可接受性之间的联系,其中相对曲线下面积(R-AUC)旨在帮助临床医生识别稳健的诊断相关图像特征。在实验中,我们首先从磁共振成像的时间序列(心脏首过灌注)和空间序列(脑肿瘤 T2 加权图像)中分别选择有代表性的放射学系列。然后,使用广泛使用的评估指标,即骰子相似系数(DSC)和 Hausdorff 距离(HD),系统地控制分割误差的程度。最后,使用大样本量 T 检验的统计方法分析从真实分割和导出分割中提取的诊断相关图像特征之间的差异,计算相应的 p 值。结果在 SRP 中表示,其中 x 轴表示使用上述评估指标的分割性能,y 轴表示相应特征变化的严重程度,以单个病例的 p 值或无显著变化的患者比例表示。SRP 中的实验结果表明,当 DSC 大于 0.95 且 HD 小于 3mm 时,分割误差在大多数情况下不会显著改变特征。然而,当分割效果变差时,需要额外的指标进行进一步分析。这样,所提出的 SRP 就表明了分割误差对相应特征变化严重程度的影响。通过使用 SRP,可以轻松定义挑战中的可接受分割误差。此外,SRP 中计算的 R-AUC 为图像分析中可靠特征的选择提供了客观参考。