Woodland McKell, Patel Nihil, Taie Mais Al, Yung Joshua P, Netherton Tucker J, Patel Ankit B, Brock Kristy K
The University of Texas MD Anderson Cancer Center, Houston TX 77030, USA.
Rice University, Houston TX 77005, USA.
Uncertain Safe Util Mach Learn Med Imaging (2023). 2023 Oct;14291:147-156. doi: 10.1007/978-3-031-44336-7_15. Epub 2023 Oct 7.
Clinically-deployed deep learning-based segmentation models are known to fail on data outside of their training distributions. While clinicians review the segmentations, these models do tend to perform well in most instances, which could exacerbate automation bias. Therefore, it is critical to detect out-of-distribution images at inference to warn the clinicians that the model likely failed. This work applies the Mahalanobis distance post hoc to the bottleneck features of a Swin UNETR model that segments the liver on T1-weighted magnetic resonance imaging. By reducing the dimensions of the bottleneck features with principal component analysis, images the model failed on were detected with high performance and minimal computational load. Specifically, the proposed technique achieved 92% area under the receiver operating characteristic curve and 94% area under the precision-recall curve and can run in seconds on a central processing unit.
众所周知,临床应用的基于深度学习的分割模型在其训练分布之外的数据上会失效。虽然临床医生会审查分割结果,但这些模型在大多数情况下确实表现良好,这可能会加剧自动化偏差。因此,在推理时检测分布外图像以警告临床医生模型可能失败至关重要。这项工作将马氏距离事后应用于在T1加权磁共振成像上分割肝脏的Swin UNETR模型的瓶颈特征。通过主成分分析降低瓶颈特征的维度,以高性能和最小计算量检测出模型失败的图像。具体而言,所提出的技术在受试者工作特征曲线下的面积达到92%,在精确召回率曲线下的面积达到94%,并且可以在中央处理器上在数秒内运行。