评价自动分割算法的参考标准：MRI 上手动勾画前列腺轮廓的观察者间变异性的量化。

Reference standard for the evaluation of automatic segmentation algorithms: Quantification of inter observer variability of manual delineation of prostate contour on MRI.

机构信息

Department of Radiology, Hôpitaux Universitaire de Strasbourg, Hôpital de Hautepierre, 67200, Strasbourg, France; Breast and Thyroid Imaging Unit, Institut de Cancérologie Strasbourg Europe, 67200, Strasbourg, France; IGBMC, Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67400, Illkirch, France.

Inria, Epione Team, Sophia Antipolis, Université Côte d'Azur, 06902, Nice, France.

出版信息

Diagn Interv Imaging. 2024 Feb;105(2):65-73. doi: 10.1016/j.diii.2023.08.001. Epub 2023 Aug 21.

DOI:10.1016/j.diii.2023.08.001

PMID:37822196

Abstract

PURPOSE

The purpose of this study was to investigate the relationship between inter-reader variability in manual prostate contour segmentation on magnetic resonance imaging (MRI) examinations and determine the optimal number of readers required to establish a reliable reference standard.

MATERIALS AND METHODS

Seven radiologists with various experiences independently performed manual segmentation of the prostate contour (whole-gland [WG] and transition zone [TZ]) on 40 prostate MRI examinations obtained in 40 patients. Inter-reader variability in prostate contour delineations was estimated using standard metrics (Dice similarity coefficient [DSC], Hausdorff distance and volume-based metrics). The impact of the number of readers (from two to seven) on segmentation variability was assessed using pairwise metrics (consistency) and metrics with respect to a reference segmentation (conformity), obtained either with majority voting or simultaneous truth and performance level estimation (STAPLE) algorithm.

RESULTS

The average segmentation DSC for two readers in pairwise comparison was 0.919 for WG and 0.876 for TZ. Variability decreased with the number of readers: the interquartile ranges of the DSC were 0.076 (WG) / 0.021 (TZ) for configurations with two readers, 0.005 (WG) / 0.012 (TZ) for configurations with three readers, and 0.002 (WG) / 0.0037 (TZ) for configurations with six readers. The interquartile range decreased slightly faster between two and three readers than between three and six readers. When using consensus methods, variability often reached its minimum with three readers (with STAPLE, DSC = 0.96 [range: 0.945-0.971] for WG and DSC = 0.94 [range: 0.912-0.957] for TZ, and interquartile range was minimal for configurations with three readers.

CONCLUSION

The number of readers affects the inter-reader variability, in terms of inter-reader consistency and conformity to a reference. Variability is minimal for three readers, or three readers represent a tipping point in the variability evolution, with both pairwise-based metrics or metrics with respect to a reference. Accordingly, three readers may represent an optimal number to determine references for artificial intelligence applications.

摘要

目的

本研究旨在探讨磁共振成像（MRI）检查中手动前列腺轮廓分割的读者间变异性，并确定建立可靠参考标准所需的最佳读者数量。

材料与方法

7 名具有不同经验的放射科医生对 40 名患者的 40 次前列腺 MRI 检查分别进行了前列腺轮廓（全腺[WG]和移行区[TZ]）的手动分割。使用标准指标（Dice 相似系数[DSC]、Hausdorff 距离和基于体积的指标）评估前列腺轮廓勾画的读者间变异性。使用两两比较指标（一致性）和参考分割指标（一致性）评估读者数量（从 2 名到 7 名）对分割变异性的影响，参考分割采用多数投票或同时真实和性能水平估计（STAPLE）算法获得。

结果

两名读者的平均分割 DSC 为 0.919（WG）和 0.876（TZ）。随着读者数量的增加，变异性降低：DSC 的四分位间距为 0.076（WG）/0.021（TZ）（两名读者）、0.005（WG）/0.012（TZ）（三名读者）和 0.002（WG）/0.0037（TZ）（六名读者）。两名读者和三名读者之间的四分位间距变化比三名读者和六名读者之间的变化稍快。使用一致性方法时，变异性通常在三名读者时达到最小值（使用 STAPLE，全腺的 DSC=0.96[范围：0.945-0.971]，TZ 的 DSC=0.94[范围：0.912-0.957]，四分位间距在三名读者的配置中最小）。

结论

读者数量会影响读者间的一致性和与参考标准的一致性，从而影响读者间的变异性。三名读者时变异性最小，或者三名读者代表变异性演变的临界点，无论是基于两两比较的指标还是参考指标都是如此。因此，三名读者可能是确定人工智能应用参考标准的最佳人数。

相似文献

Reference standard for the evaluation of automatic segmentation algorithms: Quantification of inter observer variability of manual delineation of prostate contour on MRI.评价自动分割算法的参考标准：MRI 上手动勾画前列腺轮廓的观察者间变异性的量化。

Diagn Interv Imaging. 2024 Feb;105(2):65-73. doi: 10.1016/j.diii.2023.08.001. Epub 2023 Aug 21.

Challenge of prostate MRI segmentation on T2-weighted images: inter-observer variability and impact of prostate morphology.T2加权图像上前列腺MRI分割的挑战：观察者间变异性及前列腺形态的影响

Insights Imaging. 2021 Jun 5;12(1):71. doi: 10.1186/s13244-021-01010-9.

Spatially varying accuracy and reproducibility of prostate segmentation in magnetic resonance images using manual and semiautomated methods.使用手动和半自动方法在磁共振图像中前列腺分割的空间变化准确性和可重复性。

Med Phys. 2014 Nov;41(11):113503. doi: 10.1118/1.4899182.

Manual prostate MRI segmentation by readers with different experience: a study of the learning progress.手动前列腺 MRI 分割：不同经验读者的研究——学习进展。

Eur Radiol. 2024 Jul;34(7):4801-4809. doi: 10.1007/s00330-023-10515-4. Epub 2024 Jan 2.

Variability of manual segmentation of the prostate in axial T2-weighted MRI: A multi-reader study.轴向 T2 加权 MRI 中前列腺手动分割的可变性：一项多读者研究。

Eur J Radiol. 2019 Dec;121:108716. doi: 10.1016/j.ejrad.2019.108716. Epub 2019 Oct 25.

Segmentation of prostate zones using probabilistic atlas-based method with diffusion-weighted MR images.基于概率图谱法并结合扩散加权磁共振图像对前列腺区域进行分割。

Comput Methods Programs Biomed. 2020 Nov;196:105572. doi: 10.1016/j.cmpb.2020.105572. Epub 2020 Jun 2.

Accuracy Validation of an Automated Method for Prostate Segmentation in Magnetic Resonance Imaging.磁共振成像中前列腺自动分割方法的准确性验证。

J Digit Imaging. 2017 Dec;30(6):782-795. doi: 10.1007/s10278-017-9964-7.

Evaluation of auto-segmentation accuracy of cloud-based artificial intelligence and atlas-based models.基于云的人工智能和图谱模型的自动分割准确性评估。

Radiat Oncol. 2021 Sep 9;16(1):175. doi: 10.1186/s13014-021-01896-1.

Automatic segmentation of myocardium at risk from contrast enhanced SSFP CMR: validation against expert readers and SPECT.基于对比增强稳态自由进动心脏磁共振成像自动分割危险心肌：与专家阅片及单光子发射计算机断层扫描的对比验证

BMC Med Imaging. 2016 Mar 5;16:19. doi: 10.1186/s12880-016-0124-1.

Combined model-based and deep learning-based automated 3D zonal segmentation of the prostate on T2-weighted MR images: clinical evaluation.基于模型和深度学习的T2加权磁共振图像前列腺自动三维分区联合分割：临床评估

Eur Radiol. 2022 May;32(5):3248-3259. doi: 10.1007/s00330-021-08408-5. Epub 2022 Jan 10.

引用本文的文献

Defining ground truth for prostate segmentation of transrectal ultrasound images: Inter- and intra-observer variability of manual versus semi-automatic methods.定义经直肠超声图像前列腺分割的真实标准：手动与半自动方法的观察者间和观察者内变异性

Med Phys. 2025 Aug;52(8):e18025. doi: 10.1002/mp.18025.

Automatic quantification of left atrium volume for cardiac rhythm analysis leveraging 3D residual UNet for time-varying segmentation of ECG-gated CT.利用3D残差UNet对心电图门控CT进行时变分割，自动定量左心房容积以进行心律分析。

Comput Struct Biotechnol J. 2025 May 13;28:175-189. doi: 10.1016/j.csbj.2025.04.039. eCollection 2025.

Deep learning-based segmentation of kidneys and renal cysts on T2-weighted MRI from patients with autosomal dominant polycystic kidney disease.基于深度学习的常染色体显性多囊肾病患者 T2 加权 MRI 上肾脏和肾囊肿的分割。

Eur Radiol Exp. 2024 Oct 30;8(1):122. doi: 10.1186/s41747-024-00520-7.

Achieving accurate prostate auto-segmentation on CT in the absence of MR imaging.在没有磁共振成像的情况下在CT上实现准确的前列腺自动分割。

Radiother Oncol. 2025 Jan;202:110588. doi: 10.1016/j.radonc.2024.110588. Epub 2024 Oct 16.

Neural network segmentation of disc volume from magnetic resonance images and the effect of degeneration and spinal level.基于磁共振图像的椎间盘体积神经网络分割以及退变和脊柱节段的影响

JOR Spine. 2024 Sep 4;7(3):e70000. doi: 10.1002/jsp2.70000. eCollection 2024 Sep.

Deep Learning Prostate MRI Segmentation Accuracy and Robustness: A Systematic Review.深度学习前列腺 MRI 分割准确性和稳健性：系统评价。

Radiol Artif Intell. 2024 Jul;6(4):e230138. doi: 10.1148/ryai.230138.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

评价自动分割算法的参考标准：MRI 上手动勾画前列腺轮廓的观察者间变异性的量化。

Reference standard for the evaluation of automatic segmentation algorithms: Quantification of inter observer variability of manual delineation of prostate contour on MRI.

机构信息

出版信息

PURPOSE

MATERIALS AND METHODS

RESULTS

CONCLUSION

目的

材料与方法

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献