从 STAPLE 评估专家分割性能的推断不确定性估计。

Estimation of inferential uncertainty in assessing expert segmentation performance from STAPLE.

机构信息

Computational Radiology Laboratory, Department of Radiology, Children's Hospital, Boston, MA 02115, USA.

出版信息

IEEE Trans Med Imaging. 2010 Mar;29(3):771-80. doi: 10.1109/TMI.2009.2036011.

DOI:10.1109/TMI.2009.2036011

PMID:20199913

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3183509/

Abstract

The evaluation of the quality of segmentations of an image, and the assessment of intra- and inter-expert variability in segmentation performance, has long been recognized as a difficult task. For a segmentation validation task, it may be effective to compare the results of an automatic segmentation algorithm to multiple expert segmentations. Recently an expectation-maximization (EM) algorithm for simultaneous truth and performance level estimation (STAPLE) was developed to this end to compute both an estimate of the reference standard segmentation and performance parameters from a set of segmentations of an image. The performance is characterized by the rate of detection of each segmentation label by each expert in comparison to the estimated reference standard. This previous work provides estimates of performance parameters,but does not provide any information regarding the uncertainty of the estimated values. An estimate of this inferential uncertainty, if available, would allow the estimation of confidence intervals for the values of the parameters. This would facilitate the interpretation of the performance of segmentation generators and help determine if sufficient data size and number of segmentations have been obtained to precisely characterize the performance parameters. We present a new algorithm to estimate the inferential uncertainty of the performance parameters for binary and multi-category segmentations. It is derived for the special case of the STAPLE algorithm based on established theory for general purpose covariance matrix estimation for EM algorithms. The bounds on the performance parameters are estimated by the computation of the observed information matrix.We use this algorithm to study the bounds on performance parameters estimates from simulated images with specified performance parameters, and from interactive segmentations of neonatal brain MRIs. We demonstrate that confidence intervals for expert segmentation performance parameters can be estimated with our algorithm. We investigate the influence of the number of experts and of the segmented data size on these bounds, showing that it is possible to determine the number of image segmentations and the size of images necessary to achieve a chosen level of accuracy in segmentation performance assessment.

摘要

图像分割质量的评估以及分割性能的专家内和专家间可变性评估一直以来都是一项艰巨的任务。对于分割验证任务，将自动分割算法的结果与多个专家分割进行比较可能是有效的。最近，为了实现这一目标，开发了一种用于同时真实和性能水平估计（STAPLE）的期望最大化（EM）算法，以从一组图像分割中计算参考标准分割和性能参数的估计值。性能的特征是每个专家对每个分割标签的检测率与估计的参考标准进行比较。这项之前的工作提供了性能参数的估计值，但没有提供有关估计值不确定性的任何信息。如果有这样的推断不确定性的估计，就可以估计参数值的置信区间。这将有助于解释分割生成器的性能，并帮助确定是否获得了足够的数据大小和分割数量来精确地描述性能参数。我们提出了一种用于估计二进制和多类别分割性能参数推断不确定性的新算法。它是根据 EM 算法的一般协方差矩阵估计的既定理论，针对 STAPLE 算法的特殊情况推导出来的。通过计算观察信息矩阵，可以估计性能参数的界限。我们使用该算法来研究模拟图像中指定性能参数的性能参数估计值的界限，以及新生儿脑 MRI 的交互式分割。我们证明了可以使用我们的算法估计专家分割性能参数的置信区间。我们研究了专家数量和分割数据大小对这些界限的影响，表明可以确定获得所需分割性能评估准确性的图像分割数量和图像大小。

相似文献

Estimation of inferential uncertainty in assessing expert segmentation performance from STAPLE.

IEEE Trans Med Imaging. 2010 Mar;29(3):771-80. doi: 10.1109/TMI.2009.2036011.

Estimation of inferential uncertainty in assessing expert segmentation performance from STAPLE.

Inf Process Med Imaging. 2009;21:701-12. doi: 10.1007/978-3-642-02498-6_58.

Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation.

IEEE Trans Med Imaging. 2004 Jul;23(7):903-21. doi: 10.1109/TMI.2004.828354.

Incorporating priors on expert performance parameters for segmentation validation and label fusion: a maximum a posteriori STAPLE.

Med Image Comput Comput Assist Interv. 2010;13(Pt 3):25-32. doi: 10.1007/978-3-642-15711-0_4.

Estimating a reference standard segmentation with spatially varying performance parameters: local MAP STAPLE.

IEEE Trans Med Imaging. 2012 Aug;31(8):1593-606. doi: 10.1109/TMI.2012.2197406. Epub 2012 May 2.

Simultaneous truth and performance level estimation through fusion of probabilistic segmentations.

IEEE Trans Med Imaging. 2013 Oct;32(10):1840-52. doi: 10.1109/TMI.2013.2266258. Epub 2013 Jun 4.

Performance-based classifier combination in atlas-based image segmentation using expectation-maximization parameter estimation.

IEEE Trans Med Imaging. 2004 Aug;23(8):983-94. doi: 10.1109/TMI.2004.830803.

Multi-atlas segmentation of the whole hippocampus and subfields using multiple automatically generated templates.

Neuroimage. 2014 Nov 1;101:494-512. doi: 10.1016/j.neuroimage.2014.04.054. Epub 2014 Apr 29.

Validation of image segmentation by estimating rater bias and variance.

Med Image Comput Comput Assist Interv. 2006;9(Pt 2):839-47. doi: 10.1007/11866763_103.

Validation of clinical acceptability of an atlas-based segmentation algorithm for the delineation of organs at risk in head and neck cancer.

Med Phys. 2015 Sep;42(9):5027-34. doi: 10.1118/1.4927567.

引用本文的文献

Two-dimensional segmentation fusion tool: an extensible, free-to-use, user-friendly tool for combining different bidimensional segmentations.

Front Bioeng Biotechnol. 2024 Jan 31;12:1339723. doi: 10.3389/fbioe.2024.1339723. eCollection 2024.

The effect of imaging modality (magnetic resonance imaging vs. computed tomography) and patient position (supine vs. prone) on target and organ at risk doses in partial breast irradiation.

J Med Radiat Sci. 2021 Jun;68(2):157-166. doi: 10.1002/jmrs.453. Epub 2020 Dec 7.

The impact of a radiologist-led workshop on MRI target volume delineation for radiotherapy.

J Med Radiat Sci. 2018 Dec;65(4):300-310. doi: 10.1002/jmrs.298. Epub 2018 Aug 3.

Multivariate Analyses Applied to Healthy Neurodevelopment in Fetal, Neonatal, and Pediatric MRI.

Front Neuroanat. 2016 Jan 21;9:163. doi: 10.3389/fnana.2015.00163. eCollection 2015.

Comparative performance evaluation of automated segmentation methods of hippocampus from magnetic resonance images of temporal lobe epilepsy patients.

Med Phys. 2016 Jan;43(1):538. doi: 10.1118/1.4938411.

Multiatlas segmentation as nonparametric regression.

IEEE Trans Med Imaging. 2014 Sep;33(9):1803-17. doi: 10.1109/TMI.2014.2321281. Epub 2014 Apr 30.

How Many Templates Does It Take for a Good Segmentation?: Error Analysis in Multiatlas Segmentation as a Function of Database Size.

Med Image Comput Comput Assist Interv. 2012;7509:103-114. doi: 10.1007/978-3-642-33530-3_9.

A collaborative resource to build consensus for automated left ventricular segmentation of cardiac MR images.

Med Image Anal. 2014 Jan;18(1):50-62. doi: 10.1016/j.media.2013.09.001. Epub 2013 Sep 13.

Validating retinal fundus image analysis algorithms: issues and a proposal.

Invest Ophthalmol Vis Sci. 2013 May 1;54(5):3546-59. doi: 10.1167/iovs.12-10347.

Estimating a reference standard segmentation with spatially varying performance parameters: local MAP STAPLE.

IEEE Trans Med Imaging. 2012 Aug;31(8):1593-606. doi: 10.1109/TMI.2012.2197406. Epub 2012 May 2.

本文引用的文献

Estimation of inferential uncertainty in assessing expert segmentation performance from STAPLE.

Inf Process Med Imaging. 2009;21:701-12. doi: 10.1007/978-3-642-02498-6_58.

Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation.

IEEE Trans Med Imaging. 2004 Jul;23(7):903-21. doi: 10.1109/TMI.2004.828354.

Statistical validation of image segmentation quality based on a spatial overlap index.

Acad Radiol. 2004 Feb;11(2):178-89. doi: 10.1016/s1076-6332(03)00671-8.

A methodology for evaluation of boundary detection algorithms on medical images.

IEEE Trans Med Imaging. 1997 Oct;16(5):642-52. doi: 10.1109/42.640755.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从 STAPLE 评估专家分割性能的推断不确定性估计。

Estimation of inferential uncertainty in assessing expert segmentation performance from STAPLE.

机构信息

Computational Radiology Laboratory, Department of Radiology, Children's Hospital, Boston, MA 02115, USA.

出版信息

IEEE Trans Med Imaging. 2010 Mar;29(3):771-80. doi: 10.1109/TMI.2009.2036011.

DOI:10.1109/TMI.2009.2036011

PMID:20199913

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3183509/

Abstract

摘要

从 STAPLE 评估专家分割性能的推断不确定性估计。

Estimation of inferential uncertainty in assessing expert segmentation performance from STAPLE.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

从 STAPLE 评估专家分割性能的推断不确定性估计。

Estimation of inferential uncertainty in assessing expert segmentation performance from STAPLE.

机构信息

出版信息