Kim Haksoo, Monroe James I, Lo Simon, Yao Min, Harari Paul M, Machtay Mitchell, Sohn Jason W
Proton Therapy Center, National Cancer Center, 323 Ilsan-ro, Ilsandong-gu, Goyang-si, Gyeonggi-do 410-769, South Korea.
Department of Radiation Oncology, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, Ohio 44106 and St. Anthony's Medical Center, 10010 Kennerly Road, St. Louis, Missouri 63128.
Med Phys. 2015 Jun;42(6):3013-23. doi: 10.1118/1.4921067.
A quantitative and objective metric, the medical similarity index (MSI), has been developed for evaluating the accuracy of a medical image segmentation relative to a reference segmentation. The MSI uses the medical consideration function (MCF) as its basis.
Currently, no indices provide quantitative evaluations of segmentation accuracy with medical considerations. Variations in segmentation can occur due to individual skill levels and medical relevance--curable or palliative intent, boundary uncertainty due to volume averaging, contrast levels, spatial resolution, and unresolved motion all affect the accuracy of a patient segmentation. Current accuracy measuring indices are not medically relevant. For example, undercontouring the tumor volume is not differentiated from overcontouring tumor. Dice similarity coefficient (DSC) and Hausdorff distance (HD) are two similarity measures often used. However, these metrics consider only geometric difference without considering medical implications. Two segments (under- vs overcontouring tumor) with similar DSC and HD measures could produce significantly different medical treatment results. The authors are proposing a MSI involving a user-defined MCF derived from an asymmetric Gaussian function. The shape of the MCF can be determined by a user, reflecting the anatomical location and characteristics of a particular tissue, organ, or tumor type. The peak of MCF is set along the reference contour; the inner and outer slopes are selected by the user. The discrepancy between the test and reference contours is calculated at each pixel by using a bidirectional local distance measure. The MCF value corresponding to that distance is summed and averaged to produce the MSI. Synthetic segmentations and clinical data from a 15 multi-institutional trial for a head-and-neck case are scored and compared by using MSI, DSC, and Hausdorff distance.
The MSI was shown to reflect medical considerations through the choice of MCF penalties for under- and overcontouring. Existing similarity scores were either insensitive to medical realities or simply inaccurate.
The medical similarity index, a segmentation evaluation metric based on medical considerations, has been proposed, developed, and tested to incorporate clinically relevant considerations beyond geometric parameters alone.
已开发出一种定量且客观的指标——医学相似性指数(MSI),用于评估医学图像分割相对于参考分割的准确性。MSI 以医学考量函数(MCF)为基础。
目前,尚无指标能从医学角度对分割准确性进行定量评估。分割的差异可能源于个人技术水平以及医学相关性——治愈或姑息意图、由于体积平均导致的边界不确定性、对比度水平、空间分辨率以及未解决的运动等都会影响患者分割的准确性。当前的准确性测量指标与医学无关。例如,肿瘤体积勾画不足与勾画过度并无区分。骰子相似系数(DSC)和豪斯多夫距离(HD)是常用的两种相似性度量。然而,这些指标仅考虑几何差异,未考虑医学意义。具有相似 DSC 和 HD 度量的两个分割(肿瘤勾画不足与过度)可能会产生显著不同的治疗结果。作者提出了一种涉及从非对称高斯函数导出的用户定义 MCF 的 MSI。MCF 的形状可由用户确定,反映特定组织、器官或肿瘤类型的解剖位置和特征。MCF 的峰值沿参考轮廓设置;内斜率和外斜率由用户选择。通过使用双向局部距离度量在每个像素处计算测试轮廓与参考轮廓之间的差异。将对应于该距离的 MCF 值求和并求平均值以产生 MSI。使用 MSI、DSC 和豪斯多夫距离对来自一项针对头颈病例的 15 个多机构试验的合成分割和临床数据进行评分和比较。
通过对勾画不足和过度的 MCF 惩罚选择,MSI 显示出能反映医学考量。现有的相似性分数要么对医学现实不敏感,要么根本不准确。
已提出、开发并测试了医学相似性指数,这是一种基于医学考量的分割评估指标,以纳入超越单纯几何参数的临床相关考量。