通过深度学习对危及器官分割进行几何聚焦训练与评估。
Geometrically focused training and evaluation of organs-at-risk segmentation via deep learning.
作者信息
Ni Ruiyan, Chuk Elizabeth, Han Kathy, Croke Jennifer, Fyles Anthony, Lukovic Jelena, Milosevic Michael, Haibe-Kains Benjamin, Rink Alexandra
机构信息
Department of Medical Biophysics, University of Toronto, Toronto, Canada.
Princess Margaret Cancer Center, University Health Network, Toronto, Canada.
出版信息
Med Phys. 2025 Jul;52(7):e17840. doi: 10.1002/mp.17840. Epub 2025 Apr 25.
BACKGROUND
Deep learning methods are promising in automating segmentation of organs at risk (OARs) in radiotherapy. However, the lack of a geometric indicator for dosimetry accuracy remains to be a problem. This issue is particularly pronounced in specific radiotherapy treatments where only the proximity of structures to the radiotherapy target affects the dose planning. In cervical cancer high dose-rate (HDR) brachytherapy, treatment planning is motivated by limiting dose to the hottest 2 cubic centimeters (D2cm) of the OARs. Similarly, Ethos online adaptive radiotherapy system prioritizes only the closest target structures for adaptive plan generation.
PURPOSE
We propose a novel geometrically focused deep learning training method and evaluation metric, using cervical brachytherapy as a case study. A distance-penalized (DP) loss function was developed to focus attention on the near-to-target OAR regions. We also introduced and evaluated a novel geometric metric, weighted dice similarity coefficient (wDSC), correlated with OARs D2cm.
METHODS
A model was trained using a 3D U-Net architecture and 170 T2-weighted magnetic resonance (MR) images (56 patients) with clinical contours. The dataset was split into subsets at the patient level: 45 patients (150 scans) as the training set for five-fold cross-validation and 11 patients (20 scans) as the testing set. Another dataset from our institution, consisting of 35 MR scans from 22 cervical cancer patients, was used as an independent internal testing set. A distance map, emphasizing errors near high-risk clinical target volume (CTV), was used to penalize two commonly used loss functions, cross-entropy (CE) loss and DiceCE loss. The wDSC emphasizes the accuracy of OAR regions proximal to CTV by incorporating a weighted factor in the original vDSC. The Pearson correlation coefficient (r) was used to quantify the strength of the relationship between D2cm accuracy and six evaluation metrics (wDSC and five standard metrics). A physician rated and revised the auto-contours for the clinical acceptability tests.
RESULTS
The wDSC moderately correlated (r = -0.55) with D2cm accuracy, outperforming standard geometric metrics. Models using DP loss functions consistently yielded higher wDSCs compared to their respective non-DP counterparts. DP loss models also improved D2cm accuracy, indicating an enhanced accuracy in dosimetry. The clinical acceptability tests revealed that more than 94% of bladder and rectum contours and approximately half of the sigmoid and small bowel contours were clinically accepted.
CONCLUSION
We developed and evaluated a new geometric metric, wDSC, as a better indicator of D2cm accuracy, which has the potential to become a surrogate for dosimetric accuracy in cervical brachytherapy. The model with DP loss showed non-statistically significant improvements in geometric and dosimetric performance. This work also holds the potential to be used for precise OARs delineation in adaptive radiotherapy.
背景
深度学习方法在放疗中对危及器官(OARs)进行自动分割方面很有前景。然而,缺乏用于剂量测定准确性的几何指标仍然是一个问题。在特定的放射治疗中,这个问题尤为突出,即只有结构与放射治疗靶区的接近程度会影响剂量规划。在宫颈癌高剂量率(HDR)近距离放疗中,治疗计划的制定是为了将OARs中最热的2立方厘米(D2cm)的剂量限制在一定范围内。同样,Ethos在线自适应放疗系统在生成自适应计划时仅优先考虑最接近的靶区结构。
目的
我们提出了一种新颖的几何聚焦深度学习训练方法和评估指标,以宫颈癌近距离放疗为例进行研究。开发了一种距离惩罚(DP)损失函数,以将注意力集中在靠近靶区的OAR区域。我们还引入并评估了一种与OARs的D2cm相关的新颖几何指标,加权骰子相似系数(wDSC)。
方法
使用3D U-Net架构和170张带有临床轮廓的T2加权磁共振(MR)图像(56例患者)训练模型。数据集在患者层面被划分为子集:45例患者(150次扫描)作为五折交叉验证的训练集,11例患者(20次扫描)作为测试集。我们机构的另一个数据集,由22例宫颈癌患者的35次MR扫描组成,用作独立的内部测试集。使用强调高风险临床靶区(CTV)附近误差的距离图来惩罚两种常用的损失函数,交叉熵(CE)损失和DiceCE损失。wDSC通过在原始vDSC中纳入加权因子,强调了CTV附近OAR区域的准确性。使用Pearson相关系数(r)来量化D2cm准确性与六个评估指标(wDSC和五个标准指标)之间关系的强度。由医生对自动轮廓进行评分并修订,以进行临床可接受性测试。
结果
wDSC与D2cm准确性呈中度相关(r = -0.55),优于标准几何指标。与各自的非DP对应模型相比,使用DP损失函数的模型始终产生更高的wDSC。DP损失模型也提高了D2cm准确性,表明剂量测定的准确性有所提高。临床可接受性测试显示,超过94%的膀胱和直肠轮廓以及约一半的乙状结肠和小肠轮廓在临床上是可接受的。
结论
我们开发并评估了一种新的几何指标wDSC,作为D2cm准确性的更好指标,它有可能成为宫颈癌近距离放疗中剂量测定准确性的替代指标。具有DP损失的模型在几何和剂量测定性能方面显示出无统计学意义的改善。这项工作也有可能用于自适应放疗中精确的OARs勾画。