Ye Zezhong, Saraf Anurag, Ravipati Yashwanth, Hoebers Frank, Zha Yining, Zapaishchykova Anna, Likitlersuang Jirapat, Tishler Roy B, Schoenfeld Jonathan D, Margalit Danielle N, Haddad Robert I, Mak Raymond H, Naser Mohamed, Wahid Kareem A, Sahlsten Jaakko, Jaskari Joel, Kaski Kimmo, Mäkitie Antti A, Fuller Clifton D, Aerts Hugo J W L, Kann Benjamin H
Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, United States.
Department of Radiation Oncology, Dana-Farber Cancer Institute and Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States.
medRxiv. 2023 Mar 6:2023.03.01.23286638. doi: 10.1101/2023.03.01.23286638.
Sarcopenia is an established prognostic factor in patients diagnosed with head and neck squamous cell carcinoma (HNSCC). The quantification of sarcopenia assessed by imaging is typically achieved through the skeletal muscle index (SMI), which can be derived from cervical neck skeletal muscle (SM) segmentation and cross-sectional area. However, manual SM segmentation is labor-intensive, prone to inter-observer variability, and impractical for large-scale clinical use. To overcome this challenge, we have developed and externally validated a fully-automated image-based deep learning (DL) platform for cervical vertebral SM segmentation and SMI calculation, and evaluated the relevance of this with survival and toxicity outcomes.
899 patients diagnosed as having HNSCC with CT scans from multiple institutes were included, with 335 cases utilized for training, 96 for validation, 48 for internal testing and 393 for external testing. Ground truth single-slice segmentations of SM at the C3 vertebra level were manually generated by experienced radiation oncologists. To develop an efficient method of segmenting the SM, a multi-stage DL pipeline was implemented, consisting of a 2D convolutional neural network (CNN) to select the middle slice of C3 section and a 2D U-Net to segment SM areas. The model performance was evaluated using the Dice Similarity Coefficient (DSC) as the primary metric for the internal test set, and for the external test set the quality of automated segmentation was assessed manually by two experienced radiation oncologists. The L3 skeletal muscle area (SMA) and SMI were then calculated from the C3 cross sectional area (CSA) of the auto-segmented SM. Finally, established SMI cut-offs were used to perform further analyses to assess the correlation with survival and toxicity endpoints in the external institution with univariable and multivariable Cox regression.
DSCs for validation set (n = 96) and internal test set (n = 48) were 0.90 (95% CI: 0.90 - 0.91) and 0.90 (95% CI: 0.89 - 0.91), respectively. The predicted CSA is highly correlated with the ground-truth CSA in both validation (r = 0.99, < 0.0001) and test sets (r = 0.96, < 0.0001). In the external test set (n = 377), 96.2% of the SM segmentations were deemed acceptable by consensus expert review. Predicted SMA and SMI values were highly correlated with the ground-truth values, with Pearson r β 0.99 (p < 0.0001) for both the female and male patients in all datasets. Sarcopenia was associated with worse OS (HR 2.05 [95% CI 1.04 - 4.04], p = 0.04) and longer PEG tube duration (median 162 days vs. 134 days, HR 1.51 [95% CI 1.12 - 2.08], p = 0.006 in multivariate analysis.
We developed and externally validated a fully-automated platform that strongly correlates with imaging-assessed sarcopenia in patients with H&N cancer that correlates with survival and toxicity outcomes. This study constitutes a significant stride towards the integration of sarcopenia assessment into decision-making for individuals diagnosed with HNSCC.
In this study, we developed and externally validated a deep learning model to investigate the impact of sarcopenia, defined as the loss of skeletal muscle mass, on patients with head and neck squamous cell carcinoma (HNSCC) undergoing radiotherapy. We demonstrated an efficient, fullyautomated deep learning pipeline that can accurately segment C3 skeletal muscle area, calculate cross-sectional area, and derive a skeletal muscle index to diagnose sarcopenia from a standard of care CT scan. In multi-institutional data, we found that pre-treatment sarcopenia was associated with significantly reduced overall survival and an increased risk of adverse events. Given the increased vulnerability of patients with HNSCC, the assessment of sarcopenia prior to radiotherapy may aid in informed treatment decision-making and serve as a predictive marker for the necessity of early supportive measures.
肌肉减少症是诊断为头颈部鳞状细胞癌(HNSCC)患者的既定预后因素。通过成像评估的肌肉减少症量化通常通过骨骼肌指数(SMI)来实现,该指数可从颈部骨骼肌(SM)分割和横截面积得出。然而,手动SM分割劳动强度大,容易出现观察者间差异,且不适用于大规模临床应用。为克服这一挑战,我们开发并外部验证了一个基于图像的全自动深度学习(DL)平台,用于颈椎SM分割和SMI计算,并评估其与生存和毒性结果的相关性。
纳入899例经多机构CT扫描诊断为HNSCC的患者,其中335例用于训练,96例用于验证,48例用于内部测试,393例用于外部测试。由经验丰富的放射肿瘤学家手动生成C3椎体水平SM的真实单切片分割。为开发一种有效的SM分割方法,实施了一个多阶段DL管道,包括一个二维卷积神经网络(CNN)以选择C3节段的中间切片和一个二维U-Net以分割SM区域。使用骰子相似系数(DSC)作为内部测试集的主要指标评估模型性能,对于外部测试集,由两名经验丰富的放射肿瘤学家手动评估自动分割的质量。然后根据自动分割的SM的C3横截面积(CSA)计算L3骨骼肌面积(SMA)和SMI。最后,使用既定的SMI临界值进行进一步分析,以通过单变量和多变量Cox回归评估与外部机构生存和毒性终点的相关性。
验证集(n = 96)和内部测试集(n = 48)的DSC分别为0.90(95% CI:0.90 - 0.91)和0.90(95% CI:0.89 - 0.91)。预测的CSA在验证集(r = 0.99,< 0.0001)和测试集(r = 0.96,< 0.0001)中均与真实CSA高度相关。在外部测试集(n = 377)中,96.2%的SM分割经专家共识审查认为可接受。预测的SMA和SMI值与真实值高度相关,在所有数据集中,女性和男性患者的Pearson r均为β 0.99(p < 0.0001)。肌肉减少症与较差的总生存期(HR 2.05 [95% CI 1.04 - 4.04],p = 0.04)和更长的PEG管使用时间相关(中位数162天对134天,多变量分析中HR 1.51 [95% CI 1.12 - 2.08],p = 0.006)。
我们开发并外部验证了一个全自动平台,该平台与头颈部癌症患者成像评估的肌肉减少症密切相关,且与生存和毒性结果相关。本研究朝着将肌肉减少症评估纳入HNSCC诊断个体的决策制定迈出了重要一步。
在本研究中,我们开发并外部验证了一个深度学习模型,以研究定义为骨骼肌质量丧失的肌肉减少症对接受放疗的头颈部鳞状细胞癌(HNSCC)患者的影响。我们展示了一种高效、全自动的深度学习管道,该管道可以准确分割C3骨骼肌面积,计算横截面积,并从标准护理CT扫描中得出骨骼肌指数以诊断肌肉减少症。在多机构数据中,我们发现治疗前肌肉减少症与总体生存率显著降低和不良事件风险增加相关。鉴于HNSCC患者的脆弱性增加,放疗前肌肉减少症的评估可能有助于做出明智的治疗决策,并作为早期支持措施必要性的预测标志物。