Wang Fen, Hao Qinmin, Jia Yizhen, Zhang Lincen, Yu Tongfu, Xu Hai, Yuan Mei
Department of Radiology, The Affiliated Huai'an No. 1 People's Hospital of Nanjing Medical University, Huai'an, China.
Department of Radiology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China.
J Thorac Dis. 2025 Aug 31;17(8):6030-6044. doi: 10.21037/jtd-2025-171. Epub 2025 Aug 28.
Lung cancer associated with the harbor oncogenic driver mutations or fusions, such as anaplastic lymphoma kinase (ALK), could benefit from representative clinical first-line treatment. And patients with ALK-positive (ALK+) show longer median survival than ALK-negative patients. This study aimed to develop a machine-learning model based on habitat analysis derived from computed tomography (CT) lung images to predict disease progression and explore its prognostic value for progression-free survival (PFS) in patients with (ALK+) lung cancer.
A total of 79 patients with ALK-positive lung cancer and advanced TNM stage (III-IV) at The First Affiliated Hospital of Nanjing Medical University between April 2016 and August 2022 were divided into training (n=63) and testing (n=16) cohorts. Radiomics features were extracted from CT images using a habitat-based method with clustering analysis. Subsequently, the performance of eight machine learning algorithms was compared to build a habitat-radiomics (rad_habitat) model, yielding a rad_habitat_score. Using regression, we developed both a clinical model and a combined model that integrated clinical variables with the rad_habitat_score. Subsequently, these models' performance were evaluated. Univariate and multivariate Cox regression analyses were performed on clinical variables and the rad_habitat_score.
Three distinct habitats were identified. Among the eight machine learning algorithms, the multilayer perceptron (MLP) model demonstrated superior performance in handling non-linearity, achieving a higher area under the curve (AUC) of 0.836 [95% confidence interval (CI): 0.628-1.000] in the testing cohort. The combined model (AUC =0.836, 95% CI: 0.628-1.000) exhibited a higher AUC than the clinical model but a similar AUC to the rad_habitat model in the testing set. Delong test revealed a significant difference in the AUC values between the clinical model and the combined model and between the clinical model and rad_habitat model in the training cohort (P=0.002 and P=0.007, respectively), while the AUC values of the other models were not statistically different. Cox regression analysis identified the rad_habitat_score as the only independent predictor of PFS (P<0.001) for patients with high-risk ALK+ lung cancer.
The habitat-radiomics approach, utilizing three regional habitats, shows promise for accurately and interpretably identifying disease progression in ALK+ lung cancer. The rad_habitat_score, when combined with clinical features, demonstrated improved predictive accuracy for prognostic stratification compared with the clinical model alone.
与致癌驱动基因突变或融合相关的肺癌,如间变性淋巴瘤激酶(ALK),可从具有代表性的临床一线治疗中获益。ALK阳性(ALK+)患者的中位生存期比ALK阴性患者更长。本研究旨在基于计算机断层扫描(CT)肺部图像的栖息地分析开发一种机器学习模型,以预测疾病进展,并探讨其对ALK+肺癌患者无进展生存期(PFS)的预后价值。
2016年4月至2022年8月期间,南京医科大学第一附属医院的79例ALK阳性肺癌且TNM分期为晚期(III-IV期)的患者被分为训练组(n = 63)和测试组(n = 16)。使用基于栖息地的聚类分析方法从CT图像中提取放射组学特征。随后,比较了八种机器学习算法的性能,以构建栖息地-放射组学(rad_habitat)模型,得出rad_habitat_score。通过回归分析,我们开发了一个临床模型和一个将临床变量与rad_habitat_score相结合的联合模型。随后,对这些模型的性能进行了评估。对临床变量和rad_habitat_score进行单因素和多因素Cox回归分析。
识别出三个不同的栖息地。在八种机器学习算法中,多层感知器(MLP)模型在处理非线性方面表现出卓越性能,在测试组中实现了更高的曲线下面积(AUC),为0.836 [95%置信区间(CI):0.628 - 1.000]。联合模型(AUC = 0.836,95% CI:0.628 - 1.000)在测试集中的AUC高于临床模型,但与rad_habitat模型的AUC相似。德龙检验显示,在训练组中,临床模型与联合模型之间以及临床模型与rad_habitat模型之间的AUC值存在显著差异(分别为P = 0.002和P = 0.007),而其他模型的AUC值无统计学差异。Cox回归分析确定rad_habitat_score是高危ALK+肺癌患者PFS的唯一独立预测因子(P < 0.001)。
利用三个区域栖息地特征的栖息地-放射组学方法,有望准确且可解释地识别ALK+肺癌的疾病进展。与单独的临床模型相比,rad_habitat_score与临床特征相结合时,在预后分层方面显示出更高的预测准确性。