文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Development and validation of machine learning models based on molecular features for estimating the probability of multiple primary lung carcinoma versus intrapulmonary metastasis in patients presenting multiple non-small cell lung cancers.

作者信息

Liu Ning, Li Xue, Luo Xu, Liu Bin, Tang Jie, Xiao Fei, Wang Weiya, Tang Yuan, Shu Pei, Zhang Benxia, Chen Yue, Qin Diyuan, Ma Qizhi, Guo Fuchun, Tang Xiaojun, Zhu Daxing, Mei Jiandong, Chen Weizhi, Li Dan, Jiang Lili, Wang Yongsheng

机构信息

Division of Thoracic Tumor Multimodality Treatment, Cancer Center, West China Hospital, Sichuan University, Chengdu, China.

Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu, China.

出版信息

Transl Lung Cancer Res. 2025 Apr 30;14(4):1118-1137. doi: 10.21037/tlcr-24-875. Epub 2025 Apr 25.


DOI:10.21037/tlcr-24-875
PMID:40386724
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12082235/
Abstract

BACKGROUND: Discrimination of multiple non-small cell lung cancers (NSCLCs) as multiple primary lung cancers (MPLCs) or intrapulmonary metastases (IPMs) is critical but remains challenging. The aim of this study is to develop and validate the machine learning (ML) models based on the molecular features for estimating the probability of MPLC or IPM for patients presenting multiple NSCLCs. METHODS: A total of 72 multiple NSCLCs patients with 157 surgical resection tumor lesions from January 2012 to January 2018 at two institutions were included for developing and testing models. Specifically, 46 patients with 103 tumors which were defined as definitive MPLC or IPM according to International Association for the Study of Lung Cancer (IASLC) criteria were used to develop models. They were spilt into training and validation sets using stratified random sampling and five-fold cross-validation. The developed models were tested in other 26 patients whose tumors were undetermined by traditional methods. Whole-exome sequencing (WES) was performed on all included tumor samples. Four molecular features were calculated to characterize tumors relatedness and served as model inputs, including genetic divergence, shared mutation number, Pearson correlation coefficient and early mutation number. Decision trees (DT), random forests (RF), and gradient boosting decision trees (GBDT) were employed, with performance assessed by areas under the curve (AUCs), accuracy, precision, recall, and F1 score in validation set. Disease-free survival (DFS) were used to evaluate model performance in test cohort. Clinical and genetic characteristics were then compared between MPLC and IPM populations. RESULTS: All of the four molecular features showed significant differences between MPLC and IPM patients in development cohort. That is, MPLC exhibited higher genetic divergence, lower shared mutation number, Pearson correlation and early mutation number than IPM (P<0.001). DT model, RF model and GBDT model were developed with these factors and achieved a mean AUC of 0.94 [standard deviation (SD) 0.09], 1.00 (SD 0.00) and 1.00 (SD 0.00) in validation set, respectively. DT model, RF model and GBDT model discriminated the undetermined multiple NSCLCs as MPLC (n=15) and IPM (n=11) consistently. MPLC identified by ML models had significantly prolonged DFS [hazard ratio =0.21; 95% confidence interval (CI): 0.04-1.0; P=0.04] than that of IPM. MPLC patients had a relative higher prevalence of family history of first-degree relatives with cancer, and more than half of these patients reported a family history of lung cancer. EGFR remains the most common mutated driver both in MPLC and IPM populations. CONCLUSIONS: ML models based on the molecular features effectively distcriminate primary tumors from metastases in multiple NSCLCs, which improve the accuracy of multiple NSCLCs diagnosis and assist in clinical decision-making, particularly in challenging cases.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/9c36ad42e609/tlcr-14-04-1118-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/83d7ad3ef7bb/tlcr-14-04-1118-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/b8051b425636/tlcr-14-04-1118-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/c3f6a57ae79e/tlcr-14-04-1118-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/45314f30ee88/tlcr-14-04-1118-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/afb6482611e0/tlcr-14-04-1118-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/1db25f6caade/tlcr-14-04-1118-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/9c36ad42e609/tlcr-14-04-1118-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/83d7ad3ef7bb/tlcr-14-04-1118-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/b8051b425636/tlcr-14-04-1118-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/c3f6a57ae79e/tlcr-14-04-1118-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/45314f30ee88/tlcr-14-04-1118-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/afb6482611e0/tlcr-14-04-1118-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/1db25f6caade/tlcr-14-04-1118-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be9c/12082235/9c36ad42e609/tlcr-14-04-1118-f7.jpg

相似文献

[1]
Development and validation of machine learning models based on molecular features for estimating the probability of multiple primary lung carcinoma versus intrapulmonary metastasis in patients presenting multiple non-small cell lung cancers.

Transl Lung Cancer Res. 2025-4-30

[2]
Using molecular characteristics to distinguish multiple primary lung cancers and intrapulmonary metastases.

PeerJ. 2024-1-31

[3]
Establishment of Criteria for Molecular Differential Diagnosis of MPLC and IPM.

Front Oncol. 2021-1-21

[4]
Genetic and immune characteristics of multiple primary lung cancers and lung metastases.

Thorac Cancer. 2021-10

[5]
A comprehensive algorithm to distinguish between MPLC and IPM in multiple lung tumors patients.

Ann Transl Med. 2020-9

[6]
CXCL1 and CXCL8: Reliable and feasible biomarkers differentiating intrapulmonary metastasis from multiple primary neoplasms in non-small cell lung cancers.

Cancer Biomark. 2025-4

[7]
Classification of multiple primary lung cancer in patients with multifocal lung cancer: assessment of a machine learning approach using multidimensional genomic data.

Front Oncol. 2024-5-3

[8]
A pairwise radiomics algorithm-lesion pair relation estimation model for distinguishing multiple primary lung cancer from intrapulmonary metastasis.

Precis Clin Med. 2023-10-30

[9]
Optimizing the NGS-based discrimination of multiple lung cancers from the perspective of evolution.

NPJ Precis Oncol. 2025-1-14

[10]
Genetic features and application value of next generation sequencing in the diagnosis of synchronous multifocal lung adenocarcinoma.

Oncol Lett. 2020-9

引用本文的文献

[1]
Cost-Efficient Early Diagnostic Tool for Lung Cancer: Explainable AI in Clinical Systems.

Technol Cancer Res Treat. 2025

[2]
Cytogenetics and Cytogenomics in Clinical Diagnostics: Genome Architecture, Structural Variants, and Translational Applications.

Genes (Basel). 2025-6-30

本文引用的文献

[1]
Integration of clinical, pathological, radiological, and transcriptomic data improves prediction for first-line immunotherapy outcome in metastatic non-small cell lung cancer.

Nat Commun. 2025-1-12

[2]
Machine Learning and Computed Tomography Radiomics to Predict Disease Progression to Upfront Pembrolizumab Monotherapy in Advanced Non-Small-Cell Lung Cancer: A Pilot Study.

Cancers (Basel). 2024-12-28

[3]
Predicting Postoperative Lung Cancer Recurrence and Survival Using Cox Proportional Hazards Regression and Machine Learning.

Cancers (Basel). 2024-12-26

[4]
Deep learning using histological images for gene mutation prediction in lung cancer: a multicentre retrospective study.

Lancet Oncol. 2025-1

[5]
Differentiating Separate Primary Lung Adenocarcinomas From Intrapulmonary Metastases With Emphasis on Pathological and Molecular Considerations: Recommendations From the International Association for the Study of Lung Cancer Pathology Committee.

J Thorac Oncol. 2025-3

[6]
Deep generative AI models analyzing circulating orphan non-coding RNAs enable detection of early-stage lung cancer.

Nat Commun. 2024-11-21

[7]
Transformer-based deep learning model for the diagnosis of suspected lung cancer in primary care based on electronic health record data.

EBioMedicine. 2024-12

[8]
Learning from Imbalanced Data: Integration of Advanced Resampling Techniques and Machine Learning Models for Enhanced Cancer Diagnosis and Prognosis.

Cancers (Basel). 2024-10-8

[9]
Interpretable Machine Learning-Aided Optical Deciphering of Serum Exosomes for Early Detection, Staging, and Subtyping of Lung Cancer.

Anal Chem. 2024-10-15

[10]
AI for Multistructure Incidental Findings and Mortality Prediction at Chest CT in Lung Cancer Screening.

Radiology. 2024-9

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索