• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

血浆拷贝数变异作为使用极端梯度提升 (XGBoost) 分类器进行肺癌预测的工具。

Copy number variation in plasma as a tool for lung cancer prediction using Extreme Gradient Boosting (XGBoost) classifier.

机构信息

Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China.

Ping An Health Technology, Beijing, China.

出版信息

Thorac Cancer. 2020 Jan;11(1):95-102. doi: 10.1111/1759-7714.13204. Epub 2019 Nov 6.

DOI:10.1111/1759-7714.13204
PMID:31694073
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6938748/
Abstract

BACKGROUND

The main cause of cancer death is lung cancer (LC) which usually presents at an advanced stage, but its early detection would increase the benefits of treatment. Blood is particularly favored in clinical research given the possibility of using it for relatively noninvasive analyses. Copy number variation (CNV) is a common genetic change in tumor genomes, and many studies have indicated that CNV-derived cell-free DNA (cfDNA) from plasma could be feasible as a biomarker for cancer diagnosis.

METHODS

In this study, we determined the possibility of using chromosomal arm-level CNV from cfDNA as a biomarker for lung cancer diagnosis in a small cohort of 40 patients and 41 healthy controls. Arm-level CNV distributions were analyzed based on z score, and the machine-learning algorithm Extreme Gradient Boosting (XGBoost) was applied for cancer prediction.

RESULTS

The results showed that amplifications tended to emerge on chromosomes 3q, 8q, 12p, and 7q. Deletions were frequently detected on chromosomes 22q, 3p, 5q, 16q, 10q, and 15q. Upon applying a trained XGBoost classifier, specificity and sensitivity of 100% were finally achieved in the test group (12 patients and 13 healthy controls). In addition, five-fold cross-validation proved the stability of the model. Finally, our results suggested that the integration of four arm-level CNVs and the concentration of cfDNA into the trained XGBoost classifier provides a potential method for detecting lung cancer.

CONCLUSION

Our results suggested that the integration of four arm-level CNVs and the concentration from of cfDNA integrated withinto the trained XGBoost classifier could become provides a potentially method for detecting lung cancer detection.

KEY POINTS

Significant findings of the study: Healthy individuals have different arm-level CNV profiles from cancer patients. Amplifications tend to emerge on chromosome 3q, 8q, 12p, 7q and deletions tend to emerge on chromosome 22q, 3p, 5q, 16q, 10q, 15q.

WHAT THIS STUDY ADDS

CfDNA concentration, arm 10q, 3q, 8q, 3p, and 22q are key features for prediction. Trained XGBoost classifier is a potential method for lung cancer detection.

摘要

背景

肺癌(LC)是癌症死亡的主要原因,通常在晚期出现,但早期发现可以提高治疗效果。由于血液分析相对无创,因此在临床研究中特别受欢迎。拷贝数变异(CNV)是肿瘤基因组中的常见遗传变化,许多研究表明,来自血浆的 CNV 衍生的无细胞 DNA(cfDNA)可以作为癌症诊断的生物标志物。

方法

在这项研究中,我们在 40 名患者和 41 名健康对照者的小队列中,确定了使用 cfDNA 中的染色体臂级 CNV 作为肺癌诊断的生物标志物的可能性。基于 z 分数分析了臂级 CNV 分布,并应用极端梯度增强(XGBoost)算法进行癌症预测。

结果

结果表明,扩增倾向于出现在染色体 3q、8q、12p 和 7q 上。缺失经常出现在染色体 22q、3p、5q、16q、10q 和 15q 上。应用训练好的 XGBoost 分类器后,最终在测试组(12 名患者和 13 名健康对照者)中达到了 100%的特异性和敏感性。此外,五倍交叉验证证明了该模型的稳定性。最后,我们的结果表明,将四个臂级 CNV 和 cfDNA 浓度整合到训练好的 XGBoost 分类器中,为检测肺癌提供了一种潜在的方法。

结论

我们的结果表明,将四个臂级 CNV 和 cfDNA 浓度整合到训练好的 XGBoost 分类器中,可以为检测肺癌提供一种潜在的方法。

关键点

本研究的重要发现:健康个体与癌症患者的臂级 CNV 谱不同。扩增倾向于出现在染色体 3q、8q、12p、7q 上,缺失倾向于出现在染色体 22q、3p、5q、16q、10q、15q 上。

本研究的新增内容

cfDNA 浓度、臂 10q、3q、8q、3p 和 22q 是预测的关键特征。训练好的 XGBoost 分类器是一种潜在的肺癌检测方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/9433429e2923/TCA-11-95-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/3ab387d0762a/TCA-11-95-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/929c777a22af/TCA-11-95-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/35eda99efefe/TCA-11-95-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/9433429e2923/TCA-11-95-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/3ab387d0762a/TCA-11-95-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/929c777a22af/TCA-11-95-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/35eda99efefe/TCA-11-95-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e612/6938748/9433429e2923/TCA-11-95-g004.jpg

相似文献

1
Copy number variation in plasma as a tool for lung cancer prediction using Extreme Gradient Boosting (XGBoost) classifier.血浆拷贝数变异作为使用极端梯度提升 (XGBoost) 分类器进行肺癌预测的工具。
Thorac Cancer. 2020 Jan;11(1):95-102. doi: 10.1111/1759-7714.13204. Epub 2019 Nov 6.
2
Cell-free DNA copy number variations in plasma from colorectal cancer patients.结直肠癌患者血浆中的游离DNA拷贝数变异
Mol Oncol. 2017 Aug;11(8):1099-1111. doi: 10.1002/1878-0261.12077. Epub 2017 Jun 6.
3
A Novel Method to Detect Early Colorectal Cancer Based on Chromosome Copy Number Variation in Plasma.一种基于血浆中染色体拷贝数变异检测早期结直肠癌的新方法。
Cell Physiol Biochem. 2018;45(4):1444-1454. doi: 10.1159/000487571. Epub 2018 Feb 16.
4
Genomic variations in plasma cell free DNA differentiate early stage lung cancers from normal controls.血浆游离 DNA 中的基因组变异可区分早期肺癌与正常对照。
Lung Cancer. 2015 Oct;90(1):78-84. doi: 10.1016/j.lungcan.2015.07.002. Epub 2015 Jul 15.
5
Low-pass Whole-genome Sequencing of Circulating Cell-free DNA Demonstrates Dynamic Changes in Genomic Copy Number in a Squamous Lung Cancer Clinical Cohort.循环游离 DNA 的低深度全基因组测序显示在一个鳞状肺癌临床队列中基因组拷贝数的动态变化。
Clin Cancer Res. 2019 Apr 1;25(7):2254-2263. doi: 10.1158/1078-0432.CCR-18-1593. Epub 2019 Jan 7.
6
Cell-free DNA copy number variations predict efficacy of immune checkpoint inhibitor-based therapy in hepatobiliary cancers.无细胞游离 DNA 拷贝数变异可预测免疫检查点抑制剂治疗在肝胆肿瘤中的疗效。
J Immunother Cancer. 2021 May;9(5). doi: 10.1136/jitc-2020-001942.
7
Multiple-level copy number variations in cell-free DNA for prognostic prediction of HCC with radical treatments.游离 DNA 中的多层次拷贝数变异用于预测 HCC 根治性治疗的预后。
Cancer Sci. 2021 Nov;112(11):4772-4784. doi: 10.1111/cas.15128. Epub 2021 Sep 14.
8
Evaluation and integration of cell-free DNA signatures for detection of lung cancer.游离 DNA 特征评估与整合用于肺癌检测。
Cancer Lett. 2024 Nov 1;604:217216. doi: 10.1016/j.canlet.2024.217216. Epub 2024 Sep 2.
9
Molecular features of tumor-derived genetic alterations in circulating cell-free DNA in virtue of autopsy analysis.基于尸检分析的循环无细胞 DNA 中肿瘤来源遗传改变的分子特征。
Sci Rep. 2021 Apr 16;11(1):8398. doi: 10.1038/s41598-021-87094-1.
10
A feasibility study of colorectal cancer diagnosis via circulating tumor DNA derived CNV detection.循环肿瘤 DNA 拷贝数变异检测用于结直肠癌诊断的可行性研究。
PLoS One. 2018 May 23;13(5):e0196826. doi: 10.1371/journal.pone.0196826. eCollection 2018.

引用本文的文献

1
Development of a Serum Proteomic-Based Diagnostic Model for Lung Cancer Using Machine Learning Algorithms and Unveiling the Role of SLC16A4 in Tumor Progression and Immune Response.利用机器学习算法开发基于血清蛋白质组学的肺癌诊断模型并揭示SLC16A4在肿瘤进展和免疫反应中的作用
Biomolecules. 2025 Jul 26;15(8):1081. doi: 10.3390/biom15081081.
2
Prediction of T Stage of Rectal Cancer After Neoadjuvant Therapy by Multi-Parameter Magnetic Resonance Radiomics Based on Machine Learning Algorithms.基于机器学习算法的多参数磁共振影像组学预测直肠癌新辅助治疗后的T分期
Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241305463. doi: 10.1177/15330338241305463.
3

本文引用的文献

1
Genome-wide cell-free DNA fragmentation in patients with cancer.癌症患者的全基因组游离 DNA 片段化。
Nature. 2019 Jun;570(7761):385-389. doi: 10.1038/s41586-019-1272-6. Epub 2019 May 29.
2
Annual Report to the Nation on the Status of Cancer, part I: National cancer statistics.《国家癌症报告:第一部分:全国癌症统计数据》
Cancer. 2018 Jul 1;124(13):2785-2800. doi: 10.1002/cncr.31551. Epub 2018 May 22.
3
Genomic alterations of plasma cell-free DNAs in small cell lung cancer and their clinical relevance.小细胞肺癌血浆游离 DNA 的基因组改变及其临床意义。
A stacking ensemble model for predicting the occurrence of carotid atherosclerosis.
用于预测颈动脉粥样硬化发生的堆叠集成模型。
Front Endocrinol (Lausanne). 2024 Jul 23;15:1390352. doi: 10.3389/fendo.2024.1390352. eCollection 2024.
4
Innovation through Artificial Intelligence in Triage Systems for Resource Optimization in Future Pandemics.通过人工智能优化分诊系统以在未来大流行中实现资源优化创新。
Biomimetics (Basel). 2024 Jul 18;9(7):440. doi: 10.3390/biomimetics9070440.
5
A privacy-preserving platform oriented medical healthcare and its application in identifying patients with candidemia.面向医疗保健的隐私保护平台及其在鉴定念珠菌血症患者中的应用。
Sci Rep. 2024 Jul 6;14(1):15589. doi: 10.1038/s41598-024-66596-8.
6
Using machine-learning models to predict extubation failure in neonates with bronchopulmonary dysplasia.使用机器学习模型预测患有支气管肺发育不良的新生儿拔管失败。
BMC Pulm Med. 2024 Jul 1;24(1):308. doi: 10.1186/s12890-024-03133-3.
7
Multi-Omics Integration for Liver Cancer Using Regression Analysis.使用回归分析对肝癌进行多组学整合
Curr Issues Mol Biol. 2024 Apr 19;46(4):3551-3562. doi: 10.3390/cimb46040222.
8
Generation of synthetic whole-slide image tiles of tumours from RNA-sequencing data via cascaded diffusion models.通过级联扩散模型从RNA测序数据生成肿瘤的合成全切片图像块
Nat Biomed Eng. 2025 Mar;9(3):320-332. doi: 10.1038/s41551-024-01193-8. Epub 2024 Mar 21.
9
Construction of a predictive model for bone metastasis from first primary lung adenocarcinoma within 3 cm based on machine learning algorithm: a retrospective study.基于机器学习算法构建的首个 3 cm 内原发性肺腺癌骨转移预测模型:一项回顾性研究。
PeerJ. 2024 Mar 14;12:e17098. doi: 10.7717/peerj.17098. eCollection 2024.
10
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
Lung Cancer. 2018 Jun;120:113-121. doi: 10.1016/j.lungcan.2018.04.008. Epub 2018 Apr 12.
4
Circulating tumor DNA 5-hydroxymethylcytosine as a novel diagnostic biomarker for esophageal cancer.循环肿瘤DNA 5-羟甲基胞嘧啶作为食管癌的一种新型诊断生物标志物。
Cell Res. 2018 May;28(5):597-600. doi: 10.1038/s41422-018-0014-x. Epub 2018 Feb 21.
5
Predicting non-melanoma skin cancer via a multi-parameterized artificial neural network.通过多参数人工神经网络预测非黑色素瘤皮肤癌。
Sci Rep. 2018 Jan 26;8(1):1701. doi: 10.1038/s41598-018-19907-9.
6
Determinants and clinical implications of chromosomal instability in cancer.癌症中染色体不稳定性的决定因素及其临床意义。
Nat Rev Clin Oncol. 2018 Mar;15(3):139-150. doi: 10.1038/nrclinonc.2017.198. Epub 2018 Jan 3.
7
Risk factors of Lung Cancer in nonsmoker.非吸烟者患肺癌的风险因素。
Curr Probl Cancer. 2017 Sep-Oct;41(5):328-339. doi: 10.1016/j.currproblcancer.2017.07.002. Epub 2017 Jul 27.
8
Utility of Genomic Assessment of Blood-Derived Circulating Tumor DNA (ctDNA) in Patients with Advanced Lung Adenocarcinoma.血液衍生循环肿瘤 DNA(ctDNA)的基因组评估在晚期肺腺癌患者中的效用。
Clin Cancer Res. 2017 Sep 1;23(17):5101-5111. doi: 10.1158/1078-0432.CCR-16-2497. Epub 2017 May 24.
9
Circulating-tumor DNA as an early detection and diagnostic tool.循环肿瘤DNA作为一种早期检测和诊断工具。
Curr Opin Genet Dev. 2017 Feb;42:14-21. doi: 10.1016/j.gde.2016.12.003. Epub 2017 Jan 23.
10
Integration of Pathway Knowledge and Dynamic Bayesian Networks for the Prediction of Oral Cancer Recurrence.整合通路知识与动态贝叶斯网络用于口腔癌复发预测
IEEE J Biomed Health Inform. 2017 Mar;21(2):320-327. doi: 10.1109/JBHI.2016.2636448. Epub 2016 Dec 7.