• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用甲基化相关模块和集成机器学习分类器预测肺腺癌疾病进展

Predicting lung adenocarcinoma disease progression using methylation-correlated blocks and ensemble machine learning classifiers.

作者信息

Yu Xin, Yang Qian, Wang Dong, Li Zhaoyang, Chen Nianhang, Kong De-Xin

机构信息

State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, Hubei, China.

Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei, China.

出版信息

PeerJ. 2021 Feb 16;9:e10884. doi: 10.7717/peerj.10884. eCollection 2021.

DOI:10.7717/peerj.10884
PMID:33628643
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7894106/
Abstract

Applying the knowledge that methyltransferases and demethylases can modify adjacent cytosine-phosphorothioate-guanine (CpG) sites in the same DNA strand, we found that combining multiple CpGs into a single block may improve cancer diagnosis. However, survival prediction remains a challenge. In this study, we developed a pipeline named "stacked ensemble of machine learning models for methylation-correlated blocks" (EnMCB) that combined Cox regression, support vector regression (SVR), and elastic-net models to construct signatures based on DNA methylation-correlated blocks for lung adenocarcinoma (LUAD) survival prediction. We used methylation profiles from the Cancer Genome Atlas (TCGA) as the training set, and profiles from the Gene Expression Omnibus (GEO) as validation and testing sets. First, we partitioned the genome into blocks of tightly co-methylated CpG sites, which we termed methylation-correlated blocks (MCBs). After partitioning and feature selection, we observed different diagnostic capacities for predicting patient survival across the models. We combined the multiple models into a single stacking ensemble model. The stacking ensemble model based on the top-ranked block had the area under the receiver operating characteristic curve of 0.622 in the TCGA training set, 0.773 in the validation set, and 0.698 in the testing set. When stratified by clinicopathological risk factors, the risk score predicted by the top-ranked MCB was an independent prognostic factor. Our results showed that our pipeline was a reliable tool that may facilitate MCB selection and survival prediction.

摘要

基于甲基转移酶和去甲基酶可修饰同一DNA链中相邻的胞嘧啶-硫代磷酸酯-鸟嘌呤(CpG)位点这一知识,我们发现将多个CpG组合成一个单一模块可能会改善癌症诊断。然而,生存预测仍然是一项挑战。在本研究中,我们开发了一种名为“用于甲基化相关模块的机器学习模型堆叠集成”(EnMCB)的流程,该流程结合了Cox回归、支持向量回归(SVR)和弹性网络模型,以基于DNA甲基化相关模块构建特征,用于肺腺癌(LUAD)生存预测。我们将来自癌症基因组图谱(TCGA)的甲基化谱作为训练集,将来自基因表达综合数据库(GEO)的谱作为验证集和测试集。首先,我们将基因组划分为紧密共甲基化的CpG位点模块,我们将其称为甲基化相关模块(MCB)。在划分和特征选择后,我们观察到不同模型在预测患者生存方面具有不同的诊断能力。我们将多个模型组合成一个单一的堆叠集成模型。基于排名靠前模块的堆叠集成模型在TCGA训练集中的受试者操作特征曲线下面积为0.622,在验证集中为0.773,在测试集中为0.698。当按临床病理风险因素分层时,排名靠前的MCB预测的风险评分是一个独立的预后因素。我们的结果表明,我们的流程是一种可靠的工具,可能有助于MCB选择和生存预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/fb3223db70cb/peerj-09-10884-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/e29574230390/peerj-09-10884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/ac39ad024857/peerj-09-10884-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/034e5a69aeb3/peerj-09-10884-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/6c40806d27d5/peerj-09-10884-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/1fc68561414a/peerj-09-10884-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/8323654a8a43/peerj-09-10884-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/f3643bcfa80c/peerj-09-10884-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/fb3223db70cb/peerj-09-10884-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/e29574230390/peerj-09-10884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/ac39ad024857/peerj-09-10884-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/034e5a69aeb3/peerj-09-10884-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/6c40806d27d5/peerj-09-10884-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/1fc68561414a/peerj-09-10884-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/8323654a8a43/peerj-09-10884-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/f3643bcfa80c/peerj-09-10884-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fa4/7894106/fb3223db70cb/peerj-09-10884-g008.jpg

相似文献

1
Predicting lung adenocarcinoma disease progression using methylation-correlated blocks and ensemble machine learning classifiers.使用甲基化相关模块和集成机器学习分类器预测肺腺癌疾病进展
PeerJ. 2021 Feb 16;9:e10884. doi: 10.7717/peerj.10884. eCollection 2021.
2
EnMCB: an R/bioconductor package for predicting disease progression based on methylation correlated blocks using ensemble models.EnMCB:一个基于整合模型,利用甲基化相关模块预测疾病进展的 R/bioconductor 包。
Bioinformatics. 2021 Nov 18;37(22):4282-4284. doi: 10.1093/bioinformatics/btab415.
3
A Novel Methylation-based Model for Prognostic Prediction in Lung Adenocarcinoma.一种用于肺腺癌预后预测的新型甲基化模型。
Curr Genomics. 2024 Feb 23;25(1):26-40. doi: 10.2174/0113892029277397231228062412.
4
Prognostic and Predictive Value of Three DNA Methylation Signatures in Lung Adenocarcinoma.三种DNA甲基化特征在肺腺癌中的预后及预测价值
Front Genet. 2019 Apr 24;10:349. doi: 10.3389/fgene.2019.00349. eCollection 2019.
5
Integrative Analysis of DNA Methylation and Gene Expression Profiles Identifies Colorectal Cancer-Related Diagnostic Biomarkers.DNA甲基化与基因表达谱的综合分析鉴定出结直肠癌相关诊断生物标志物。
Pathol Oncol Res. 2021 Jul 21;27:1609784. doi: 10.3389/pore.2021.1609784. eCollection 2021.
6
A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies.一项通过机器学习策略确定肺腺癌新预后预测模型的大型队列研究。
BMC Cancer. 2019 Sep 5;19(1):886. doi: 10.1186/s12885-019-6101-7.
7
Development and validation of a robust immune-related prognostic signature in early-stage lung adenocarcinoma.早期肺腺癌中一种稳健的免疫相关预后标志物的开发与验证
J Transl Med. 2020 Oct 7;18(1):380. doi: 10.1186/s12967-020-02545-z.
8
A methylation-based nomogram for predicting survival in patients with lung adenocarcinoma.基于甲基化的列线图预测肺腺癌患者的生存。
BMC Cancer. 2021 Jul 12;21(1):801. doi: 10.1186/s12885-021-08539-4.
9
DNA methylation profiling to predict recurrence risk in stage Ι lung adenocarcinoma: Development and validation of a nomogram to clinical management.DNA 甲基化分析预测Ⅰ期肺腺癌复发风险:用于临床管理的列线图的建立和验证。
J Cell Mol Med. 2020 Jul;24(13):7576-7589. doi: 10.1111/jcmm.15393. Epub 2020 Jun 12.
10
Establishment of a prognostic signature for lung adenocarcinoma by integration of 7 pyroptosis-related genes and cross-validation between the TCGA and GEO cohorts: A comprehensive bioinformatics analysis.通过 TCGA 和 GEO 队列的整合,建立一个与 7 个细胞焦亡相关基因相关的肺腺癌预后特征:一项全面的生物信息学分析。
Medicine (Baltimore). 2022 Jul 22;101(29):e29710. doi: 10.1097/MD.0000000000029710.

引用本文的文献

1
Artificial intelligence in lung cancer: current applications, future perspectives, and challenges.人工智能在肺癌中的应用:当前应用、未来展望及挑战
Front Oncol. 2024 Dec 23;14:1486310. doi: 10.3389/fonc.2024.1486310. eCollection 2024.
2
Prognostic value of lncRNAs related to fatty acid metabolism in lung adenocarcinoma and their correlation with tumor microenvironment based on bioinformatics analysis.基于生物信息学分析的脂肪酸代谢相关lncRNAs在肺腺癌中的预后价值及其与肿瘤微环境的相关性
Front Oncol. 2022 Oct 10;12:1022097. doi: 10.3389/fonc.2022.1022097. eCollection 2022.
3
A Methylation Diagnostic Model Based on Random Forests and Neural Networks for Asthma Identification.

本文引用的文献

1
EnMCB: an R/bioconductor package for predicting disease progression based on methylation correlated blocks using ensemble models.EnMCB:一个基于整合模型,利用甲基化相关模块预测疾病进展的 R/bioconductor 包。
Bioinformatics. 2021 Nov 18;37(22):4282-4284. doi: 10.1093/bioinformatics/btab415.
2
DNA methylation profiling to predict overall survival risk in gastric cancer: development and validation of a nomogram to optimize clinical management.DNA甲基化谱预测胃癌总生存风险:用于优化临床管理的列线图的开发与验证
J Cancer. 2020 Apr 27;11(15):4352-4365. doi: 10.7150/jca.44436. eCollection 2020.
3
ASPEN, a methodology for reconstructing protein evolution with improved accuracy using ensemble models.
基于随机森林和神经网络的哮喘识别甲基化诊断模型。
Comput Math Methods Med. 2022 Sep 28;2022:2679050. doi: 10.1155/2022/2679050. eCollection 2022.
使用集成模型提高准确性的蛋白质进化重建方法 ASPEN。
Elife. 2019 Oct 17;8:e47676. doi: 10.7554/eLife.47676.
4
Distinct methylation levels of mature microRNAs in gastrointestinal cancers.胃肠道癌中成熟 microRNAs 的不同甲基化水平。
Nat Commun. 2019 Aug 29;10(1):3888. doi: 10.1038/s41467-019-11826-1.
5
A four-DNA methylation biomarker is a superior predictor of survival of patients with cutaneous melanoma.四项 DNA 甲基化生物标志物可更好地预测皮肤黑素瘤患者的生存情况。
Elife. 2019 Jun 6;8:e44310. doi: 10.7554/eLife.44310.
6
Does hypermethylation of CpG island in the promoter region of the E-cadherin gene increase the risk of lung cancer? A meta-analysis.E-钙黏蛋白基因启动子区 CpG 岛高甲基化是否增加肺癌风险?一项荟萃分析。
Thorac Cancer. 2019 Jan;10(1):54-59. doi: 10.1111/1759-7714.12900. Epub 2018 Nov 3.
7
An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines.基于多元判别分析、分类和回归树以及支持向量机的洪水易发性综合预测。
Sci Total Environ. 2019 Feb 15;651(Pt 2):2087-2096. doi: 10.1016/j.scitotenv.2018.10.064. Epub 2018 Oct 6.
8
Hypermethylation of gene body CpG islands predicts high dosage of functional oncogenes in liver cancer.基因体 CpG 岛的超甲基化预示着肝癌中功能性致癌基因的高剂量。
Nat Commun. 2018 Aug 8;9(1):3164. doi: 10.1038/s41467-018-05550-5.
9
DNA methylation haplotypes as cancer markers.作为癌症标志物的DNA甲基化单倍型
Nat Genet. 2018 Aug;50(8):1062-1063. doi: 10.1038/s41588-018-0185-x.
10
DNA methylation-based classification of ependymomas in adulthood: implications for diagnosis and treatment.基于 DNA 甲基化的成人室管膜瘤分类:对诊断和治疗的影响。
Neuro Oncol. 2018 Nov 12;20(12):1616-1624. doi: 10.1093/neuonc/noy118.