• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多组学数据的二元疾病结局预测的优先级弹性网络

Priority-Elastic net for binary disease outcome prediction based on multi-omics data.

作者信息

Musib Laila, Coletti Roberta, Lopes Marta B, Mouriño Helena, Carrasquinha Eunice

机构信息

Departamento de Estatística e Investigação Operacional, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisboa, 1749-016, Portugal.

CEAUL - Centro de Estatística e Aplicações, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, Lisbon, 1749-016, Portugal.

出版信息

BioData Min. 2024 Oct 29;17(1):45. doi: 10.1186/s13040-024-00401-0.

DOI:10.1186/s13040-024-00401-0
PMID:39472942
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11523883/
Abstract

BACKGROUND

High-dimensional omics data integration has emerged as a prominent avenue within the healthcare industry, presenting substantial potential to improve predictive models. However, the data integration process faces several challenges, including data heterogeneity, priority sequence in which data blocks are prioritized for rendering predictive information contained in multiple blocks, assessing the flow of information from one omics level to the other and multicollinearity.

METHODS

We propose the Priority-Elastic net algorithm, a hierarchical regression method extending Priority-Lasso for the binary logistic regression model by incorporating a priority order for blocks of variables while fitting Elastic-net models sequentially for each block. The fitted values from each step are then used as an offset in the subsequent step. Additionally, we considered the adaptive elastic-net penalty within our priority framework to compare the results.

RESULTS

The Priority-Elastic net and Priority-Adaptive Elastic net algorithms were evaluated on a brain tumor dataset available from The Cancer Genome Atlas (TCGA), accounting for transcriptomics, proteomics, and clinical information measured over two glioma types: Lower-grade glioma (LGG) and glioblastoma (GBM).

CONCLUSION

Our findings suggest that the Priority-Elastic net is a highly advantageous choice for a wide range of applications. It offers moderate computational complexity, flexibility in integrating prior knowledge while introducing a hierarchical modeling perspective, and, importantly, improved stability and accuracy in predictions, making it superior to the other methods discussed. This evolution marks a significant step forward in predictive modeling, offering a sophisticated tool for navigating the complexities of multi-omics datasets in pursuit of precision medicine's ultimate goal: personalized treatment optimization based on a comprehensive array of patient-specific data. This framework can be generalized to time-to-event, Cox proportional hazards regression and multicategorical outcomes. A practical implementation of this method is available upon request in R script, complete with an example to facilitate its application.

摘要

背景

高维组学数据整合已成为医疗行业中一个突出的途径,具有显著提升预测模型的潜力。然而,数据整合过程面临若干挑战,包括数据异质性、确定数据块优先级顺序以呈现多个数据块中包含的预测信息、评估从一个组学水平到另一个组学水平的信息流以及多重共线性。

方法

我们提出了优先级弹性网算法,这是一种层次回归方法,通过在拟合弹性网模型时为变量块引入优先级顺序,对用于二元逻辑回归模型的优先级套索法进行了扩展。然后将每个步骤的拟合值用作后续步骤的偏移量。此外,我们在优先级框架内考虑了自适应弹性网惩罚以比较结果。

结果

在可从癌症基因组图谱(TCGA)获取的脑肿瘤数据集上评估了优先级弹性网和优先级自适应弹性网算法,该数据集涵盖了两种胶质瘤类型(低级别胶质瘤(LGG)和成胶质细胞瘤(GBM))的转录组学、蛋白质组学和临床信息。

结论

我们的研究结果表明,优先级弹性网在广泛的应用中是一个非常有利的选择。它具有适度的计算复杂度,在整合先验知识时具有灵活性,同时引入了层次建模视角,重要的是,在预测方面具有更高的稳定性和准确性,使其优于所讨论的其他方法。这一进展标志着预测建模向前迈出了重要一步,为应对多组学数据集的复杂性提供了一个复杂的工具,以追求精准医学的最终目标:基于一系列全面的患者特定数据进行个性化治疗优化。该框架可推广到生存时间、Cox比例风险回归和多分类结果。可根据要求提供该方法在R脚本中的实际实现,并配有示例以方便其应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/4047ff8aa46b/13040_2024_401_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/cf1ac082789b/13040_2024_401_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/0722376a28db/13040_2024_401_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/b6b5ddaf40fc/13040_2024_401_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/4047ff8aa46b/13040_2024_401_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/cf1ac082789b/13040_2024_401_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/0722376a28db/13040_2024_401_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/b6b5ddaf40fc/13040_2024_401_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b038/11523883/4047ff8aa46b/13040_2024_401_Fig3_HTML.jpg

相似文献

1
Priority-Elastic net for binary disease outcome prediction based on multi-omics data.基于多组学数据的二元疾病结局预测的优先级弹性网络
BioData Min. 2024 Oct 29;17(1):45. doi: 10.1186/s13040-024-00401-0.
2
Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data.优先lasso:一种使用多组学数据预测临床结局的简单层次方法。
BMC Bioinformatics. 2018 Sep 12;19(1):322. doi: 10.1186/s12859-018-2344-6.
3
Prognosis of lasso-like penalized Cox models with tumor profiling improves prediction over clinical data alone and benefits from bi-dimensional pre-screening.具有肿瘤特征分析的套索惩罚 Cox 模型的预后可提高预测准确性,优于仅使用临床数据的预测,并且受益于二维预筛选。
BMC Cancer. 2022 Oct 5;22(1):1045. doi: 10.1186/s12885-022-10117-1.
4
Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions.使用正则化线性回归模型的基因组选择:岭回归、套索回归、弹性网络及其扩展。
BMC Proc. 2012 May 21;6 Suppl 2(Suppl 2):S10. doi: 10.1186/1753-6561-6-S2-S10.
5
Multi-omics facilitated variable selection in Cox-regression model for cancer prognosis prediction.多组学技术助力Cox回归模型中的变量选择以进行癌症预后预测。
Methods. 2017 Jul 15;124:100-107. doi: 10.1016/j.ymeth.2017.06.010. Epub 2017 Jun 13.
6
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
7
A plea for taking all available clinical information into account when assessing the predictive value of omics data.呼吁在评估组学数据的预测价值时,考虑所有可用的临床信息。
BMC Med Res Methodol. 2019 Jul 24;19(1):162. doi: 10.1186/s12874-019-0802-0.
8
DeepAutoGlioma: a deep learning autoencoder-based multi-omics data integration and classification tools for glioma subtyping.深度自动胶质瘤分类器:一种基于深度学习自动编码器的多组学数据集成与胶质瘤亚型分类工具。
BioData Min. 2023 Nov 15;16(1):32. doi: 10.1186/s13040-023-00349-7.
9
High-dimensional Cox models: the choice of penalty as part of the model building process.高维Cox模型:作为模型构建过程一部分的惩罚项选择
Biom J. 2010 Feb;52(1):50-69. doi: 10.1002/bimj.200900064.
10
Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.基于乳腺癌元维度组学数据间的相互作用预测删失生存数据。
J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.

引用本文的文献

1
Pharmacoepigenetic Biomarkers in Inflammatory Bowel Diseases: A Narrative Review.炎症性肠病中的药物表观遗传生物标志物:一篇叙述性综述
Yale J Biol Med. 2025 Jun 30;98(2):171-186. doi: 10.59249/FTXB7704. eCollection 2025 Jun.
2
Untargeted Lipidomic Biomarkers for Liver Cancer Diagnosis: A Tree-Based Machine Learning Model Enhanced by Explainable Artificial Intelligence.用于肝癌诊断的非靶向脂质组学生物标志物:一种由可解释人工智能增强的基于树的机器学习模型。
Medicina (Kaunas). 2025 Feb 26;61(3):405. doi: 10.3390/medicina61030405.

本文引用的文献

1
Insights from multi-omics integration in complex disease primary tissues.复杂疾病原发性组织中多组学整合的研究进展。
Trends Genet. 2023 Jan;39(1):46-58. doi: 10.1016/j.tig.2022.08.005. Epub 2022 Sep 19.
2
The 2021 WHO Classification of Tumors of the Central Nervous System: a summary.2021 年世卫组织中枢神经系统肿瘤分类:概述。
Neuro Oncol. 2021 Aug 2;23(8):1231-1251. doi: 10.1093/neuonc/noab106.
3
Foundational Statistical Principles in Medical Research: Sensitivity, Specificity, Positive Predictive Value, and Negative Predictive Value.
医学研究中的基本统计学原理:灵敏度、特异性、阳性预测值和阴性预测值。
Medicina (Kaunas). 2021 May 16;57(5):503. doi: 10.3390/medicina57050503.
4
Multi-omics Data Integration, Interpretation, and Its Application.多组学数据整合、解读及其应用
Bioinform Biol Insights. 2020 Jan 31;14:1177932219899051. doi: 10.1177/1177932219899051. eCollection 2020.
5
Twiner: correlation-based regularization for identifying common cancer gene signatures.Twiner:基于相关性的正则化方法,用于鉴定常见癌症基因特征。
BMC Bioinformatics. 2019 Jun 25;20(1):356. doi: 10.1186/s12859-019-2937-8.
6
Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data.优先lasso:一种使用多组学数据预测临床结局的简单层次方法。
BMC Bioinformatics. 2018 Sep 12;19(1):322. doi: 10.1186/s12859-018-2344-6.
7
Integrated Omics: Tools, Advances, and Future Approaches.综合组学:工具、进展与未来方法
J Mol Endocrinol. 2018 Jul 13. doi: 10.1530/JME-18-0055.
8
Adult Glioma Incidence and Survival by Race or Ethnicity in the United States From 2000 to 2014.美国 2000 年至 2014 年按种族或族裔划分的成人脑胶质瘤发病率和生存率。
JAMA Oncol. 2018 Sep 1;4(9):1254-1262. doi: 10.1001/jamaoncol.2018.1789.
9
More Is Better: Recent Progress in Multi-Omics Data Integration Methods.越多越好:多组学数据整合方法的最新进展
Front Genet. 2017 Jun 16;8:84. doi: 10.3389/fgene.2017.00084. eCollection 2017.
10
IPF-LASSO: Integrative -Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data.IPF-LASSO:基于多组学数据的带惩罚因子的整合惩罚回归用于预测
Comput Math Methods Med. 2017;2017:7691937. doi: 10.1155/2017/7691937. Epub 2017 May 4.