Integrative analysis of transcriptomic and metabolomic data via sparse canonical correlation analysis with incorporation of biological information.

Authors

Safo Sandra E, Li Shuzhao, Long Qi

Affiliations

Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia, U.S.A.

Department of Medicine, Division of Pulmonary, Allergy and Critical Care Medicine, Emory University, Atlanta, Georgia, U.S.A.

Publication information

Biometrics. 2018 Mar;74(1):300-312. doi: 10.1111/biom.12715. Epub 2017 May 8.

Abstract

Integrative analysis of high dimensional omics data is becoming increasingly popular. At the same time, incorporating known functional relationships among variables in analysis of omics data has been shown to help elucidate underlying mechanisms for complex diseases. In this article, our goal is to assess association between transcriptomic and metabolomic data from a Predictive Health Institute (PHI) study that includes healthy adults at a high risk of developing cardiovascular diseases. Adopting a strategy that is both data-driven and knowledge-based, we develop statistical methods for sparse canonical correlation analysis (CCA) with incorporation of known biological information. Our proposed methods use prior network structural information among genes and among metabolites to guide selection of relevant genes and metabolites in sparse CCA, providing insight on the molecular underpinning of cardiovascular disease. Our simulations demonstrate that the structured sparse CCA methods outperform several existing sparse CCA methods in selecting relevant genes and metabolites when structural information is informative and are robust to mis-specified structural information. Our analysis of the PHI study reveals that a number of gene and metabolic pathways including some known to be associated with cardiovascular diseases are enriched in the set of genes and metabolites selected by our proposed approach.
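As a rough illustration of what sparse CCA does, the sketch below finds one pair of sparse weight vectors by alternating soft-thresholded updates on the sample cross-covariance matrix. It is a generic penalized-decomposition style simplification, not the structured method proposed in the article: it treats the within-set covariances as identity, and the network-guided penalties described in the abstract are not reproduced. All function names, the toy data, and the penalty values are assumptions made for the example.

```python
import random

def soft_threshold(values, lam):
    """Shrink each entry toward zero by lam; entries within lam of zero become 0."""
    return [(abs(x) - lam) * (1.0 if x > 0 else -1.0) if abs(x) > lam else 0.0
            for x in values]

def unit_norm(values):
    """Scale a vector to Euclidean length 1 (an all-zero vector is returned unchanged)."""
    norm = sum(x * x for x in values) ** 0.5
    return [x / norm for x in values] if norm > 0 else list(values)

def cross_covariance(X, Y):
    """Sample cross-covariance (p x q) between the columns of X (n x p) and Y (n x q)."""
    n, p, q = len(X), len(X[0]), len(Y[0])
    mean_x = [sum(row[j] for row in X) / n for j in range(p)]
    mean_y = [sum(row[k] for row in Y) / n for k in range(q)]
    return [[sum((X[i][j] - mean_x[j]) * (Y[i][k] - mean_y[k]) for i in range(n)) / (n - 1)
             for k in range(q)] for j in range(p)]

def sparse_cca(X, Y, lam_x, lam_y, n_iter=50):
    """Alternate soft-thresholded updates of u (weights for X) and v (weights for Y).

    Assumption: within-set covariance matrices are replaced by the identity.
    """
    C = cross_covariance(X, Y)
    p, q = len(C), len(C[0])
    v = unit_norm([1.0] * q)
    u = [0.0] * p
    for _ in range(n_iter):
        u = unit_norm(soft_threshold(
            [sum(C[j][k] * v[k] for k in range(q)) for j in range(p)], lam_x))
        v = unit_norm(soft_threshold(
            [sum(C[j][k] * u[j] for j in range(p)) for k in range(q)], lam_y))
    return u, v

def make_toy_data(n=400, seed=0):
    """Hypothetical data: X columns 0-1 and Y column 0 share a latent signal z; the rest is noise."""
    rng = random.Random(seed)
    X, Y = [], []
    for _ in range(n):
        z = rng.gauss(0, 1)
        X.append([z + rng.gauss(0, 0.3), z + rng.gauss(0, 0.3)]
                 + [rng.gauss(0, 1) for _ in range(4)])
        Y.append([z + rng.gauss(0, 0.3)] + [rng.gauss(0, 1) for _ in range(3)])
    return X, Y

X, Y = make_toy_data()
u, v = sparse_cca(X, Y, lam_x=0.3, lam_y=0.3)
print("selected X columns:", [j for j, w in enumerate(u) if w != 0.0])
print("selected Y columns:", [k for k, w in enumerate(v) if w != 0.0])
```

In this toy setting the nonzero entries of u and v play the role of the selected genes and metabolites. A structured variant would additionally use prior network information to decide which variables are kept together.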
