• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于高维组学数据的多重集稀疏冗余分析。

Multiset sparse redundancy analysis for high-dimensional omics data.

作者信息

Csala Attila, Hof Michel H, Zwinderman Aeilko H

机构信息

Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Center, Amsterdam, The Netherlands.

出版信息

Biom J. 2019 Mar;61(2):406-423. doi: 10.1002/bimj.201700248. Epub 2018 Dec 3.

DOI:10.1002/bimj.201700248
PMID:30506971
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6587877/
Abstract

Redundancy Analysis (RDA) is a well-known method used to describe the directional relationship between related data sets. Recently, we proposed sparse Redundancy Analysis (sRDA) for high-dimensional genomic data analysis to find explanatory variables that explain the most variance of the response variables. As more and more biomolecular data become available from different biological levels, such as genotypic and phenotypic data from different omics domains, a natural research direction is to apply an integrated analysis approach in order to explore the underlying biological mechanism of certain phenotypes of the given organism. We show that the multiset sparse Redundancy Analysis (multi-sRDA) framework is a prominent candidate for high-dimensional omics data analysis since it accounts for the directional information transfer between omics sets, and, through its sparse solutions, the interpretability of the result is improved. In this paper, we also describe a software implementation for multi-sRDA, based on the Partial Least Squares Path Modeling algorithm. We test our method through simulation and real omics data analysis with data sets of 364,134 methylation markers, 18,424 gene expression markers, and 47 cytokine markers measured on 37 patients with Marfan syndrome.

摘要

冗余分析(RDA)是一种用于描述相关数据集之间方向关系的著名方法。最近,我们提出了用于高维基因组数据分析的稀疏冗余分析(sRDA),以找到能够解释响应变量最大方差的解释变量。随着越来越多来自不同生物水平的生物分子数据可用,例如来自不同组学领域的基因型和表型数据,一个自然的研究方向是应用综合分析方法,以探索给定生物体某些表型的潜在生物学机制。我们表明,多集稀疏冗余分析(multi-sRDA)框架是高维组学数据分析的一个突出候选方法,因为它考虑了组学集之间的方向信息传递,并且通过其稀疏解,提高了结果的可解释性。在本文中,我们还描述了基于偏最小二乘路径建模算法的multi-sRDA的软件实现。我们通过对37名马凡综合征患者测量的364,134个甲基化标记、18,424个基因表达标记和47个细胞因子标记的数据集进行模拟和实际组学数据分析来测试我们的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/5e1513f20ae8/BIMJ-61-406-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/aa311fee0b3b/BIMJ-61-406-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/c49267fe1cac/BIMJ-61-406-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/ea658b8cc269/BIMJ-61-406-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/5e1513f20ae8/BIMJ-61-406-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/aa311fee0b3b/BIMJ-61-406-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/c49267fe1cac/BIMJ-61-406-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/ea658b8cc269/BIMJ-61-406-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b363/6587877/5e1513f20ae8/BIMJ-61-406-g004.jpg

相似文献

1
Multiset sparse redundancy analysis for high-dimensional omics data.用于高维组学数据的多重集稀疏冗余分析。
Biom J. 2019 Mar;61(2):406-423. doi: 10.1002/bimj.201700248. Epub 2018 Dec 3.
2
Sparse redundancy analysis of high-dimensional genetic and genomic data.高维遗传和基因组数据的稀疏冗余分析。
Bioinformatics. 2017 Oct 15;33(20):3228-3234. doi: 10.1093/bioinformatics/btx374.
3
Multiset sparse partial least squares path modeling for high dimensional omics data analysis.多集稀疏偏最小二乘路径建模在高维组学数据分析中的应用。
BMC Bioinformatics. 2020 Jan 9;21(1):9. doi: 10.1186/s12859-019-3286-3.
4
Multivariate Statistical Methods for High-Dimensional Multiset Omics Data Analysis用于高维多集组学数据分析的多元统计方法
5
Sparse partial least squares with group and subgroup structure.稀疏偏最小二乘与分组和子分组结构。
Stat Med. 2018 Oct 15;37(23):3338-3356. doi: 10.1002/sim.7821. Epub 2018 Jun 11.
6
Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components.基于分块稀疏主成分的多组学数据综合分析。
Int J Mol Sci. 2020 Nov 2;21(21):8202. doi: 10.3390/ijms21218202.
7
Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.基于乳腺癌元维度组学数据间的相互作用预测删失生存数据。
J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.
8
Computationally scalable regression modeling for ultrahigh-dimensional omics data with ParProx.使用 ParProx 进行超高维组学数据的计算可扩展回归建模。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab256.
9
Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data.基于多组学数据预测卵巢癌生存的最小冗余最大相关性多视图特征选择。
BMC Med Genomics. 2018 Sep 14;11(Suppl 3):71. doi: 10.1186/s12920-018-0388-0.
10
A Systemic Analysis of Transcriptomic and Epigenomic Data To Reveal Regulation Patterns for Complex Disease.基于转录组和表观基因组数据的系统分析揭示复杂疾病的调控模式。
G3 (Bethesda). 2017 Jul 5;7(7):2271-2279. doi: 10.1534/g3.117.042408.

引用本文的文献

1
Ginsenoside Rg Activates the LKB1/AMPK/mTOR Signaling Pathway and Modifies the Gut Microbiota to Alleviate Nonalcoholic Fatty Liver Disease Induced by a High-Fat Diet.人参皂苷 Rg 通过激活 LKB1/AMPK/mTOR 信号通路并调节肠道微生物群来缓解高脂饮食诱导的非酒精性脂肪肝病。
Nutrients. 2024 Mar 15;16(6):842. doi: 10.3390/nu16060842.
2
Redundancy Analysis to Reduce the High-Dimensional Near-Infrared Spectral Information to Improve the Authentication of Olive Oil.冗余分析降低高维近红外光谱信息以提高橄榄油的鉴别能力。
J Chem Inf Model. 2022 Oct 10;62(19):4620-4628. doi: 10.1021/acs.jcim.2c00964. Epub 2022 Sep 21.
3

本文引用的文献

1
More Is Better: Recent Progress in Multi-Omics Data Integration Methods.越多越好:多组学数据整合方法的最新进展
Front Genet. 2017 Jun 16;8:84. doi: 10.3389/fgene.2017.00084. eCollection 2017.
2
Sparse redundancy analysis of high-dimensional genetic and genomic data.高维遗传和基因组数据的稀疏冗余分析。
Bioinformatics. 2017 Oct 15;33(20):3228-3234. doi: 10.1093/bioinformatics/btx374.
3
Supervised multiblock sparse multivariable analysis with application to multimodal brain imaging genetics.监督式多块稀疏多变量分析及其在多模态脑成像遗传学中的应用。
Transcriptome Profiling and Metagenomic Analysis Help to Elucidate Interactions in an Inflammation-Associated Cancer Mouse Model.
转录组分析和宏基因组分析有助于阐明炎症相关癌症小鼠模型中的相互作用。
Cancers (Basel). 2021 Jul 22;13(15):3683. doi: 10.3390/cancers13153683.
4
Multiset sparse partial least squares path modeling for high dimensional omics data analysis.多集稀疏偏最小二乘路径建模在高维组学数据分析中的应用。
BMC Bioinformatics. 2020 Jan 9;21(1):9. doi: 10.1186/s12859-019-3286-3.
Biostatistics. 2017 Oct 1;18(4):651-665. doi: 10.1093/biostatistics/kxx011.
4
Integration of omics: more than the sum of its parts.组学整合:不止是各部分的总和。
Cancer Metab. 2016 Feb 19;4:4. doi: 10.1186/s40170-016-0143-y. eCollection 2016.
5
A Model and Simple Iterative Algorithm For Redundancy Analysis.一种用于冗余分析的模型及简单迭代算法。
Multivariate Behav Res. 1988 Jul 1;23(3):349-60. doi: 10.1207/s15327906mbr2303_4.
6
Sparse canonical correlation analysis from a predictive point of view.从预测角度看稀疏典型相关分析。
Biom J. 2015 Sep;57(5):834-51. doi: 10.1002/bimj.201400226. Epub 2015 Jul 6.
7
Losartan reduces aortic dilatation rate in adults with Marfan syndrome: a randomized controlled trial.氯沙坦可降低马凡综合征成年患者的主动脉扩张速度:一项随机对照试验。
Eur Heart J. 2013 Dec;34(45):3491-500. doi: 10.1093/eurheartj/eht334. Epub 2013 Sep 2.
8
Inflammation aggravates disease severity in Marfan syndrome patients.炎症加重马凡综合征患者的疾病严重程度。
PLoS One. 2012;7(3):e32963. doi: 10.1371/journal.pone.0032963. Epub 2012 Mar 30.
9
Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks.稀疏典型相关分析用于识别、连接和完成基因表达网络。
BMC Bioinformatics. 2009 Sep 28;10:315. doi: 10.1186/1471-2105-10-315.
10
Central dogma of molecular biology.分子生物学的中心法则
Nature. 1970 Aug 8;227(5258):561-3. doi: 10.1038/227561a0.