• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多重集相关性和因子分析有助于对多组学数据进行探索。

Multiset correlation and factor analysis enables exploration of multi-omics data.

作者信息

Brown Brielin C, Wang Collin, Kasela Silva, Aguet François, Nachun Daniel C, Taylor Kent D, Tracy Russell P, Durda Peter, Liu Yongmei, Johnson W Craig, Van Den Berg David, Gupta Namrata, Gabriel Stacy, Smith Joshua D, Gerzsten Robert, Clish Clary, Wong Quenna, Papanicolau George, Blackwell Thomas W, Rotter Jerome I, Rich Stephen S, Barr R Graham, Ardlie Kristin G, Knowles David A, Lappalainen Tuuli

机构信息

New York Genome Center, New York, NY, USA.

Data Science Institute, Columbia University, New York, NY, USA.

出版信息

Cell Genom. 2023 Jul 10;3(8):100359. doi: 10.1016/j.xgen.2023.100359. eCollection 2023 Aug 9.

DOI:10.1016/j.xgen.2023.100359
PMID:
37601969
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10435377/
Abstract

Multi-omics datasets are becoming more common, necessitating better integration methods to realize their revolutionary potential. Here, we introduce multi-set correlation and factor analysis (MCFA), an unsupervised integration method tailored to the unique challenges of high-dimensional genomics data that enables fast inference of shared and private factors. We used MCFA to integrate methylation markers, protein expression, RNA expression, and metabolite levels in 614 diverse samples from the Trans-Omics for Precision Medicine/Multi-Ethnic Study of Atherosclerosis multi-omics pilot. Samples cluster strongly by ancestry in the shared space, even in the absence of genetic information, while private spaces frequently capture dataset-specific technical variation. Finally, we integrated genetic data by conducting a genome-wide association study (GWAS) of our inferred factors, observing that several factors are enriched for GWAS hits and -expression quantitative trait loci. Two of these factors appear to be related to metabolic disease. Our study provides a foundation and framework for further integrative analysis of ever larger multi-modal genomic datasets.

摘要

多组学数据集正变得越来越普遍,这就需要更好的整合方法来实现其变革性潜力。在此,我们介绍多集相关性和因子分析(MCFA),这是一种针对高维基因组数据的独特挑战量身定制的无监督整合方法,能够快速推断共享因子和私有因子。我们使用MCFA对精准医学跨组学/动脉粥样硬化多族裔研究多组学试点项目中614个不同样本的甲基化标记、蛋白质表达、RNA表达和代谢物水平进行整合。在共享空间中,样本按祖先强烈聚类,即使在没有遗传信息的情况下也是如此,而私有空间经常捕捉特定于数据集的技术变异。最后,我们通过对推断出的因子进行全基因组关联研究(GWAS)来整合遗传数据,观察到几个因子在GWAS命中和表达数量性状位点方面富集。其中两个因子似乎与代谢疾病有关。我们的研究为进一步整合分析更大规模的多模态基因组数据集提供了基础和框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/baf2579cac2e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/7e1b0a5cea41/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/175235ff9b85/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/d703c7c28eda/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/baf2579cac2e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/7e1b0a5cea41/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/175235ff9b85/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/d703c7c28eda/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b1e/10435377/baf2579cac2e/gr3.jpg

相似文献

1
Multiset correlation and factor analysis enables exploration of multi-omics data.多重集相关性和因子分析有助于对多组学数据进行探索。
Cell Genom. 2023 Jul 10;3(8):100359. doi: 10.1016/j.xgen.2023.100359. eCollection 2023 Aug 9.
2
General Kernel Machine Methods for Multi-Omics Integration and Genome-Wide Association Testing With Related Individuals.用于多组学整合及相关个体全基因组关联测试的通用核机器方法
Genet Epidemiol. 2025 Jan;49(1):e22610. doi: 10.1002/gepi.22610.
3
Multi-omics analysis reveals novel causal pathways in psoriasis pathogenesis.多组学分析揭示了银屑病发病机制中的新因果途径。
J Transl Med. 2025 Jan 22;23(1):100. doi: 10.1186/s12967-025-06099-w.
4
Identifying cross-tissue molecular targets of lung function by multi-omics integration analysis from DNA methylation and gene expression of diverse human tissues.通过对多种人类组织的DNA甲基化和基因表达进行多组学整合分析来鉴定肺功能的跨组织分子靶点。
BMC Genomics. 2025 Mar 24;26(1):289. doi: 10.1186/s12864-025-11476-2.
5
Integrative Analysis of Multi-omics Data for Discovery and Functional Studies of Complex Human Diseases.用于复杂人类疾病发现和功能研究的多组学数据综合分析
Adv Genet. 2016;93:147-90. doi: 10.1016/bs.adgen.2015.11.004. Epub 2016 Jan 25.
6
A Systemic Analysis of Transcriptomic and Epigenomic Data To Reveal Regulation Patterns for Complex Disease.基于转录组和表观基因组数据的系统分析揭示复杂疾病的调控模式。
G3 (Bethesda). 2017 Jul 5;7(7):2271-2279. doi: 10.1534/g3.117.042408.
7
Multiset sparse partial least squares path modeling for high dimensional omics data analysis.多集稀疏偏最小二乘路径建模在高维组学数据分析中的应用。
BMC Bioinformatics. 2020 Jan 9;21(1):9. doi: 10.1186/s12859-019-3286-3.
8
Knowledge Base Commons (KBCommons) v1.1: a universal framework for multi-omics data integration and biological discoveries.知识库共通体(KBCommons)v1.1:一种用于多组学数据集成和生物学发现的通用框架。
BMC Genomics. 2019 Dec 20;20(Suppl 11):947. doi: 10.1186/s12864-019-6287-8.
9
A multi-omics data simulator for complex disease studies and its application to evaluate multi-omics data analysis methods for disease classification.用于复杂疾病研究的多组学数据模拟器及其在评估疾病分类的多组学数据分析方法中的应用。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz045.
10
Tissue-specific multi-omics analysis of atrial fibrillation.心房颤动的组织特异性多组学分析。
Nat Commun. 2022 Jan 21;13(1):441. doi: 10.1038/s41467-022-27953-1.

引用本文的文献

1
Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation.基于原则的英国生物银行表型数据提取揭示了人类变异的潜在结构。
Nat Hum Behav. 2024 Aug;8(8):1599-1615. doi: 10.1038/s41562-024-01909-5. Epub 2024 Jul 4.
2
Linking Prenatal Environmental Exposures to Lifetime Health with Epigenome-Wide Association Studies: State-of-the-Science Review and Future Recommendations.将产前环境暴露与全基因组关联研究联系起来,以了解终生健康:科学综述及未来建议。
Environ Health Perspect. 2023 Dec;131(12):126001. doi: 10.1289/EHP12956. Epub 2023 Dec 4.
3
Ötzi the Iceman has a new look: balding and dark-skinned.

本文引用的文献

1
Interaction molecular QTL mapping discovers cellular and environmental modifiers of genetic regulatory effects.交互分子 QTL 作图发现遗传调控效应的细胞和环境修饰因子。
Am J Hum Genet. 2024 Jan 4;111(1):133-149. doi: 10.1016/j.ajhg.2023.11.013.
2
Discovery and systematic characterization of risk variants and genes for coronary artery disease in over a million participants.在超过 100 万名参与者中发现并系统地描述了冠心病的风险变异和基因。
Nat Genet. 2022 Dec;54(12):1803-1815. doi: 10.1038/s41588-022-01233-6. Epub 2022 Dec 6.
3
Schizophrenia: a disorder of broken brain bioenergetics.
冰人奥茨有了新形象:谢顶且皮肤黝黑。
Nature. 2023 Aug 16. doi: 10.1038/d41586-023-02562-0.
4
Canonical correlation analysis for multi-omics: Application to cross-cohort analysis.多组学的典范相关分析:在跨队列分析中的应用。
PLoS Genet. 2023 May 22;19(5):e1010517. doi: 10.1371/journal.pgen.1010517. eCollection 2023 May.
精神分裂症:一种大脑生物能量障碍的疾病。
Mol Psychiatry. 2022 May;27(5):2393-2404. doi: 10.1038/s41380-022-01494-x. Epub 2022 Mar 9.
4
The power of genetic diversity in genome-wide association studies of lipids.遗传多样性在全基因组关联研究脂质中的作用。
Nature. 2021 Dec;600(7890):675-679. doi: 10.1038/s41586-021-04064-3. Epub 2021 Dec 9.
5
Welch-weighted Egger regression reduces false positives due to correlated pleiotropy in Mendelian randomization.Welch 加权 Egger 回归减少了孟德尔随机化中由于相关的多效性导致的假阳性。
Am J Hum Genet. 2021 Dec 2;108(12):2319-2335. doi: 10.1016/j.ajhg.2021.10.006.
6
Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression.大规模顺式和反式 eQTL 分析确定了数千个调节血液基因表达的遗传位点和多基因评分。
Nat Genet. 2021 Sep;53(9):1300-1310. doi: 10.1038/s41588-021-00913-z. Epub 2021 Sep 2.
7
A System for Phenotype Harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program.国家心肺血液研究所精准医学转化组学(TOPMed)计划中的表型协调系统。
Am J Epidemiol. 2021 Oct 1;190(10):1977-1992. doi: 10.1093/aje/kwab115.
8
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.
9
Genetics of 35 blood and urine biomarkers in the UK Biobank.英国生物库中 35 项血液和尿液生物标志物的遗传学研究
Nat Genet. 2021 Feb;53(2):185-194. doi: 10.1038/s41588-020-00757-z. Epub 2021 Jan 18.
10
DNA methylation and lipid metabolism: an EWAS of 226 metabolic measures.DNA 甲基化与脂质代谢:226 项代谢指标的 EWAS 研究。
Clin Epigenetics. 2021 Jan 7;13(1):7. doi: 10.1186/s13148-020-00957-8.