• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合组学应用中的元分析主成分分析。

Meta-analytic principal component analysis in integrative omics application.

机构信息

Department of Statistics, Keimyung University, Daegu 42601, South Korea.

Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2018 Apr 15;34(8):1321-1328. doi: 10.1093/bioinformatics/btx765.

DOI:10.1093/bioinformatics/btx765
PMID:29186328
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5905607/
Abstract

MOTIVATION

With the prevalent usage of microarray and massively parallel sequencing, numerous high-throughput omics datasets have become available in the public domain. Integrating abundant information among omics datasets is critical to elucidate biological mechanisms. Due to the high-dimensional nature of the data, methods such as principal component analysis (PCA) have been widely applied, aiming at effective dimension reduction and exploratory visualization.

RESULTS

In this article, we combine multiple omics datasets of identical or similar biological hypothesis and introduce two variations of meta-analytic framework of PCA, namely MetaPCA. Regularization is further incorporated to facilitate sparse feature selection in MetaPCA. We apply MetaPCA and sparse MetaPCA to simulations, three transcriptomic meta-analysis studies in yeast cell cycle, prostate cancer, mouse metabolism and a TCGA pan-cancer methylation study. The result shows improved accuracy, robustness and exploratory visualization of the proposed framework.

AVAILABILITY AND IMPLEMENTATION

An R package MetaPCA is available online. (http://tsenglab.biostat.pitt.edu/software.htm).

CONTACT

ctseng@pitt.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

随着微阵列和大规模平行测序的广泛应用,大量高通量组学数据集已经在公共领域中可用。整合组学数据集中丰富的信息对于阐明生物学机制至关重要。由于数据的高维性质,已经广泛应用了主成分分析(PCA)等方法,旨在实现有效的降维和探索性可视化。

结果

在本文中,我们将相同或相似生物学假设的多个组学数据集结合起来,并引入了两种 PCA 的荟萃分析框架变体,即 MetaPCA。进一步纳入正则化以促进 MetaPCA 中的稀疏特征选择。我们将 MetaPCA 和稀疏 MetaPCA 应用于模拟、酵母细胞周期、前列腺癌、小鼠代谢的三个转录组荟萃分析研究以及 TCGA 泛癌甲基化研究。结果表明,所提出的框架提高了准确性、稳健性和探索性可视化。

可用性和实现

一个名为 MetaPCA 的 R 包可在线获得。(http://tsenglab.biostat.pitt.edu/software.htm)。

联系方式

ctsend@pitt.edu。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Meta-analytic principal component analysis in integrative omics application.整合组学应用中的元分析主成分分析。
Bioinformatics. 2018 Apr 15;34(8):1321-1328. doi: 10.1093/bioinformatics/btx765.
2
MetaKTSP: a meta-analytic top scoring pair method for robust cross-study validation of omics prediction analysis.MetaKTSP:一种用于组学预测分析的稳健跨研究验证的元分析最高得分对方法。
Bioinformatics. 2016 Jul 1;32(13):1966-73. doi: 10.1093/bioinformatics/btw115. Epub 2016 Mar 2.
3
An R package suite for microarray meta-analysis in quality control, differentially expressed gene analysis and pathway enrichment detection.一个用于微阵列荟萃分析的 R 包套件,用于质量控制、差异表达基因分析和途径富集检测。
Bioinformatics. 2012 Oct 1;28(19):2534-6. doi: 10.1093/bioinformatics/bts485. Epub 2012 Aug 3.
4
Meta-analytic framework for liquid association.液体关联的荟萃分析框架。
Bioinformatics. 2017 Jul 15;33(14):2140-2147. doi: 10.1093/bioinformatics/btx138.
5
Bias correction for selecting the minimal-error classifier from many machine learning models.从众多机器学习模型中选择最小错误分类器的偏差校正。
Bioinformatics. 2014 Nov 15;30(22):3152-8. doi: 10.1093/bioinformatics/btu520. Epub 2014 Aug 1.
6
Meta-analysis for pathway enrichment analysis when combining multiple genomic studies.多组学研究整合的通路富集分析的元分析
Bioinformatics. 2010 May 15;26(10):1316-23. doi: 10.1093/bioinformatics/btq148. Epub 2010 Apr 21.
7
MetaDCN: meta-analysis framework for differential co-expression network detection with an application in breast cancer.MetaDCN:用于差异共表达网络检测的荟萃分析框架及其在乳腺癌中的应用
Bioinformatics. 2017 Apr 15;33(8):1121-1129. doi: 10.1093/bioinformatics/btw788.
8
Exploring high-dimensional biological data with sparse contrastive principal component analysis.稀疏对比主成分分析在高维生物学数据中的应用。
Bioinformatics. 2020 Jun 1;36(11):3422-3430. doi: 10.1093/bioinformatics/btaa176.
9
A novel bi-level meta-analysis approach: applied to biological pathway analysis.一种新型的双层次荟萃分析方法:应用于生物通路分析。
Bioinformatics. 2016 Feb 1;32(3):409-16. doi: 10.1093/bioinformatics/btv588. Epub 2015 Oct 14.
10
iSFun: an R package for integrative dimension reduction analysis.iSFun:一个用于整合维度缩减分析的 R 包。
Bioinformatics. 2022 May 26;38(11):3134-3135. doi: 10.1093/bioinformatics/btac281.

引用本文的文献

1
Comparative study of the effects of baicalin and probenecid on microRNA expression profiles in porcine aortic vascular endothelial cells infected by Glaesserella parasuis.黄芩苷与丙磺舒对副猪嗜血杆菌感染的猪主动脉血管内皮细胞中微小RNA表达谱影响的比较研究
BMC Vet Res. 2025 Apr 2;21(1):237. doi: 10.1186/s12917-025-04702-2.
2
Identification of a C2H2 zinc finger-related lncRNA prognostic signature and its association with the immune microenvironment in clear cell renal cell carcinoma.C2H2锌指相关长链非编码RNA预后特征的鉴定及其与透明细胞肾细胞癌免疫微环境的关系
Transl Androl Urol. 2025 Feb 28;14(2):412-431. doi: 10.21037/tau-2024-769. Epub 2025 Feb 25.
3
Application of machine learning for mass spectrometry-based multi-omics in thyroid diseases.机器学习在基于质谱的甲状腺疾病多组学中的应用。
Front Mol Biosci. 2024 Dec 17;11:1483326. doi: 10.3389/fmolb.2024.1483326. eCollection 2024.
4
Construction and validation of immunogenic cell death-related molecular clusters, signature, and immune landscape in pancreatic cancer.胰腺癌中免疫原性细胞死亡相关分子簇、特征及免疫格局的构建与验证
Clin Exp Med. 2024 Dec 21;25(1):19. doi: 10.1007/s10238-024-01533-7.
5
Principal component analysis revisited: fast multitrait genetic evaluations with smooth convergence.重新审视主成分分析:具有平滑收敛性的快速多性状遗传评估
G3 (Bethesda). 2024 Oct 21;14(12). doi: 10.1093/g3journal/jkae228.
6
Batch-effect correction in single-cell RNA sequencing data using JIVE.使用JIVE对单细胞RNA测序数据进行批次效应校正。
Bioinform Adv. 2024 Sep 13;4(1):vbae134. doi: 10.1093/bioadv/vbae134. eCollection 2024.
7
Review: Cancer and neurodevelopmental disorders: multi-scale reasoning and computational guide.综述:癌症与神经发育障碍:多尺度推理与计算指南。
Front Cell Dev Biol. 2024 Jul 2;12:1376639. doi: 10.3389/fcell.2024.1376639. eCollection 2024.
8
Identification and validation of a pyroptosis-related prognostic model for colorectal cancer based on bulk and single-cell RNA sequencing data.基于批量和单细胞RNA测序数据的结直肠癌焦亡相关预后模型的鉴定与验证
World J Clin Oncol. 2024 Feb 24;15(2):329-355. doi: 10.5306/wjco.v15.i2.329.
9
Prediction of immune infiltration and prognosis for patients with cholangiocarcinoma based on a cuproptosis-related lncRNA signature.基于铜死亡相关lncRNA特征预测胆管癌患者的免疫浸润和预后
Heliyon. 2023 Dec 20;10(1):e22774. doi: 10.1016/j.heliyon.2023.e22774. eCollection 2024 Jan 15.
10
High-throughput sequencing reveals the change of TCR α chain CDR3 with Takayasu arteritis.高通量测序揭示 Takayasu 动脉炎中 TCRα 链 CDR3 的变化。
Immun Inflamm Dis. 2023 Dec;11(12):e1122. doi: 10.1002/iid3.1122.

本文引用的文献

1
Statistical Methods in Integrative Genomics.整合基因组学中的统计方法
Annu Rev Stat Appl. 2016 Jun;3:181-209. doi: 10.1146/annurev-statistics-041715-033506. Epub 2016 Apr 18.
2
Meta-analytic framework for sparse -means to identify disease subtypes in multiple transcriptomic studies.用于在多个转录组学研究中识别疾病亚型的稀疏均值荟萃分析框架。
J Am Stat Assoc. 2016;111(513):27-42. doi: 10.1080/01621459.2015.1086354. Epub 2016 May 5.
3
MetaKTSP: a meta-analytic top scoring pair method for robust cross-study validation of omics prediction analysis.MetaKTSP:一种用于组学预测分析的稳健跨研究验证的元分析最高得分对方法。
Bioinformatics. 2016 Jul 1;32(13):1966-73. doi: 10.1093/bioinformatics/btw115. Epub 2016 Mar 2.
4
The Scree Test For The Number Of Factors.因子数量的碎石检验
Multivariate Behav Res. 1966 Apr 1;1(2):245-76. doi: 10.1207/s15327906mbr0102_10.
5
Applying stability selection to consistently estimate sparse principal components in high-dimensional molecular data.应用稳定性选择方法在高维分子数据中一致估计稀疏主成分。
Bioinformatics. 2015 Aug 15;31(16):2683-90. doi: 10.1093/bioinformatics/btv197. Epub 2015 Apr 10.
6
Using high-throughput transcriptomic data for prognosis: a critical overview and perspectives.利用高通量转录组学数据进行预后评估:批判性综述与展望。
Cancer Res. 2014 Sep 1;74(17):4612-21. doi: 10.1158/0008-5472.CAN-13-3338.
7
The Cancer Genome Atlas Pan-Cancer analysis project.癌症基因组图谱泛癌分析项目。
Nat Genet. 2013 Oct;45(10):1113-20. doi: 10.1038/ng.2764.
8
JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.用于多数据类型综合分析的联合与个体变异解释(JIVE)
Ann Appl Stat. 2013 Mar 1;7(1):523-542. doi: 10.1214/12-AOAS597.
9
Detecting disease-associated genes with confounding variable adjustment and the impact on genomic meta-analysis: with application to major depressive disorder.调整混杂变量并检测疾病相关基因及其对基因组荟萃分析的影响:以重度抑郁症为例。
BMC Bioinformatics. 2012 Mar 29;13:52. doi: 10.1186/1471-2105-13-52.
10
Comprehensive literature review and statistical considerations for microarray meta-analysis.综合文献回顾和微阵列荟萃分析的统计考虑。
Nucleic Acids Res. 2012 May;40(9):3785-99. doi: 10.1093/nar/gkr1265. Epub 2012 Jan 19.