微阵列实验中检测差异表达基因的荟萃分析方法比较。

A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments.

作者信息

Hong Fangxin, Breitling Rainer

机构信息

Department of Biostatistics, Division of Information Sciences, City of Hope National Medical Center, Beckman Research Institute, 1500 Duarte Rd, Duarte, CA 91010, USA.

出版信息

Bioinformatics. 2008 Feb 1;24(3):374-82. doi: 10.1093/bioinformatics/btm620. Epub 2008 Jan 18.

DOI:10.1093/bioinformatics/btm620

PMID:18204063

Abstract

MOTIVATION

The proliferation of public data repositories creates a need for meta-analysis methods to efficiently evaluate, integrate and validate related datasets produced by independent groups. A t-based approach has been proposed to integrate effect size from multiple studies by modeling both intra- and between-study variation. Recently, a non-parametric 'rank product' method, which is derived based on biological reasoning of fold-change criteria, has been applied to directly combine multiple datasets into one meta study. Fisher's Inverse chi(2) method, which only depends on P-values from individual analyses of each dataset, has been used in a couple of medical studies. While these methods address the question from different angles, it is not clear how they compare with each other.

RESULTS

We comparatively evaluate the three methods; t-based hierarchical modeling, rank products and Fisher's Inverse chi(2) test with P-values from either the t-based or the rank product method. A simulation study shows that the rank product method, in general, has higher sensitivity and selectivity than the t-based method in both individual and meta-analysis, especially in the setting of small sample size and/or large between-study variation. Not surprisingly, Fisher's chi(2) method highly depends on the method used in the individual analysis. Application to real datasets demonstrates that meta-analysis achieves more reliable identification than an individual analysis, and rank products are more robust in gene ranking, which leads to a much higher reproducibility among independent studies. Though t-based meta-analysis greatly improves over the individual analysis, it suffers from a potentially large amount of false positives when P-values serve as threshold. We conclude that careful meta-analysis is a powerful tool for integrating multiple array studies.

摘要

动机

公共数据存储库的激增使得需要元分析方法来有效评估、整合和验证由独立研究小组产生的相关数据集。已经提出了一种基于t检验的方法，通过对研究内和研究间的变异进行建模来整合来自多项研究的效应量。最近，一种基于倍数变化标准的生物学推理推导出来的非参数“秩乘积”方法已被应用于直接将多个数据集合并为一项元研究。费舍尔逆卡方方法仅依赖于每个数据集单独分析得到的P值，已在一些医学研究中使用。虽然这些方法从不同角度解决了问题，但它们之间如何相互比较尚不清楚。

结果

我们对三种方法进行了比较评估；基于t检验的层次模型、秩乘积法以及使用基于t检验或秩乘积法得到的P值的费舍尔逆卡方检验。一项模拟研究表明，一般来说，秩乘积法在个体分析和元分析中都比基于t检验的方法具有更高的灵敏度和选择性，尤其是在小样本量和/或研究间变异较大的情况下。不出所料，费舍尔卡方方法高度依赖于个体分析中使用的方法。对实际数据集的应用表明，元分析比个体分析能实现更可靠的识别，并且秩乘积法在基因排序中更稳健，这导致独立研究之间具有更高的可重复性。尽管基于t检验的元分析比个体分析有很大改进，但当以P值作为阈值时，它会受到大量潜在假阳性的影响。我们得出结论，仔细的元分析是整合多项阵列研究的有力工具。

相似文献

A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments.微阵列实验中检测差异表达基因的荟萃分析方法比较。

Bioinformatics. 2008 Feb 1;24(3):374-82. doi: 10.1093/bioinformatics/btm620. Epub 2008 Jan 18.

Group testing for pathway analysis improves comparability of different microarray datasets.用于通路分析的分组检验可提高不同微阵列数据集的可比性。

Bioinformatics. 2006 Oct 15;22(20):2500-6. doi: 10.1093/bioinformatics/btl424. Epub 2006 Aug 7.

Moderated effect size and P-value combinations for microarray meta-analyses.基于微阵列荟萃分析的调节效应量和 P 值组合。

Bioinformatics. 2009 Oct 15;25(20):2692-9. doi: 10.1093/bioinformatics/btp444. Epub 2009 Jul 23.

RankProd: a bioconductor package for detecting differentially expressed genes in meta-analysis.RankProd：一个用于在荟萃分析中检测差异表达基因的生物导体软件包。

Bioinformatics. 2006 Nov 15;22(22):2825-7. doi: 10.1093/bioinformatics/btl476. Epub 2006 Sep 18.

Meta-analysis based on control of false discovery rate: combining yeast ChIP-chip datasets.基于错误发现率控制的Meta分析：整合酵母染色质免疫沉淀芯片数据集

Bioinformatics. 2006 Oct 15;22(20):2516-22. doi: 10.1093/bioinformatics/btl439. Epub 2006 Aug 14.

Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。

BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.

Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays.利用双向探针水平块设计通过高密度寡核苷酸阵列鉴定差异基因表达。

BMC Bioinformatics. 2004 Apr 20;5:42. doi: 10.1186/1471-2105-5-42.

Empirical Bayes screening of many p-values with applications to microarray studies.用于微阵列研究的多p值经验贝叶斯筛选。

Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.

A meta-data based method for DNA microarray imputation.一种基于元数据的DNA微阵列插补方法。

BMC Bioinformatics. 2007 Mar 29;8:109. doi: 10.1186/1471-2105-8-109.

Sample size calculations based on ranking and selection in microarray experiments.基于微阵列实验中排序与选择的样本量计算。

Biometrics. 2008 Mar;64(1):217-26. doi: 10.1111/j.1541-0420.2007.00875.x. Epub 2007 Aug 3.

引用本文的文献

Incidences of Laryngospasm Using a Laryngeal Mask Airway or Endotracheal Tube in Paediatric Adenotonsillectomy: A Systematic Review.在小儿腺样体扁桃体切除术中使用喉罩气道或气管内导管时喉痉挛的发生率：一项系统评价

J Clin Med. 2025 May 12;14(10):3369. doi: 10.3390/jcm14103369.

A computational framework for extracting biological insights from SRA cancer data.一种用于从SRA癌症数据中提取生物学见解的计算框架。

Sci Rep. 2025 Mar 8;15(1):8117. doi: 10.1038/s41598-025-91781-8.

Integrative network analysis suggests prioritised drugs for atopic dermatitis.整合网络分析提示优先考虑用于特应性皮炎的药物。

J Transl Med. 2024 Jan 16;22(1):64. doi: 10.1186/s12967-024-04879-4.

Using meta-analysis and machine learning to investigate the transcriptional response of immune cells to Leishmania infection.利用荟萃分析和机器学习研究免疫细胞对利什曼原虫感染的转录反应。

PLoS Negl Trop Dis. 2024 Jan 8;18(1):e0011892. doi: 10.1371/journal.pntd.0011892. eCollection 2024 Jan.

Meta-analysis of transcriptome reveals key genes relating to oil quality in olive.基于转录组的荟萃分析揭示了与橄榄油品质相关的关键基因。

BMC Genomics. 2023 Sep 22;24(1):566. doi: 10.1186/s12864-023-09673-y.

vissE.cloud: a webserver to visualise higher order molecular phenotypes from enrichment analysis.vissE.cloud：一个可视化富集分析中更高阶分子表型的网络服务器。

Nucleic Acids Res. 2023 Jul 5;51(W1):W593-W600. doi: 10.1093/nar/gkad337.

An ancestral molecular response to nanomaterial particulates.纳米颗粒的祖先分子反应。

Nat Nanotechnol. 2023 Aug;18(8):957-966. doi: 10.1038/s41565-023-01393-4. Epub 2023 May 8.

Integrative systems biology analysis of barley transcriptome ─ hormonal signaling against biotic stress.大麦转录组的综合系统生物学分析 ─ 激素信号转导对抗生物胁迫。

PLoS One. 2023 Apr 27;18(4):e0281470. doi: 10.1371/journal.pone.0281470. eCollection 2023.

Integrating Tumor-Intrinsic and Immunologic Factors to Identify Immunogenic Breast Cancers from a Low-Risk Cohort: Results from the Randomized SweBCG91RT Trial.整合肿瘤内在和免疫因素，从低危队列中鉴定出免疫原性乳腺癌：来自随机 SweBCG91RT 试验的结果。

Clin Cancer Res. 2023 May 1;29(9):1783-1793. doi: 10.1158/1078-0432.CCR-22-2746.

Recent developments and future directions in meta-analysis of differential gene expression in livestock RNA-Seq.家畜RNA测序中差异基因表达的荟萃分析的最新进展与未来方向

Front Genet. 2022 Sep 19;13:983043. doi: 10.3389/fgene.2022.983043. eCollection 2022.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

微阵列实验中检测差异表达基因的荟萃分析方法比较。

A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

动机

结果

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献