用于从微阵列数据中识别差异表达的统计检验的调查与比较研究

A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data.

作者信息

Bandyopadhyay Sanghamitra, Mallik Saurav, Mukhopadhyay Anirban

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2014 Jan-Feb;11(1):95-115. doi: 10.1109/TCBB.2013.147.

DOI:10.1109/TCBB.2013.147

Abstract

DNA microarray is a powerful technology that can simultaneously determine the levels of thousands of transcripts (generated, for example, from genes/miRNAs) across different experimental conditions or tissue samples. The motto of differential expression analysis is to identify the transcripts whose expressions change significantly across different types of samples or experimental conditions. A number of statistical testing methods are available for this purpose. In this paper, we provide a comprehensive survey on different parametric and non-parametric testing methodologies for identifying differential expression from microarray data sets. The performances of the different testing methods have been compared based on some real-life miRNA and mRNA expression data sets. For validating the resulting differentially expressed miRNAs, the outcomes of each test are checked with the information available for miRNA in the standard miRNA database PhenomiR 2.0. Subsequently, we have prepared different simulated data sets of different sample sizes (from 10 to 100 per group/population) and thereafter the power of each test have been calculated individually. The comparative simulated study might lead to formulate robust and comprehensive judgements about the performance of each test in the basis of assumption of data distribution. Finally, a list of advantages and limitations of the different statistical tests has been provided, along with indications of some areas where further studies are required.

摘要

DNA微阵列是一项强大的技术，它能够同时测定在不同实验条件或组织样本中数千种转录本（例如由基因/微小RNA产生的转录本）的水平。差异表达分析的主旨是识别那些在不同类型样本或实验条件下表达发生显著变化的转录本。有多种统计检验方法可用于此目的。在本文中，我们对用于从微阵列数据集中识别差异表达的不同参数和非参数检验方法进行了全面综述。基于一些实际的微小RNA和信使核糖核酸表达数据集，对不同检验方法的性能进行了比较。为了验证所得的差异表达微小RNA，将每个检验的结果与标准微小RNA数据库PhenomiR 2.0中微小RNA的可用信息进行核对。随后，我们准备了不同样本量（每组/总体从10到100）的不同模拟数据集，然后分别计算每个检验的功效。该比较模拟研究可能会在数据分布假设的基础上，对每个检验的性能形成稳健而全面的判断。最后，给出了不同统计检验的优缺点列表，以及一些需要进一步研究的领域的指示。

相似文献

A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data.

IEEE/ACM Trans Comput Biol Bioinform. 2014 Jan-Feb;11(1):95-115. doi: 10.1109/TCBB.2013.147.

Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.

BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.

Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays.

BMC Bioinformatics. 2004 Apr 20;5:42. doi: 10.1186/1471-2105-5-42.

The effects of normalization on the correlation structure of microarray data.

BMC Bioinformatics. 2005 May 16;6:120. doi: 10.1186/1471-2105-6-120.

Ranking analysis for identifying differentially expressed genes.

Genomics. 2011 May;97(5):326-9. doi: 10.1016/j.ygeno.2011.03.002. Epub 2011 Mar 22.

In silico microdissection of microarray data from heterogeneous cell populations.

BMC Bioinformatics. 2005 Mar 14;6:54. doi: 10.1186/1471-2105-6-54.

Nonparametric tests for differential gene expression and interaction effects in multi-factorial microarray experiments.

BMC Bioinformatics. 2005 Jul 21;6:186. doi: 10.1186/1471-2105-6-186.

A non-transformation method for identifying differentially expressed genes from cDNA microarrays.

Yi Chuan Xue Bao. 2006 Jan;33(1):80-8. doi: 10.1016/S0379-4172(06)60012-7.

Comparison of small n statistical tests of differential expression applied to microarrays.

BMC Bioinformatics. 2009 Feb 3;10:45. doi: 10.1186/1471-2105-10-45.

Normality of oligonucleotide microarray data and implications for parametric statistical analyses.

Bioinformatics. 2003 Nov 22;19(17):2254-62. doi: 10.1093/bioinformatics/btg311.

引用本文的文献

Deciphering differences in DNA methylation and transcriptome profiles of oocytes from pigs with high and low developmental competence.

Environ Epigenet. 2025 Jun 3;11(1):dvaf018. doi: 10.1093/eep/dvaf018. eCollection 2025.

Optimal ranking and directional signature classification using the integral strategy of multi-objective optimization-based association rule mining of multi-omics data.

Front Bioinform. 2023 Jul 27;3:1182176. doi: 10.3389/fbinf.2023.1182176. eCollection 2023.

3PNMF-MKL: A non-negative matrix factorization-based multiple kernel learning method for multi-modal data integration and its application to gene signature detection.

Front Genet. 2023 Feb 14;14:1095330. doi: 10.3389/fgene.2023.1095330. eCollection 2023.

Breast cancer detection: Shallow convolutional neural network against deep convolutional neural networks based approach.

Front Genet. 2023 Jan 4;13:1097207. doi: 10.3389/fgene.2022.1097207. eCollection 2022.

A Seven-Autophagy-Related Long Non-Coding RNA Signature Can Accurately Predict the Prognosis of Patients with Renal Cell Carcinoma.

Int J Gen Med. 2022 Nov 10;15:8143-8157. doi: 10.2147/IJGM.S381027. eCollection 2022.

Hsa_circ_0040809 and hsa_circ_0000467 promote colorectal cancer cells progression and construction of a circRNA-miRNA-mRNA network.

Front Genet. 2022 Oct 20;13:993727. doi: 10.3389/fgene.2022.993727. eCollection 2022.

DNA methylation loci identification for pan-cancer early-stage diagnosis and prognosis using a new distributed parallel partial least squares method.

Front Genet. 2022 Oct 19;13:940214. doi: 10.3389/fgene.2022.940214. eCollection 2022.

Designing optimal convolutional neural network architecture using differential evolution algorithm.

Patterns (N Y). 2022 Aug 24;3(9):100567. doi: 10.1016/j.patter.2022.100567. eCollection 2022 Sep 9.

Reprogramming barriers in bovine cells nuclear transfer revealed by single-cell RNA-seq analysis.

J Cell Mol Med. 2022 Sep;26(18):4792-4804. doi: 10.1111/jcmm.17505. Epub 2022 Aug 15.

Novel Epigenetic Clock Biomarkers of Age-Related Macular Degeneration.

Front Med (Lausanne). 2022 Jun 16;9:856853. doi: 10.3389/fmed.2022.856853. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于从微阵列数据中识别差异表达的统计检验的调查与比较研究

A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献