A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments.

Authors

Zhou Yiyong, Cras-Méneur Corentin, Ohsugi Mitsuru, Stormo Gary D, Permutt M Alan

Affiliation

Division of Endocrinology, Metabolism and Lipid Research, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA.

Publication

Bioinformatics. 2007 Aug 15;23(16):2073-9. doi: 10.1093/bioinformatics/btm292. Epub 2007 Jun 5.

Abstract

MOTIVATION

Most current methods for identifying differentially expressed genes fall into the category of so-called single-gene analysis: they perform hypothesis testing on a gene-by-gene basis. Such an approach requires an estimate of each gene's variability to decide whether that gene is differentially expressed. Because these estimates are often inaccurate, genes with small fold-changes are difficult to identify unless a very large number of replicate experiments is performed.

RESULTS

We propose a method that avoids the difficult task of estimating variability for each gene while reliably identifying a group of differentially expressed genes with low false discovery rates, even when the fold-changes are very small. In this article, we establish a new characterization of differentially expressed genes based on a theorem about the distribution of ranks of genes sorted by (log) ratios within each array. This rank-based characterization is an example of all-gene analysis rather than single-gene analysis. We apply the method to a cDNA microarray dataset and reliably identify many genes with low fold-changes (as low as 1.3-fold) without carrying out hypothesis testing on a gene-by-gene basis. The false discovery rate is estimated in two different ways, both reflecting the variability of all the genes, without the complications related to multiple hypothesis testing. We also provide some comparisons between our approach and methods based on single-gene analysis.
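The abstract does not state the theorem or the two false discovery rate estimators, so the following is only a rough sketch of the general idea of ranking genes by log ratio within each array and then looking at all genes together. It assumes a deliberately simple null model that is not taken from the paper: for a gene that is not differentially expressed, the within-array rank is uniform and independent across replicate arrays. The function names, the `top_k` cutoff, and the FDR formula are all hypothetical illustrations.

```python
import numpy as np


def within_array_ranks(log_ratios):
    """Rank genes within each array (column); rank 1 is the largest log ratio.

    log_ratios: 2-D array of shape (n_genes, n_arrays).
    """
    n_genes, n_arrays = log_ratios.shape
    order = np.argsort(-log_ratios, axis=0)  # gene indices, descending per array
    ranks = np.empty_like(order)
    for j in range(n_arrays):
        ranks[order[:, j], j] = np.arange(1, n_genes + 1)
    return ranks


def consistently_top_ranked(log_ratios, top_k):
    """Select genes ranked within the top_k of every array.

    Returns (selected gene indices, expected false positives, estimated FDR).
    ASSUMPTION (not from the paper): under the null, a gene lands in the
    top_k of one array with probability top_k / n_genes, independently
    across arrays.
    """
    n_genes, n_arrays = log_ratios.shape
    ranks = within_array_ranks(log_ratios)
    selected = np.flatnonzero((ranks <= top_k).all(axis=1))
    expected_false = n_genes * (top_k / n_genes) ** n_arrays
    if len(selected) == 0:
        return selected, expected_false, 0.0
    return selected, expected_false, min(1.0, expected_false / len(selected))
```

Under these assumptions no per-gene variance is ever estimated: the only inputs are the ranks and the number of replicate arrays, which is why a gene with a small but consistent fold-change can still stand out.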

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
