Fold-change estimation of differentially expressed genes using mixture mixed-model.

Author information

Gusnanto Arief, Ploner Alexander, Pawitan Yudi

Affiliation

Medical Research Council-Biostatistics Unit, Institute of Public Health, Cambridge CB2 2SR, United Kingdom.

Publication information

Stat Appl Genet Mol Biol. 2005;4:Article26. doi: 10.2202/1544-6115.1145. Epub 2005 Sep 21.

DOI:10.2202/1544-6115.1145

PMID:16646844

Abstract

Microarray experiments produce expression measurements for thousands of genes simultaneously, though usually for a small number of RNA samples. The most common problem is the identification of genes that are differentially expressed between different groups of samples or biological conditions. As the number of genes far exceeds the number of RNA samples, the inherent multiplicity poses a severe problem in both hypothesis testing and effect estimation. While much of the recent literature is focused on the hypothesis aspects, we concentrate in this paper on effect estimation as a tool for the identification of differentially expressed genes. We propose a linear mixed model where the random effects are assumed to follow a mixture distribution, and study in detail the case of three normals, corresponding to genes that are down-, up- or non regulated. Our approach leads to a new type of non-linear shrinkage estimation, where a proportion of estimates is shrunk to zero, while the rest follows standard linear shrinkage. This allows us to estimate the log fold-change of the genes involved and to identify those that are differentially expressed within the same model framework. We investigate the operating characteristics of our method using simulation and spike-in studies, and illustrate its application to real data using a breast-cancer dataset.
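The non-linear shrinkage described in the abstract can be illustrated with a minimal sketch. This is not the authors' estimation procedure: it assumes that all mixture parameters are already known, that each gene contributes a single observed log fold-change `d` with known sampling variance `s2`, and that the non-regulated component is a normal centered at zero with a small variance. The function names and the numeric values in `WEIGHTS`, `MEANS`, `VARIANCES`, and `S2` are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical three-component prior for the true log fold-change b:
# (down-regulated, non-regulated, up-regulated). Illustrative values only.
WEIGHTS = (0.05, 0.90, 0.05)
MEANS = (-1.5, 0.0, 1.5)
VARIANCES = (0.5, 0.01, 0.5)
S2 = 0.1  # assumed known sampling variance of the observed value d


def normal_pdf(x, mean, var):
    """Density of N(mean, var) evaluated at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def mixture_shrinkage(d, s2, weights, means, variances):
    """Posterior mean of b given d ~ N(b, s2) and a normal-mixture prior on b.

    Returns the shrunken estimate and the posterior component probabilities.
    """
    # Marginally, d follows N(means[k], variances[k] + s2) under component k.
    marginal = [w * normal_pdf(d, m, v + s2)
                for w, m, v in zip(weights, means, variances)]
    total = sum(marginal)
    post_prob = [x / total for x in marginal]
    # Within each component, the posterior mean is a linear shrinkage of d
    # toward that component's mean.
    comp_mean = [m + v / (v + s2) * (d - m) for m, v in zip(means, variances)]
    estimate = sum(p * c for p, c in zip(post_prob, comp_mean))
    return estimate, post_prob


if __name__ == "__main__":
    for d in (0.2, 1.0, 3.0):
        est, probs = mixture_shrinkage(d, S2, WEIGHTS, MEANS, VARIANCES)
        print(f"d={d:.1f}  estimate={est:.3f}  P(non-regulated)={probs[1]:.3f}")
```

Under these assumed values, small observed values are pulled almost entirely to zero because the non-regulated component dominates, while large values are assigned to a regulated component and shrunk only linearly toward that component's mean. The posterior component probabilities could also serve as a rough indicator of which genes are differentially expressed.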

Similar articles

Fold-change estimation of differentially expressed genes using mixture mixed-model.

Stat Appl Genet Mol Biol. 2005;4:Article26. doi: 10.2202/1544-6115.1145. Epub 2005 Sep 21.

Variance component estimation for mixed model analysis of cDNA microarray data.

Biom J. 2008 Dec;50(6):927-39. doi: 10.1002/bimj.200810476.

Powers of multiple-testing procedures for identification of genes significantly differentially expressed in microarray experiments.

Yi Chuan Xue Bao. 2006 Dec;33(12):1132-40. doi: 10.1016/S0379-4172(06)60152-2.

Identification of differentially expressed genes by meta-analysis of microarray data on breast cancer.

In Silico Biol. 2008;8(5-6):383-411.

A Bayesian approach to estimation and testing in time-course microarray experiments.

Stat Appl Genet Mol Biol. 2007;6:Article24. doi: 10.2202/1544-6115.1299. Epub 2007 Sep 16.

Selection of differentially expressed genes in microarray data analysis.

Pharmacogenomics J. 2007 Jun;7(3):212-20. doi: 10.1038/sj.tpj.6500412. Epub 2006 Aug 29.

Linear models and empirical Bayes methods for assessing differential expression in microarray experiments.

Stat Appl Genet Mol Biol. 2004;3:Article3. doi: 10.2202/1544-6115.1027. Epub 2004 Feb 12.

EVE (external variance estimation) increases statistical power for detecting differentially expressed genes.

Plant J. 2007 Nov;52(3):561-9. doi: 10.1111/j.1365-313X.2007.03227.x. Epub 2007 Aug 3.

[Identification of the differentially expressed genes between primary breast cancer and paired lymph node metastasis through combining mRNA differential display and gene microarray].

Zhonghua Yi Xue Za Zhi. 2006 Oct 24;86(39):2749-55.

A statistical method for estimating the proportion of differentially expressed genes.

Comput Biol Chem. 2006 Jun;30(3):193-202. doi: 10.1016/j.compbiolchem.2006.03.001. Epub 2006 May 2.

Cited by

Characterization of regulatory pathways in Xylella fastidiosa: genes and phenotypes controlled by gacA.

Appl Environ Microbiol. 2009 Apr;75(8):2275-83. doi: 10.1128/AEM.01964-08. Epub 2009 Feb 13.

Bayesian mixture model analysis for detecting differentially expressed genes.

Int J Plant Genomics. 2008;2008:892927. doi: 10.1155/2008/892927.

Characterization of regulatory pathways in Xylella fastidiosa: genes and phenotypes controlled by algU.

Appl Environ Microbiol. 2007 Nov;73(21):6748-56. doi: 10.1128/AEM.01232-07. Epub 2007 Sep 7.

Detecting multiple associations in genome-wide studies.

Hum Genomics. 2006 Mar;2(5):310-7. doi: 10.1186/1479-7364-2-5-310.
