Transformation and normalization of oligonucleotide microarray data.

Authors

Geller Sue C, Gregg Jeff P, Hagerman Paul, Rocke David M

Affiliation

Department of Mathematics, Texas A&M University, College Station, TX 77843-3368, USA.

Publication information

Bioinformatics. 2003 Sep 22;19(14):1817-23. doi: 10.1093/bioinformatics/btg245.

DOI:10.1093/bioinformatics/btg245

PMID:14512353

Abstract

MOTIVATION

Most methods for analyzing microarray data or performing power calculations assume constant variance across all levels of gene expression. The most common transformation, the logarithm, yields data with constant variance at high expression levels but not at low levels. Rocke and Durbin showed that data from spotted arrays fit a two-component model, and Durbin, Hardin, Hawkins and Rocke; Huber et al.; and Munson provided a transformation that stabilizes the variance and also symmetrizes and normalizes the error structure. We wish to evaluate the applicability of this transformation to the error structure of GeneChip microarrays.
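The abstract does not state the model or the transformation explicitly. As a minimal sketch, assuming the commonly cited form of the two-component model, y = alpha + mu * exp(eta) + eps with normal errors eta and eps, and the generalized logarithm h(y) = ln(z + sqrt(z^2 + c)) with z = y - alpha and c = sd_eps^2 / S_eta^2, the following Python simulation illustrates why the transformed data have roughly constant spread at low and high expression. The function names and all parameter values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math
import random
import statistics


def glog(y, alpha, c):
    """Generalized log, ln(z + sqrt(z^2 + c)) with z = y - alpha.

    Behaves like ln(2z) for large z and is roughly linear near z = 0.
    """
    z = y - alpha
    return math.log(z + math.sqrt(z * z + c))


def glog_constant(sd_eta, sd_eps):
    """Constant c = sd_eps^2 / S_eta^2 under the assumed model.

    S_eta^2 = exp(sd_eta^2) * (exp(sd_eta^2) - 1) is the variance of exp(eta).
    """
    s_eta_sq = math.exp(sd_eta ** 2) * (math.exp(sd_eta ** 2) - 1.0)
    return sd_eps ** 2 / s_eta_sq


def simulate(mu, alpha, sd_eta, sd_eps, n, rng):
    """Draw n intensities from y = alpha + mu * exp(eta) + eps."""
    return [
        alpha + mu * math.exp(rng.gauss(0.0, sd_eta)) + rng.gauss(0.0, sd_eps)
        for _ in range(n)
    ]


def sd_by_level(levels, alpha, sd_eta, sd_eps, n=5000, seed=1):
    """Standard deviation of glog-transformed intensities at each true level mu."""
    rng = random.Random(seed)
    c = glog_constant(sd_eta, sd_eps)
    result = []
    for mu in levels:
        transformed = [glog(y, alpha, c) for y in simulate(mu, alpha, sd_eta, sd_eps, n, rng)]
        result.append(statistics.stdev(transformed))
    return result


if __name__ == "__main__":
    # Hypothetical parameters: background 50, about 20% multiplicative error,
    # additive error with standard deviation 20.
    for mu, sd in zip([0.0, 100.0, 10000.0], sd_by_level([0.0, 100.0, 10000.0], 50.0, 0.2, 20.0)):
        print(f"mu={mu:>8.0f}  sd of glog(y) = {sd:.3f}")
```

Under these assumptions a plain logarithm would be undefined or highly variable near mu = 0, where y - alpha can be negative, while the generalized logarithm stays defined for every intensity.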

RESULTS

We demonstrate in an example study a simple way to use the two-component model of Rocke and Durbin and the data transformation of Durbin, Hardin, Hawkins and Rocke; Huber et al.; and Munson on Affymetrix GeneChip data. In addition, we provide a method for normalizing Affymetrix GeneChips simultaneously with the determination of the transformation, producing a data set without chip or slide effects but with constant variance and symmetric errors. This transformation/normalization process can be thought of as a machine calibration, in that it requires a few biologically constant replicates of one sample to determine the constant needed to specify the transformation and to normalize. It is hypothesized that this constant needs to be found only once for a given technology in a lab, perhaps with periodic updates. It does not require extensive replication in each study. Furthermore, the variance of the transformed pilot data can be used for power calculations with standard power analysis programs.
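To illustrate the last point, here is a minimal sketch of a power calculation driven by the standard deviation of transformed pilot data. It uses the textbook normal approximation for a two-sided, two-sample comparison, n = 2 * ((z_(1 - alpha/2) + z_power) * sd / delta)^2 per group. This is a generic formula, not necessarily what the authors or any particular power analysis program use, and the function name and example numbers are hypothetical.

```python
from math import ceil
from statistics import NormalDist


def samples_per_group(sd, delta, alpha=0.05, power=0.8):
    """Approximate replicates per group for a two-sided two-sample z-test.

    sd    : standard deviation of the transformed data (assumed constant)
    delta : difference to detect, on the transformed scale
    """
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1.0 - alpha / 2.0)
    z_power = std_normal.inv_cdf(power)
    return ceil(2.0 * ((z_alpha + z_power) * sd / delta) ** 2)


if __name__ == "__main__":
    # Hypothetical pilot value: sd = 0.5 on the transformed scale, and a
    # difference of 0.5 on that scale to be detected with 80% power.
    print(samples_per_group(sd=0.5, delta=0.5))
```

Because the normal approximation is optimistic for very small group sizes, a t-based calculation from a standard power program would be the safer choice in practice. The sketch only shows that a single variance estimate suffices once the variance no longer depends on expression level.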

AVAILABILITY

SPLUS code for the transformation/normalization for four replicates is available from the first author upon request. A program written in C is available from the last author.

Similar articles

Transformation and normalization of oligonucleotide microarray data.

Bioinformatics. 2003 Sep 22;19(14):1817-23. doi: 10.1093/bioinformatics/btg245.

Estimation of transformation parameters for microarray data.

Bioinformatics. 2003 Jul 22;19(11):1360-7. doi: 10.1093/bioinformatics/btg178.

A variance-stabilizing transformation for gene-expression microarray data.

Bioinformatics. 2002;18 Suppl 1:S105-10. doi: 10.1093/bioinformatics/18.suppl_1.s105.

Normalization for Affymetrix GeneChips.

Methods Inf Med. 2005;44(3):414-7.

Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model.

Bioinformatics. 2002 Dec;18(12):1633-40. doi: 10.1093/bioinformatics/18.12.1633.

An expression index for Affymetrix GeneChips based on the generalized logarithm.

Bioinformatics. 2005 Nov 1;21(21):3983-9. doi: 10.1093/bioinformatics/bti665. Epub 2005 Sep 13.

Normalization of microarray data using a spatial mixed model analysis which includes splines.

Bioinformatics. 2004 Nov 22;20(17):3196-205. doi: 10.1093/bioinformatics/bth384. Epub 2004 Jul 1.

Selection and validation of normalization methods for c-DNA microarrays using within-array replications.

Bioinformatics. 2007 Sep 15;23(18):2391-8. doi: 10.1093/bioinformatics/btm361. Epub 2007 Jul 27.

New normalization methods for cDNA microarray data.

Bioinformatics. 2003 Jul 22;19(11):1325-32. doi: 10.1093/bioinformatics/btg146.

A generalized likelihood ratio test to identify differentially expressed genes from microarray data.

Bioinformatics. 2004 Jan 1;20(1):100-4. doi: 10.1093/bioinformatics/btg384.

Cited by

Screening for interaction effects in gene expression data.

PLoS One. 2017 Mar 16;12(3):e0173847. doi: 10.1371/journal.pone.0173847. eCollection 2017.

Quality Visualization of Microarray Datasets Using Circos.

Microarrays (Basel). 2012 Aug 7;1(2):84-94. doi: 10.3390/microarrays1020084.

Genome wide identification of aberrant alternative splicing events in myotonic dystrophy type 2.

PLoS One. 2014 Apr 10;9(4):e93983. doi: 10.1371/journal.pone.0093983. eCollection 2014.

Effect of normalization on statistical and biological interpretation of gene expression profiles.

Front Genet. 2013 May 31;3:160. doi: 10.3389/fgene.2012.00160. eCollection 2012.

A glance at DNA microarray technology and applications.

Bioimpacts. 2011;1(2):75-86. doi: 10.5681/bi.2011.011. Epub 2011 Aug 4.

Motif effects in Affymetrix GeneChips seriously affect probe intensities.

Nucleic Acids Res. 2012 Oct;40(19):9705-16. doi: 10.1093/nar/gks717. Epub 2012 Aug 16.

Analyzing multiple-probe microarray: estimation and application of gene expression indexes.

Biometrics. 2012 Sep;68(3):784-92. doi: 10.1111/j.1541-0420.2012.01727.x. Epub 2012 Jul 26.

Normalized Affymetrix expression data are biased by G-quadruplex formation.

Nucleic Acids Res. 2012 Apr;40(8):3307-15. doi: 10.1093/nar/gkr1230. Epub 2011 Dec 22.

Expectations, validity, and reality in gene expression profiling.

J Clin Epidemiol. 2010 Sep;63(9):950-9. doi: 10.1016/j.jclinepi.2010.02.018. Epub 2010 Jun 25.

Predicting novel human gene ontology annotations using semantic analysis.

IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):91-9. doi: 10.1109/TCBB.2008.29.
