Wall M E, Dyck P A, Brettin T S
Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
Bioinformatics. 2001 Jun;17(6):566-8. doi: 10.1093/bioinformatics/17.6.566.
We have developed two novel methods for Singular Value Decomposition analysis (SVD) of microarray data. The first is a threshold-based method for obtaining gene groups, and the second is a method for obtaining a measure of confidence in SVD analysis. Gene groups are obtained by identifying elements of the left singular vectors, or gene coefficient vectors, that are greater in magnitude than the threshold W N(-1/2), where N is the number of genes, and W is a weight factor whose default value is 3. The groups are non-exclusive and may contain genes of opposite (i.e. inversely correlated) regulatory response. The confidence measure is obtained by systematically deleting assays from the data set, interpolating the SVD of the reduced data set to reconstruct the missing assay, and calculating the Pearson correlation between the reconstructed assay and the original data. This confidence measure is applicable when each experimental assay corresponds to a value of parameter that can be interpolated, such as time, dose or concentration. Algorithms for the grouping method and the confidence measure are available in a software application called SVD Microarray ANalysis (SVDMAN). In addition to calculating the SVD for generic analysis, SVDMAN provides a new means for using microarray data to develop hypotheses for gene associations and provides a measure of confidence in the hypotheses, thus extending current SVD research in the area of global gene expression analysis.
我们开发了两种用于微阵列数据奇异值分解分析(SVD)的新方法。第一种是基于阈值获取基因组的方法,第二种是获取SVD分析置信度的方法。通过识别左奇异向量或基因系数向量中绝对值大于阈值WN^(-1/2)的元素来获取基因组,其中N是基因数量,W是权重因子,其默认值为3。这些组并非相互排斥,可能包含具有相反(即负相关)调控反应的基因。通过系统地从数据集中删除分析、对简化数据集的SVD进行插值以重建缺失的分析,并计算重建分析与原始数据之间的皮尔逊相关性来获得置信度。当每个实验分析对应于一个可以插值的参数值(如时间、剂量或浓度)时,这种置信度是适用的。分组方法和置信度的算法可在一个名为SVD微阵列分析(SVDMAN)的软件应用程序中获得。除了计算用于一般分析的SVD外,SVDMAN还提供了一种利用微阵列数据来建立基因关联假设的新方法,并提供了对这些假设的置信度度量,从而扩展了当前在全局基因表达分析领域的SVD研究。