Greller L D, Tobin F L
Bioinformatics-Mathematical Biology, SmithKline Beecham Pharmaceuticals Research & Development, King of Prussia, Pennsylvania 19406
Genome Res. 1999 Mar;9(3):282-96.
Selective expression of a gene product (mRNA or protein) is a pattern in which the expression is markedly high, or markedly low, in one particular tissue compared with its level in other tissues or sources. We present a computational method for the identification of such patterns. The method combines assessments of the reliability of expression quantitation with a statistical test of expression distribution patterns. The method is applicable to small studies or to data mining of abundance data from expression databases, whether mRNA or protein. Though the method was developed originally for gene-expression analyses, the computational method is, in fact, rather general. It is well suited for the identification of exceptional values in many sorts of intensity data, even noisy data, for which assessments of confidences in the sources of the intensities are available. Moreover, the method is indifferent as to whether the intensities are experimentally or computationally derived. We show details of the general method and examples of computational results on gene abundance data.
基因产物(mRNA或蛋白质)的选择性表达是一种模式,即在一个特定组织中,其表达水平相较于其他组织或来源显著高或显著低。我们提出了一种用于识别此类模式的计算方法。该方法将表达定量的可靠性评估与表达分布模式的统计检验相结合。该方法适用于小型研究或对来自表达数据库的丰度数据(无论是mRNA还是蛋白质)进行数据挖掘。尽管该方法最初是为基因表达分析而开发的,但实际上这种计算方法相当通用。它非常适合识别多种强度数据(甚至是有噪声的数据)中的异常值,前提是可以获得强度来源的置信度评估。此外,该方法对于强度是通过实验得出还是通过计算得出并不关心。我们展示了该通用方法的详细信息以及基因丰度数据的计算结果示例。