Ben-Shaul Yoram, Bergman Hagai, Soreq Hermona
Department of Biological Chemistry, The Life Sciences Institute Jerusalem, 91904, Israel.
Bioinformatics. 2005 Apr 1;21(7):1129-37. doi: 10.1093/bioinformatics/bti149. Epub 2004 Nov 18.
Analysis of large-scale expression data is greatly facilitated by the availability of gene ontologies (GOs). Many current methods test whether sets of transcripts annotated with specific ontology terms contain an excess of 'changed' transcripts. This approach suffers from two main limitations. First, since gene expression is continuous rather than discrete, designating a gene as changed or unchanged is arbitrary and oblivious to the actual magnitude of the change. Second, by considering only the number of changed genes, finer changes in expression patterns associated with the category may be ignored. Since genes generally participate in multiple networks, widespread and subtle modifications in expression patterns are at least as important as extreme increases/decreases of a few genes.
Numerical simulations confirm that incorporating continuous measures of gene expression for all measured transcripts yields detection of considerably more subtle changes. Applying continuous measures to microarray data from brains of mice injected with the Parkinsonian neurotoxin, MPTP, enables detection of changes in various biologically relevant GO terms, many of which are overlooked by discrete approaches.
基因本体论(GO)的可用性极大地促进了大规模表达数据的分析。当前许多方法测试用特定本体术语注释的转录本集合是否包含过量的“变化”转录本。这种方法存在两个主要局限性。首先,由于基因表达是连续的而非离散的,将一个基因指定为变化或未变化是任意的,并且忽略了变化的实际幅度。其次,仅考虑变化基因的数量,可能会忽略与该类别相关的表达模式中更细微的变化。由于基因通常参与多个网络,表达模式中广泛而细微的修饰至少与少数基因的极端增加/减少同样重要。
数值模拟证实,对所有测量的转录本纳入基因表达的连续测量可检测到更为细微的变化。将连续测量应用于注射帕金森神经毒素MPTP的小鼠大脑的微阵列数据,能够检测各种生物学相关GO术语中的变化,其中许多变化被离散方法所忽略。