Zhou Xianghong, Kao Ming-Chih J, Wong Wing Hung
Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA.
Proc Natl Acad Sci U S A. 2002 Oct 1;99(20):12783-8. doi: 10.1073/pnas.192159399. Epub 2002 Aug 26.
Current methods for the functional analysis of microarray gene expression data implicitly assume that genes with similar expression profiles have similar functions in cells. However, among genes involved in the same biological pathway, not all gene pairs show high expression similarity. Here, we propose that transitive expression similarity among genes can be used as an important attribute to link genes of the same biological pathway. Using large-scale yeast microarray expression data, we apply shortest-path analysis to identify transitive genes between two given genes from the same biological process. We find that this approach identifies not only functionally related genes with correlated expression profiles but also functionally related genes whose profiles are uncorrelated. In the latter case, we compare our method with hierarchical clustering and show that it reveals functional relationships among genes more precisely. Finally, we show that our method can reliably predict the function of unknown genes from known genes lying on the same shortest path. We assigned functions to 146 yeast genes that are considered unknown by the Saccharomyces Genome Database and the Yeast Proteome Database. These genes constitute around 5% of the unknown yeast ORFome.
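The idea of transitive expression similarity can be illustrated with a minimal sketch. This is not the authors' implementation: the correlation threshold (`min_abs_corr=0.6`), the edge weight (`1 - |r|`), and the function names `build_graph` and `shortest_path` are all assumptions made for illustration. Genes become nodes, sufficiently correlated gene pairs become edges, and Dijkstra's algorithm finds the lowest-cost chain linking two genes.

```python
import heapq
import numpy as np


def build_graph(expr, genes, min_abs_corr=0.6):
    """Link genes whose expression profiles are sufficiently correlated.

    expr: 2-D array, one row per gene, one column per condition.
    The threshold and the weight 1 - |r| are illustrative assumptions.
    """
    corr = np.corrcoef(expr)  # Pearson correlation between rows
    graph = {g: {} for g in genes}
    for i in range(len(genes)):
        for j in range(i + 1, len(genes)):
            r = abs(corr[i, j])
            if r >= min_abs_corr:
                # Stronger correlation means a shorter edge; clamp at 0.
                weight = max(0.0, 1.0 - r)
                graph[genes[i]][genes[j]] = weight
                graph[genes[j]][genes[i]] = weight
    return graph


def shortest_path(graph, source, target):
    """Dijkstra's algorithm; returns the list of genes or None."""
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue
        done.add(u)
        if u == target:
            break
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if target not in dist:
        return None  # the two genes are not connected
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return path[::-1]
```

In this sketch, the genes strictly between the two endpoints of the returned path play the role of transitive genes: two endpoints may be uncorrelated with each other yet still be linked through a chain of correlated intermediates. An unannotated gene that appears on such a path between two known genes from the same process would then be a candidate for that function.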