Yamanishi Y, Vert J-P, Nakaya A, Kanehisa M
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan.
Bioinformatics. 2003;19 Suppl 1:i323-30. doi: 10.1093/bioinformatics/btg1045.
A major issue in computational biology is the reconstruction of pathways from several genomic datasets, such as expression data, protein interaction data and phylogenetic profiles. As a first step toward this goal, it is important to investigate the amount of correlation which exists between these data.
These methods are successfully tested on their ability to recognize operons in the Escherichia coli genome, from the comparison of three datasets corresponding to functional relationships between genes in metabolic pathways, geometrical relationships along the chromosome, and co-expression relationships as observed by gene expression data.
计算生物学中的一个主要问题是从多个基因组数据集中重建通路,例如表达数据、蛋白质相互作用数据和系统发育谱。作为朝着这个目标迈出的第一步,研究这些数据之间存在的相关性程度很重要。
通过比较对应于代谢途径中基因之间功能关系、染色体上几何关系以及基因表达数据所观察到的共表达关系的三个数据集,对这些方法识别大肠杆菌基因组中操纵子的能力进行了成功测试。