Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA.
Sci Rep. 2013;3:1695. doi: 10.1038/srep01695.
The difficulty in annotating the vast amounts of biological information poses one of the greatest current challenges in biological research. The number of genomic, proteomic, and metabolomic datasets has increased dramatically over the last two decades, far outstripping the pace of curation efforts. Here, we tackle the challenge of curating metabolic network reconstructions. We predict organismal metabolic networks using sequence homology and a global metabolic network constructed from all available organismal networks. While sequence homology has been a standard to annotate metabolic networks it has been faulted for its lack of predictive power. We show, however, that when homology is used with a global metabolic network one is able to predict organismal metabolic networks that have enhanced network connectivity. Additionally, we compare the annotation behavior of current database curation efforts with our predictions and find that curation efforts are biased towards adding (rather than removing) reactions to organismal networks.
注释大量的生物信息具有很大的难度,这是当前生物研究面临的最大挑战之一。在过去的二十年中,基因组、蛋白质组和代谢组数据集的数量急剧增加,远远超过了编辑工作的速度。在这里,我们解决编辑代谢网络重建的挑战。我们使用序列同源性和从所有可用的生物体网络构建的全局代谢网络来预测生物体的代谢网络。虽然序列同源性一直是注释代谢网络的标准,但它因缺乏预测能力而受到批评。然而,我们表明,当将同源性与全局代谢网络一起使用时,人们能够预测具有增强网络连通性的生物体代谢网络。此外,我们将当前数据库编辑工作的注释行为与我们的预测进行了比较,发现编辑工作偏向于向生物体网络添加(而不是删除)反应。