Stobbe Miranda D, Swertz Morris A, Thiele Ines, Rengaw Trebor, van Kampen Antoine H C, Moerland Perry D
Bioinformatics Laboratory, Academic Medical Center, University of Amsterdam, PO Box 22700, Amsterdam 1100 DE, the Netherlands.
BMC Syst Biol. 2013 Jun 26;7:50. doi: 10.1186/1752-0509-7-50.
The metabolic network of H. sapiens and many other organisms is described in multiple pathway databases. The level of agreement between these descriptions, however, has proven to be low. We can use these different descriptions to our advantage by identifying conflicting information and combining their knowledge into a single, more accurate, and more complete description. This task is, however, far from trivial.
We introduce the concept of Consensus and Conflict Cards (C₂Cards) to provide concise overviews of what the databases do or do not agree on. Each card is centered at a single gene, EC number or reaction. These three complementary perspectives make it possible to distinguish disagreements on the underlying biology of a metabolic process from differences that can be explained by different decisions on how and in what detail to represent knowledge. As a proof-of-concept, we implemented C₂Cards(Human), as a web application http://www.molgenis.org/c2cards, covering five human pathway databases.
C₂Cards can contribute to ongoing reconciliation efforts by simplifying the identification of consensus and conflicts between pathway databases and lowering the threshold for experts to contribute. Several case studies illustrate the potential of the C₂Cards in identifying disagreements on the underlying biology of a metabolic process. The overviews may also point out controversial biological knowledge that should be subject of further research. Finally, the examples provided emphasize the importance of manual curation and the need for a broad community involvement.
智人和许多其他生物体的代谢网络在多个途径数据库中都有描述。然而,这些描述之间的一致程度已被证明很低。我们可以通过识别相互冲突的信息并将它们的知识整合为一个更准确、更完整的单一描述,来利用这些不同的描述。然而,这项任务绝非易事。
我们引入了共识与冲突卡片(C₂卡片)的概念,以简要概述数据库之间的共识与分歧。每张卡片都以单个基因、酶委员会编号或反应为中心。这三个互补的视角使得区分代谢过程基础生物学上的分歧与因表示知识的方式和详细程度的不同决策而产生的差异成为可能。作为概念验证,我们实现了C₂卡片(人类版),作为一个网络应用程序http://www.molgenis.org/c2cards,涵盖五个人类途径数据库。
C₂卡片可以通过简化途径数据库之间共识和冲突的识别,并降低专家参与的门槛,为正在进行的协调工作做出贡献。几个案例研究说明了C₂卡片在识别代谢过程基础生物学上的分歧方面的潜力。这些概述也可能指出有争议的生物学知识,这些知识应该成为进一步研究的主题。最后,所提供的例子强调了人工编目的重要性以及广泛的社区参与的必要性。