Kersey Paul, Apweiler Rolf
EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
Nat Cell Biol. 2006 Nov;8(11):1183-9. doi: 10.1038/ncb1495.
The computational reconstruction of biological systems, 'systems biology', is necessarily dependent on the existence of well-annotated data sets defining and describing the components of these systems, especially genes and the proteins they encode. Information about these components can be accessed either through structured bioinformatics databases, which store basic chemical and functional information abstracted from (or supplementing) the scientific literature, or through the literature itself, which is richer in content but essentially unstructured.