Reeves Gabrielle A, Talavera David, Thornton Janet M
EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
J R Soc Interface. 2009 Feb 6;6(31):129-47. doi: 10.1098/rsif.2008.0341.
Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it puts a special interest on novel annotation fields, such as that of phenotypes, and highlights the recent efforts focused on the integrating annotations.
近年来,基因组和蛋白质组数据的生成量大幅增加。这得益于当前生物学方法的改进、新实验技术的发展以及计算机作为支持工具的使用。如果这些原始数据不能得到妥善分析、注释、存储和展示,那么它们就毫无用处。因此,已经创建了大量资源来向更广泛的群体呈现这些数据。注释工具和数据库提供了传播这些数据并理解其生物学重要性的手段。本综述探讨了注释的各个方面:类型、方法和可用性。此外,它特别关注新颖的注释领域,如表型领域,并强调了近期专注于整合注释的努力。