UMR203 BF2I, Biologie Fonctionnelle Insectes et Interactions, INRA, INSA-Lyon, Université de Lyon, 20 av. A. Einstein, F-69621 Villeurbanne, France.
Database (Oxford). 2011 Apr 7;2011:bar008. doi: 10.1093/database/bar008. Print 2011.
In recent years, genomes from an increasing number of organisms have been sequenced, but their annotation remains a time-consuming process. The BioCyc databases offer a framework for the integrated analysis of metabolic networks. The Pathway Tools software suite allows the automated construction of a database starting from an annotated genome, but it requires that all annotations first be integrated into a specific summary file or into a GenBank file. To allow the easy creation and update of a BioCyc database from the multiple genome annotation resources that become available over time, we have developed an ad hoc data management system called the Cyc Annotation Database System (CycADS). CycADS is centred on a specific database model and on a set of Java programs that import, filter and export relevant information. Data from GenBank and other annotation sources (for example KAAS, PRIAM, Blast2GO and PhylomeDB) are collected into a database, then filtered and extracted to generate a complete annotation file. This file is then used to build an enriched BioCyc database with the PathoLogic program of Pathway Tools. The CycADS pipeline for annotation management was used to build the AcypiCyc database for the pea aphid (Acyrthosiphon pisum), whose genome was recently sequenced. For comparative analyses, the AcypiCyc database webpage also includes two other metabolic reconstruction BioCyc databases generated using CycADS: TricaCyc for Tribolium castaneum and DromeCyc for Drosophila melanogaster. Owing to its flexible design, CycADS offers a powerful software tool for the generation and regular updating of enriched BioCyc databases. The CycADS system is particularly suited to metabolic gene annotation and network reconstruction in newly sequenced genomes. Because the same annotation procedure is applied to every metabolic network reconstruction, CycADS is particularly useful for comparative analysis of the metabolism of different organisms.
Database URL: http://www.cycadsys.org.
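The collect, filter and export steps described in the abstract can be illustrated with a minimal sketch. It is not the CycADS implementation (which is a set of Java programs built around a database model); it is written in Python for brevity, and the gene identifiers, scores, score threshold, function names and the simplified output layout (an ID line, EC lines, and a `//` record terminator, loosely modeled on an attribute-value annotation file) are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical annotation records: (gene_id, source, ec_number, score).
# Gene IDs, scores and the 0-1 score scale are invented for illustration.
ANNOTATIONS = [
    ("GENE0001", "KAAS", "2.7.1.1", 0.95),
    ("GENE0001", "PRIAM", "2.7.1.1", 0.80),
    ("GENE0002", "PRIAM", "1.1.1.1", 0.30),
    ("GENE0002", "KAAS", "1.1.1.1", 0.90),
    ("GENE0003", "PRIAM", "3.1.3.9", 0.20),
]


def collect(records):
    """Group EC assignments by gene, keeping the supporting sources and scores."""
    by_gene = defaultdict(lambda: defaultdict(list))
    for gene_id, source, ec, score in records:
        by_gene[gene_id][ec].append((source, score))
    return by_gene


def filter_ecs(by_gene, min_score=0.5):
    """Keep an EC number if at least one source scores it at or above min_score."""
    kept = {}
    for gene_id, ecs in by_gene.items():
        good = sorted(
            ec for ec, hits in ecs.items()
            if any(score >= min_score for _, score in hits)
        )
        if good:
            kept[gene_id] = good
    return kept


def export(kept):
    """Write one record per gene: an ID line, its EC lines, then a '//' terminator."""
    lines = []
    for gene_id in sorted(kept):
        lines.append(f"ID\t{gene_id}")
        for ec in kept[gene_id]:
            lines.append(f"EC\t{ec}")
        lines.append("//")
    return "\n".join(lines) + "\n"


annotation_file = export(filter_ecs(collect(ANNOTATIONS)))
print(annotation_file, end="")
```

In this sketch a gene whose only EC assignment falls below the threshold (the hypothetical `GENE0003`) is dropped from the exported file, while assignments supported by several sources are merged into a single record. Changing the filter and re-running the export is what makes regular regeneration of the annotation file cheap once the data sit in one place.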