Zhou Carol L Ecale, Lam Marisa W, Smith Jason R, Zemla Adam T, Dyer Matthew D, Kuczmarski Thomas A, Vitalis Elizabeth A, Slezak Thomas R
Lawrence Livermore National Laboratory, Pathogen Bio-informatics, Livermore, CA, USA.
BMC Bioinformatics. 2006 Oct 17;7:459. doi: 10.1186/1471-2105-7-459.
MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data.
MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO.
MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports. Access to MannDB is freely available at http://manndb.llnl.gov/.
创建MannDB是为了满足对快速、全面的自动化蛋白质序列分析的需求,以支持选择适合作为开发病原体或蛋白质毒素检测试剂靶点的蛋白质。由于需要大量开源工具,因此有必要开发一个软件系统来扩展全蛋白质组分析的计算规模。因此,我们构建了一个全自动系统,用于执行软件工具以及存储、整合和显示自动化蛋白质序列分析及注释数据。
MannDB是一个关系数据库,它组织使用开源工具进行的全自动、高通量蛋白质序列分析所产生的数据。提供的分析类型包括切割预测、化学性质、分类、特征、功能分配、翻译后修饰、基序、抗原性和二级结构预测。蛋白质组(假设蛋白质和已知蛋白质列表)从Genbank下载并解析,然后插入MannDB,当在Genbank条目中找到标识符或识别出相同序列时,下载来自SwissProt的注释。目前,针对MannDB蛋白质序列,在本地系统上或通过批量提交到外部服务器运行36个开源工具。此外,还会对我们的微生物毒力因子数据库MvirDB中的蛋白质条目进行BLAST搜索。一个网络客户端浏览器可用于查看计算结果和下载的注释,一个查询工具提供结构化和自由文本搜索功能。如有可用链接,会提供到包括MvirDB在内的外部数据库的链接。MannDB包含对美国动植物卫生检验局(APHIS)、疾病控制与预防中心(CDC)、美国卫生与公众服务部(HHS)、美国国立过敏和传染病研究所(NIAID)、美国农业部(USDA)、美国食品药品监督管理局(USFDA)和世界卫生组织(WHO)列出的每类生物威胁生物体中至少一种代表性生物体的全蛋白质组分析。
MannDB包含大量基因组以及对几个关注生物恐怖主义的政府组织网站上列为高优先级病原体的生物体的全面蛋白质序列分析。MannDB为用户提供了一个用于比较天然和非天然序列的BLAST接口以及一个方便选择感兴趣蛋白质的查询工具。此外,用户可以使用基于网络的浏览器来编译全面而详尽的报告。可通过http://manndb.llnl.gov/免费访问MannDB。