Winsor Geoffrey L, Lo Raymond, Ho Sui Shannan J, Ung Korine S E, Huang Shaoshan, Cheng Dean, Ching Wai-Kay Ho, Hancock Robert E W, Brinkman Fiona S L
Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, B.C., Canada, V5A 1S6.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D338-43. doi: 10.1093/nar/gki047.
Using the Pseudomonas aeruginosa Genome Project as a test case, we have developed a database and submission system to facilitate a community-based approach to continually updated genome annotation (http://www.pseudomonas.com). Researchers submit proposed annotation updates through one of three web-based forms; each submission is reviewed and, if accepted, entered into both the database and a log file of updates, with author acknowledgement. In addition, a coordinator continually reviews the literature for suitable updates, as we have found such literature reviews to be the most efficient way to identify updates. Both the annotation database and the update-log database support Boolean searches, sorting of results, and download of either all data or search results as tab-delimited files. To complement this peer-reviewed genome annotation, we also provide a linked GBrowse view that displays alternate annotations. Additional tools and analyses, including PseudoCyc and knockout mutant information, are also integrated. We propose that this database system, with its focus on flexible queries of the data and on access to both peer-reviewed annotations and alternate annotation information, may be a suitable model for other genome projects wishing to use a continually updated, community-based annotation approach. The source code is freely available under the GNU General Public Licence.