Menda Naama, Buels Robert M, Tecle Isaak, Mueller Lukas A
Department of Plant Breeding and Genetics, and Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, New York 14853, USA.
Plant Physiol. 2008 Aug;147(4):1788-99. doi: 10.1104/pp.108.119560. Epub 2008 Jun 6.
The amount of biological data available in the public domain is growing exponentially, and there is an increasing need for infrastructural and human resources to organize, store, and present the data in a proper context. Model organism databases (MODs) invest great effort in having in-house curators functionally annotate genomes and phenomes. The SOL Genomics Network (SGN; http://www.sgn.cornell.edu) is a clade-oriented database (COD), which provides a more scalable and comparative framework for biological information. SGN has recently spearheaded a new approach by developing community annotation tools to expand its curation capacity. These tools allow some curation to be delegated to qualified researchers while preserving the in-house curators' full editorial control. Here we describe the background, features, implementation, results, and development road map of SGN's community annotation tools for curating genotypes and phenotypes. Since the inception of this project in late 2006, interest and participation from the Solanaceae research community have been strong and have grown continuously, to the extent that we plan to expand the framework to accommodate more plant taxa. All data, tools, and code developed at SGN are freely available to download and adapt.