Klimke William, O'Donovan Claire, White Owen, Brister J Rodney, Clark Karen, Fedorov Boris, Mizrachi Ilene, Pruitt Kim D, Tatusova Tatiana
Stand Genomic Sci. 2011 Oct 15;5(1):168-93. doi: 10.4056/sigs.2084864. Epub 2011 Oct 1.
The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries.
基因组测序的前景是,通过比较大量可用序列来绘制这片广袤未知的领域,并帮助研究人员解读每个基因在每种生物体中的作用。研究人员认识到需要高质量的数据。然而,不同的注释程序、众多的数据库以及实验确定的基因功能所占比例不断下降,导致了注释质量的参差不齐。美国国立医学图书馆国家生物技术信息中心(NCBI)与测序中心、存档数据库和研究人员合作,制定了首个国际注释标准,这是确保高质量完整原核生物基因组作为黄金标准参考可用的关键一步。亮点包括注释评估工具的开发、蛋白质命名标准的社区认可、注释资源比较以提供一致注释,以及改进用于生成特定注释的证据跟踪。一套最低标准的制定,包括要求注释的完整原核生物基因组包含全套核糖体RNA、转运RNA以及编码核心保守功能的蛋白质,这是一个历史性的里程碑。在现有基因组和未来提交的序列中使用这些标准将提高数据库质量,使研究人员能够做出准确的生物学发现。