Abe Takashi, Ikemura Toshimichi, Ohara Yasuo, Uehara Hiroshi, Kinouchi Makoto, Kanaya Shigehiko, Yamada Yuko, Muto Akira, Inokuchi Hachiro
Nagahama Institute of Bio-Science and Technology, Nagahama, Shiga, Japan.
Nucleic Acids Res. 2009 Jan;37(Database issue):D163-8. doi: 10.1093/nar/gkn692. Epub 2008 Oct 8.
We constructed a new large-scale database of tRNA genes by analyzing 534 complete genomes of prokaryotes and 394 draft genomes in WGS (Whole Genome Shotgun) division in DDBJ/EMBL/GenBank and approximately 6.2 million DNA fragment sequences obtained from metagenomic analyses. This exhaustive search for tRNA genes was performed by running three computer programs to enhance completeness and accuracy of the prediction. Discordances of assignment among three programs were found for approximately 4% of the total of tRNA gene candidates obtained from these prokaryote genomes analyzed. The discordant cases were manually checked by experts in the tRNA experimental field. In total, 144,061 tRNA genes were registered in the database 'tRNADB-CE', and the number of the genes was more than four times of that of the genes previously reported by the database from analyses of complete genomes with tRNAscan-SE program. The tRNADB-CE allows for browsing sequence information, cloverleaf structures and results of similarity searches among all tRNA genes. For each of the complete genomes, the number of tRNA genes for individual anticodons and the codon usage frequency in all protein genes and the positioning of individual tRNA genes in each genome can be browsed. tRNADB-CE can be accessed freely at http://trna.nagahama-i-bio.ac.jp.
我们通过分析DDBJ/EMBL/GenBank的WGS(全基因组鸟枪法测序)部门中的534个原核生物完整基因组和394个草图基因组以及从宏基因组分析中获得的约620万个DNA片段序列,构建了一个新的大规模tRNA基因数据库。通过运行三个计算机程序进行了对tRNA基因的详尽搜索,以提高预测的完整性和准确性。在对这些原核生物基因组分析得到的tRNA基因候选序列总数中,约4%的序列在三个程序之间存在分配不一致的情况。这些不一致的情况由tRNA实验领域的专家进行了人工检查。总共144,061个tRNA基因被登记在数据库“tRNADB-CE”中,该基因数量是之前使用tRNAscan-SE程序对完整基因组分析的数据库所报告基因数量的四倍多。tRNADB-CE允许浏览所有tRNA基因的序列信息、三叶草结构和相似性搜索结果。对于每个完整基因组,可以浏览各个反密码子的tRNA基因数量、所有蛋白质基因中的密码子使用频率以及每个基因组中各个tRNA基因的定位。可通过http://trna.nagahama-i-bio.ac.jp免费访问tRNADB-CE。