DDBJ Center, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan.
Nucleic Acids Res. 2013 Jan;41(Database issue):D25-9. doi: 10.1093/nar/gks1152. Epub 2012 Nov 24.
The DNA data bank of Japan (DDBJ, http://www.ddbj.nig.ac.jp) maintains a primary nucleotide sequence database and provides analytical resources for biological information to researchers. This database content is exchanged with the US National Center for Biotechnology Information (NCBI) and the European Bioinformatics Institute (EBI) within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). Resources provided by the DDBJ include traditional nucleotide sequence data released in the form of 27 316 452 entries or 16 876 791 557 base pairs (as of June 2012), and raw reads of new generation sequencers in the sequence read archive (SRA). A Japanese researcher published his own genome sequence via DDBJ-SRA on 31 July 2012. To cope with the ongoing genomic data deluge, in March 2012, our computer previous system was totally replaced by a commodity cluster-based system that boasts 122.5 TFlops of CPU capacity and 5 PB of storage space. During this upgrade, it was considered crucial to replace and refactor substantial portions of the DDBJ software systems as well. As a result of the replacement process, which took more than 2 years to perform, we have achieved significant improvements in system performance.
日本 DNA 数据库(DDBJ,http://www.ddbj.nig.ac.jp)维护一个主要的核苷酸序列数据库,并为研究人员提供生物信息分析资源。该数据库内容与美国国家生物技术信息中心(NCBI)和欧洲生物信息学研究所(EBI)在国际核苷酸序列数据库合作组织(INSDC)的框架内进行交换。DDBJ 提供的资源包括以 27316452 项或 16876791557 个碱基对(截至 2012 年 6 月)的形式发布的传统核苷酸序列数据,以及序列读取档案(SRA)中新一代测序仪的原始读取数据。2012 年 7 月 31 日,一位日本研究人员通过 DDBJ-SRA 发布了他自己的基因组序列。为了应对不断增加的基因组数据,2012 年 3 月,我们的计算机前期系统完全被基于商用集群的系统所取代,该系统拥有 122.5 TFlops 的 CPU 容量和 5 PB 的存储空间。在这次升级过程中,我们认为替换和重构 DDBJ 软件系统的大量部分也是至关重要的。经过超过 2 年的替换过程,我们已经在系统性能方面取得了显著的改进。