Leinonen Rasko, Sugawara Hideaki, Shumway Martin
European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
Nucleic Acids Res. 2011 Jan;39(Database issue):D19-21. doi: 10.1093/nar/gkq1019. Epub 2010 Nov 9.
The combination of significantly lower cost and increased speed of sequencing has resulted in an explosive growth of data submitted into the primary next-generation sequence data archive, the Sequence Read Archive (SRA). The preservation of experimental data is an important part of the scientific record, and increasing numbers of journals and funding agencies require that next-generation sequence data are deposited into the SRA. The SRA was established as a public repository for the next-generation sequence data and is operated by the International Nucleotide Sequence Database Collaboration (INSDC). INSDC partners include the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI) and the DNA Data Bank of Japan (DDBJ). The SRA is accessible at http://www.ncbi.nlm.nih.gov/Traces/sra from NCBI, at http://www.ebi.ac.uk/ena from EBI and at http://trace.ddbj.nig.ac.jp from DDBJ. In this article, we present the content and structure of the SRA, detail our support for sequencing platforms and provide recommended data submission levels and formats. We also briefly outline our response to the challenge of data growth.