National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA.
Nucleic Acids Res. 2010 Jan;38(Database issue):D46-51. doi: 10.1093/nar/gkp1024. Epub 2009 Nov 12.
GenBank is a comprehensive database that contains publicly available nucleotide sequences for more than 300,000 organisms named at the genus level or lower, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the European Molecular Biology Laboratory Nucleotide Sequence Database in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bi-monthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI homepage: www.ncbi.nlm.nih.gov.
GenBank 是一个综合数据库,包含超过 30 万个生物体的公共核苷酸序列,这些生物体的命名级别在属或更低,主要通过来自个别实验室的提交和来自大规模测序项目的批量提交获得,包括全基因组鸟枪法(WGS)和环境采样项目。大多数提交都是使用基于网络的 BankIt 或独立的 Sequin 程序进行的,并且在收到后由 GenBank 工作人员分配访问号。与欧洲分子生物学实验室核苷酸序列数据库在欧洲和日本 DNA 数据库的日常数据交换确保了全球覆盖范围。GenBank 可通过 NCBI Entrez 检索系统访问,该系统整合了来自主要 DNA 和蛋白质序列数据库的信息,以及分类学、基因组、图谱、蛋白质结构和域信息,以及通过 PubMed 访问生物医学期刊文献。BLAST 提供 GenBank 和其他序列数据库的序列相似性搜索。完整的双月发布和每日更新的 GenBank 数据库可通过 FTP 访问。要访问 GenBank 及其相关检索和分析服务,请从 NCBI 主页开始:www.ncbi.nlm.nih.gov。