Nucleic Acids Res. 2019 Jan 8;47(D1):D520-D528. doi: 10.1093/nar/gky949.
The Protein Data Bank (PDB) is the single global archive of experimentally determined three-dimensional (3D) structure data of biological macromolecules. Since 2003, the PDB has been managed by the Worldwide Protein Data Bank (wwPDB; wwpdb.org), an international consortium that collaboratively oversees deposition, validation, biocuration, and open access dissemination of 3D macromolecular structure data. The PDB Core Archive houses 3D atomic coordinates of more than 144 000 structural models of proteins, DNA/RNA, and their complexes with metals and small molecules and related experimental data and metadata. Structure and experimental data/metadata are also stored in the PDB Core Archive using the readily extensible wwPDB PDBx/mmCIF master data format, which will continue to evolve as data/metadata from new experimental techniques and structure determination methods are incorporated by the wwPDB. Impacts of the recently developed universal wwPDB OneDep deposition/validation/biocuration system and various methods-specific wwPDB Validation Task Forces on improving the quality of structures and data housed in the PDB Core Archive are described together with current challenges and future plans.
蛋白质数据库 (PDB) 是生物大分子实验测定的三维 (3D) 结构数据的全球单一档案库。自 2003 年以来,PDB 一直由全球蛋白质数据库 (wwPDB; wwpdb.org) 管理,这是一个国际联盟,共同监督 3D 大分子结构数据的沉积、验证、生物注释和开放获取传播。PDB 核心档案库保存了超过 144000 个蛋白质、DNA/RNA 及其与金属和小分子复合物的三维原子坐标,以及相关的实验数据和元数据。结构和实验数据/元数据也使用可扩展的 wwPDB PDBx/mmCIF 主数据格式存储在 PDB 核心档案库中,该格式将随着新的实验技术和结构测定方法的数据/元数据的纳入而不断发展。本文描述了最近开发的通用 wwPDB OneDep 沉积/验证/生物注释系统和各种特定方法的 wwPDB 验证工作队对提高 PDB 核心档案库中存储的结构和数据质量的影响,以及当前的挑战和未来计划。