Suppr超能文献

蛋白质数据库(PDB):五十三岁正青春,对科学与社会产生变革性影响。

Protein Data Bank (PDB): Fifty-three years young and having a transformative impact on science and society.

作者信息

Berman Helen M, Burley Stephen K

机构信息

Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.

Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.

出版信息

Q Rev Biophys. 2025 Feb 20;58:e9. doi: 10.1017/S0033583525000034.

Abstract

This review article describes the co-evolution of structural biology as a discipline and the Protein Data Bank (PDB), established in 1971 as the first open-access data resource in biology by like-minded structural scientists. As the PDB archive grew in size and scope to encompass macromolecular crystallography, NMR spectroscopy, and cryo-electron microscopy, new technologies were developed to ingest, validate, curate, store, and distribute the information. Community engagement ensured that the needs of structural biologists (data depositors) and data consumers were met. Today, the archive houses more than 230,000 experimentally determined structures of proteins, nucleic acids, and macromolecular machines and their complexes with one another and small-molecule ligands. Aggregate costs of PDB data preservation are ~1% of the cost of structure determination. The enormous impact of PDB data on basic and applied research and education across the natural and medical sciences is presented and highlighted with illustrative examples. Enablement of protein structure prediction (AlphaFold2, RoseTTAfold, OpenFold, ) is the most widely appreciated benefit of having a corpus of rigorously validated, expertly curated 3D biostructure data.

摘要

这篇综述文章描述了结构生物学作为一门学科与蛋白质数据库(PDB)的共同发展历程。蛋白质数据库于1971年由志同道合的结构科学家建立,是生物学领域首个开放获取的数据资源。随着PDB档案库在规模和范围上不断扩大,涵盖了大分子晶体学、核磁共振光谱学和冷冻电子显微镜技术,人们开发了新技术来摄取、验证、整理、存储和分发这些信息。社区参与确保了结构生物学家(数据存入者)和数据使用者的需求得到满足。如今,该档案库收录了超过23万种通过实验确定的蛋白质、核酸、大分子机器及其相互之间以及与小分子配体的复合物的结构。PDB数据保存的总费用约为结构测定成本的1%。文中通过实例展示并强调了PDB数据对自然科学和医学领域基础研究、应用研究及教育产生的巨大影响。能够进行蛋白质结构预测(AlphaFold2、RoseTTAfold、OpenFold等)是拥有一批经过严格验证、专业整理的3D生物结构数据所带来的最广受认可的益处。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验