• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质数据库中的重复条目:如何检测与处理

Duplicate entries in the Protein Data Bank: how to detect and handle them.

作者信息

Wlodawer Alexander, Dauter Zbigniew, Rubach Pawel, Minor Wladek, Jaskolski Mariusz, Jiang Ziqiu, Jeffcott William, Anosova Olga, Kurlin Vitaliy

机构信息

Center for Structural Biology, Center for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA.

Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, USA.

出版信息

Acta Crystallogr D Struct Biol. 2025 Apr 1;81(Pt 4):170-180. doi: 10.1107/S2059798325001883. Epub 2025 Mar 8.

DOI:10.1107/S2059798325001883
PMID:40056147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11966240/
Abstract

A global analysis of protein crystal structures in the Protein Data Bank (PDB) using a newly developed computational approach reveals many pairs with (nearly) identical main-chain coordinates. Such cases are identified and analyzed, showing that duplication is possible since the PDB does not currently have tools or mechanisms that would detect potentially duplicate submissions. Some duplicated entries represent modeling efforts of ligand binding that masquerade as experimentally determined structures. We propose that duplicate entries should either be obsoleted by the PDB or, as a minimum, marked with a clear `CAVEAT' record that would alert potential users to the presence of such problems. We also suggest that using a tool for verifying the uniqueness of the deposited structure, such as that presented in this work, should become part of the routine validation procedure for new depositions.

摘要

使用一种新开发的计算方法对蛋白质数据库(PDB)中的蛋白质晶体结构进行全局分析,发现许多(几乎)具有相同主链坐标的配对。此类情况已被识别和分析,结果表明存在重复提交的可能性,因为PDB目前没有能够检测潜在重复提交的工具或机制。一些重复条目代表配体结合的建模成果,却伪装成实验确定的结构。我们建议PDB要么废弃重复条目,要么至少标记一条明确的“注意事项”记录,以提醒潜在用户存在此类问题。我们还建议,使用如本文所介绍的用于验证所提交结构唯一性的工具,应成为新提交条目的常规验证程序的一部分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f17e/11966240/90495a0cdf58/d-81-00170-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f17e/11966240/90495a0cdf58/d-81-00170-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f17e/11966240/90495a0cdf58/d-81-00170-fig1.jpg

相似文献

1
Duplicate entries in the Protein Data Bank: how to detect and handle them.蛋白质数据库中的重复条目:如何检测与处理
Acta Crystallogr D Struct Biol. 2025 Apr 1;81(Pt 4):170-180. doi: 10.1107/S2059798325001883. Epub 2025 Mar 8.
2
The Protein Data Bank Archive.蛋白质数据库档案。
Methods Mol Biol. 2021;2305:3-21. doi: 10.1007/978-1-0716-1406-8_1.
3
Protein Data Bank (PDB): The Single Global Macromolecular Structure Archive.蛋白质数据库(PDB):单一的全球大分子结构存档库。
Methods Mol Biol. 2017;1607:627-641. doi: 10.1007/978-1-4939-7000-1_26.
4
Implementing an X-ray validation pipeline for the Protein Data Bank.为蛋白质数据库实施一个X射线验证流程。
Acta Crystallogr D Biol Crystallogr. 2012 Apr;68(Pt 4):478-83. doi: 10.1107/S0907444911050359. Epub 2012 Mar 16.
5
Waterless structures in the Protein Data Bank.蛋白质数据库中的无水结构。
IUCrJ. 2024 Nov 1;11(Pt 6):966-976. doi: 10.1107/S2052252524009928.
6
Protein Data Bank depositions from synchrotron sources.来自同步辐射源的蛋白质数据库存档。
J Synchrotron Radiat. 2004 Jul 1;11(Pt 4):319-27. doi: 10.1107/S0909049504013792. Epub 2004 Jun 23.
7
Building a structured PDB: the RS-PDB database.构建结构化蛋白质数据银行:RS-蛋白质数据银行数据库
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:5755-8. doi: 10.1109/IEMBS.2006.259331.
8
Crystallographically correct but confusing presentation of structural models deposited in the Protein Data Bank.晶体学正确但结构模型在蛋白质数据库中呈现方式令人困惑。
Acta Crystallogr D Struct Biol. 2018 Sep 1;74(Pt 9):939-945. doi: 10.1107/S2059798318009828. Epub 2018 Sep 5.
9
Continuous mutual improvement of macromolecular structure models in the PDB and of X-ray crystallographic software: the dual role of deposited experimental data.蛋白质数据库(PDB)中大分子结构模型与X射线晶体学软件的持续相互改进:沉积实验数据的双重作用。
Acta Crystallogr D Biol Crystallogr. 2014 Oct;70(Pt 10):2533-43. doi: 10.1107/S1399004714017040. Epub 2014 Sep 30.
10
Integrative/Hybrid Methods Structural Biology: Role of Macromolecular Crystallography.综合/混合方法结构生物学:大分子晶体学的作用。
Adv Exp Med Biol. 2018;1105:11-18. doi: 10.1007/978-981-13-2200-6_2.

本文引用的文献

1
Controlling enzyme activity by mutagenesis and metal exchange to obtain crystal structures of stable substrate complexes of Class 3 l-asparaginase.通过诱变和金属交换控制酶活性以获得3类L-天冬酰胺酶稳定底物复合物的晶体结构。
FEBS J. 2025 Mar;292(5):1159-1173. doi: 10.1111/febs.17388. Epub 2025 Jan 3.
2
Everyone is using biological structures, but how does one find the structure(s) one wants?每个人都在使用生物结构,但如何找到自己想要的结构呢?
Acta Crystallogr D Struct Biol. 2024 Dec 1;80(Pt 12):819-820. doi: 10.1107/S2059798324007848. Epub 2024 Dec 5.
3
Waterless structures in the Protein Data Bank.
蛋白质数据库中的无水结构。
IUCrJ. 2024 Nov 1;11(Pt 6):966-976. doi: 10.1107/S2052252524009928.
4
Towards a dependable data set of structures for L-asparaginase research.致力于建立一个可靠的 L-天冬酰胺酶结构数据集。
Acta Crystallogr D Struct Biol. 2024 Jul 1;80(Pt 7):506-527. doi: 10.1107/S2059798324005461. Epub 2024 Jun 27.
5
The importance of definitions in crystallography.晶体学中定义的重要性。
IUCrJ. 2024 Jul 1;11(Pt 4):453-463. doi: 10.1107/S2052252524004056.
6
Solubilizer Tag Effect on PD-L1/Inhibitor Binding Properties for -Terphenyl Derivatives.增溶剂标签对 - 三联苯衍生物的PD - L1/抑制剂结合特性的影响。
ACS Med Chem Lett. 2023 Dec 14;15(1):36-44. doi: 10.1021/acsmedchemlett.3c00306. eCollection 2024 Jan 11.
7
Structural Insight into Polymerase Mechanism via a Chiral Center Generated with a Single Selenium Atom.通过单个硒原子生成的手性中心深入了解聚合酶机制。
Int J Mol Sci. 2023 Oct 30;24(21):15758. doi: 10.3390/ijms242115758.
8
Multitargeted 6-Substituted Thieno[2,3-]pyrimidines as Folate Receptor-Selective Anticancer Agents that Inhibit Cytosolic and Mitochondrial One-Carbon Metabolism.多靶点6-取代噻吩并[2,3 -]嘧啶作为叶酸受体选择性抗癌剂,可抑制胞质和线粒体一碳代谢。
ACS Pharmacol Transl Sci. 2023 Apr 26;6(5):748-770. doi: 10.1021/acsptsci.3c00020. eCollection 2023 May 12.
9
New Insights into Conformationally Restricted Carbonic Anhydrase Inhibitors.构象受限碳酸酐酶抑制剂的新见解。
Molecules. 2023 Jan 16;28(2):890. doi: 10.3390/molecules28020890.
10
Geographic style maps for two-dimensional lattices.二维格网的地理风格地图。
Acta Crystallogr A Found Adv. 2023 Jan 1;79(Pt 1):1-13. doi: 10.1107/S2053273322010075.