• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过跟随深度神经网络的提示,学习在蛋白质的三维结构中识别生理和偶然的金属结合位点。

Learning to Identify Physiological and Adventitious Metal-Binding Sites in the Three-Dimensional Structures of Proteins by Following the Hints of a Deep Neural Network.

机构信息

Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy.

Institute for Drug Discovery, Leipzig University, Brüderstr. 34, 04103 Leipzig, Germany.

出版信息

J Chem Inf Model. 2022 Jun 27;62(12):2951-2960. doi: 10.1021/acs.jcim.2c00522. Epub 2022 Jun 9.

DOI:10.1021/acs.jcim.2c00522
PMID:35679182
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9241070/
Abstract

Thirty-eight percent of protein structures in the Protein Data Bank contain at least one metal ion. However, not all these metal sites are biologically relevant. Cations present as impurities during sample preparation or in the crystallization buffer can cause the formation of protein-metal complexes that do not exist in vivo. We implemented a deep learning approach to build a classifier able to distinguish between physiological and adventitious zinc-binding sites in the 3D structures of metalloproteins. We trained the classifier using manually annotated sites extracted from the MetalPDB database. Using a 10-fold cross validation procedure, the classifier achieved an accuracy of about 90%. The same neural classifier could predict the physiological relevance of non-heme mononuclear iron sites with an accuracy of nearly 80%, suggesting that the rules learned on zinc sites have general relevance. By quantifying the relative importance of the features describing the input zinc sites from the network perspective and by analyzing the characteristics of the MetalPDB datasets, we inferred some common principles. Physiological sites present a low solvent accessibility of the aminoacids forming coordination bonds with the metal ion (the metal ligands), a relatively large number of residues in the metal environment (≥20), and a distinct pattern of conservation of Cys and His residues in the site. Adventitious sites, on the other hand, tend to have a low number of donor atoms from the polypeptide chain (often one or two). These observations support the evaluation of the physiological relevance of novel metal-binding sites in protein structures.

摘要

蛋白质数据库中 38%的蛋白质结构含有至少一个金属离子。然而,并非所有这些金属位点都具有生物学意义。在样品制备或结晶缓冲液中作为杂质存在的阳离子可能导致体内不存在的蛋白质-金属复合物的形成。我们采用深度学习方法构建了一个分类器,能够区分金属蛋白三维结构中生理和偶然锌结合位点。我们使用从 MetalPDB 数据库中提取的手动注释位点来训练分类器。通过 10 倍交叉验证程序,分类器的准确率约为 90%。相同的神经分类器可以预测非血红素单核铁位点的生理相关性,准确率接近 80%,表明在锌位点上学习的规则具有普遍意义。通过从网络角度量化描述输入锌位点的特征的相对重要性,并分析 MetalPDB 数据集的特征,我们推断出一些共同的原则。生理位点的金属离子配位键形成氨基酸(金属配体)的溶剂可及性较低,金属环境中的残基数相对较多(≥20),且位点中半胱氨酸和组氨酸残基的保守模式明显。另一方面,偶然位点往往来自多肽链的供体原子数量较少(通常为一个或两个)。这些观察结果支持在蛋白质结构中评估新的金属结合位点的生理相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/dc9d03dbf99c/ci2c00522_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/1ea1507d1aec/ci2c00522_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/ccf019306448/ci2c00522_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/6dbf3592dc05/ci2c00522_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/2ca8a702f02d/ci2c00522_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/dc9d03dbf99c/ci2c00522_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/1ea1507d1aec/ci2c00522_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/ccf019306448/ci2c00522_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/6dbf3592dc05/ci2c00522_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/2ca8a702f02d/ci2c00522_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0af5/9241070/dc9d03dbf99c/ci2c00522_0006.jpg

相似文献

1
Learning to Identify Physiological and Adventitious Metal-Binding Sites in the Three-Dimensional Structures of Proteins by Following the Hints of a Deep Neural Network.通过跟随深度神经网络的提示,学习在蛋白质的三维结构中识别生理和偶然的金属结合位点。
J Chem Inf Model. 2022 Jun 27;62(12):2951-2960. doi: 10.1021/acs.jcim.2c00522. Epub 2022 Jun 9.
2
Structural Bioinformatics and Deep Learning of Metalloproteins: Recent Advances and Applications.金属蛋白的结构生物信息学和深度学习:最新进展与应用。
Int J Mol Sci. 2022 Jul 12;23(14):7684. doi: 10.3390/ijms23147684.
3
MetalPDB in 2018: a database of metal sites in biological macromolecular structures.2018 年的 MetalPDB:生物大分子结构中金属位点数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D459-D464. doi: 10.1093/nar/gkx989.
4
MetalPDB: a database of metal sites in biological macromolecular structures.金属 PDB:生物大分子结构中金属位点数据库。
Nucleic Acids Res. 2013 Jan;41(Database issue):D312-9. doi: 10.1093/nar/gks1063. Epub 2012 Nov 15.
5
Development of METAL-ACTIVE SITE and ZINCCLUSTER tool to predict active site pockets.用于预测活性位点口袋的金属活性位点和锌簇工具的开发。
Proteins. 2018 Mar;86(3):322-331. doi: 10.1002/prot.25441. Epub 2018 Jan 22.
6
Metal3D: a general deep learning framework for accurate metal ion location prediction in proteins.Metal3D:一种用于准确预测蛋白质中金属离子位置的通用深度学习框架。
Nat Commun. 2023 May 11;14(1):2713. doi: 10.1038/s41467-023-37870-6.
7
MetalHawk: Enhanced Classification of Metal Coordination Geometries by Artificial Neural Networks.MetalHawk:基于人工神经网络的金属配位几何结构的增强分类。
J Chem Inf Model. 2024 Apr 8;64(7):2356-2367. doi: 10.1021/acs.jcim.3c00873. Epub 2023 Nov 13.
8
Flexibility of metal binding sites in proteins on a database scale.基于数据库规模的蛋白质中金属结合位点的灵活性
Proteins. 2005 May 1;59(2):221-30. doi: 10.1002/prot.20431.
9
Hidden relationships between metalloproteins unveiled by structural comparison of their metal sites.通过金属位点的结构比较揭示金属蛋白之间的隐藏关系。
Sci Rep. 2015 Mar 30;5:9486. doi: 10.1038/srep09486.
10
Minimal Functional Sites in Metalloproteins and Their Usage in Structural Bioinformatics.金属蛋白中的最小功能位点及其在结构生物信息学中的应用
Int J Mol Sci. 2016 May 4;17(5):671. doi: 10.3390/ijms17050671.

引用本文的文献

1
Benchmarking Zinc-Binding Site Predictors: A Comparative Analysis of Structure-Based Approaches.锌结合位点预测工具的基准测试:基于结构方法的比较分析
J Chem Inf Model. 2025 May 26;65(10):5205-5215. doi: 10.1021/acs.jcim.5c00549. Epub 2025 May 15.
2
Recent advances and future challenges in predictive modeling of metalloproteins by artificial intelligence.人工智能在金属蛋白预测建模方面的最新进展与未来挑战
Mol Cells. 2025 Apr;48(4):100191. doi: 10.1016/j.mocell.2025.100191. Epub 2025 Feb 10.
3
Bacterial Metallostasis: Metal Sensing, Metalloproteome Remodeling, and Metal Trafficking.

本文引用的文献

1
Structural investigation of a thermostable 1,2-β-mannobiose phosphorylase from Thermoanaerobacter sp. X-514.热厌氧菌 X-514 中耐热 1,2-β-甘露二糖磷酸化酶的结构研究。
Biochem Biophys Res Commun. 2021 Nov 19;579:54-61. doi: 10.1016/j.bbrc.2021.09.046. Epub 2021 Sep 21.
2
Accurate prediction of protein structures and interactions using a three-track neural network.使用三轨神经网络准确预测蛋白质结构和相互作用。
Science. 2021 Aug 20;373(6557):871-876. doi: 10.1126/science.abj8754. Epub 2021 Jul 15.
3
Highly accurate protein structure prediction with AlphaFold.
细菌金属稳态:金属感应、金属蛋白质组重塑及金属转运
Chem Rev. 2024 Dec 25;124(24):13574-13659. doi: 10.1021/acs.chemrev.4c00264. Epub 2024 Dec 10.
4
MetalHawk: Enhanced Classification of Metal Coordination Geometries by Artificial Neural Networks.MetalHawk:基于人工神经网络的金属配位几何结构的增强分类。
J Chem Inf Model. 2024 Apr 8;64(7):2356-2367. doi: 10.1021/acs.jcim.3c00873. Epub 2023 Nov 13.
5
Hunting down zinc(II)-binding sites in proteins with distance matrices.利用距离矩阵在蛋白质中寻找锌(II)结合位点。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad653.
6
Structural Bioinformatics and Deep Learning of Metalloproteins: Recent Advances and Applications.金属蛋白的结构生物信息学和深度学习:最新进展与应用。
Int J Mol Sci. 2022 Jul 12;23(14):7684. doi: 10.3390/ijms23147684.
利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
4
Machine learning differentiates enzymatic and non-enzymatic metals in proteins.机器学习区分蛋白质中的酶促金属和非酶促金属。
Nat Commun. 2021 Jun 17;12(1):3712. doi: 10.1038/s41467-021-24070-3.
5
BioMetAll: Identifying Metal-Binding Sites in Proteins from Backbone Preorganization.BioMetAll:从蛋白质的骨架预组织中识别金属结合位点。
J Chem Inf Model. 2021 Jan 25;61(1):311-323. doi: 10.1021/acs.jcim.0c00827. Epub 2020 Dec 18.
6
Deep learning model predicts water interaction sites on the surface of proteins using limited-resolution data.深度学习模型使用低分辨率数据预测蛋白质表面的水相互作用位点。
Chem Commun (Camb). 2020 Dec 21;56(98):15454-15457. doi: 10.1039/d0cc04383d. Epub 2020 Nov 25.
7
Enzyme Evolution: An Epistatic Ratchet versus a Smooth Reversible Transition.酶的进化:由上位性棘轮与平滑的可逆转变决定。
Mol Biol Evol. 2020 Apr 1;37(4):1133-1147. doi: 10.1093/molbev/msz298.
8
High-Throughput PIXE as an Essential Quantitative Assay for Accurate Metalloprotein Structural Analysis: Development and Application.高通量 PIXE 作为准确金属蛋白结构分析的基本定量分析方法:开发与应用。
J Am Chem Soc. 2020 Jan 8;142(1):185-197. doi: 10.1021/jacs.9b09186. Epub 2019 Dec 20.
9
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.DeepMSA:构建深度多重序列比对以改进远距离同源蛋白质的接触预测和折叠识别。
Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863.
10
HH-suite3 for fast remote homology detection and deep protein annotation.HH-suite3 用于快速远程同源检测和深度蛋白质注释。
BMC Bioinformatics. 2019 Sep 14;20(1):473. doi: 10.1186/s12859-019-3019-7.