• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习打分和虚拟筛选的生物分子的代数拓扑表示。

Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening.

机构信息

Department of Mathematics, Michigan State University, East Lansing, Michigan, United States of America.

Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee, United States of America.

出版信息

PLoS Comput Biol. 2018 Jan 8;14(1):e1005929. doi: 10.1371/journal.pcbi.1005929. eCollection 2018 Jan.

DOI:10.1371/journal.pcbi.1005929
PMID:29309403
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5774846/
Abstract

This work introduces a number of algebraic topology approaches, including multi-component persistent homology, multi-level persistent homology, and electrostatic persistence for the representation, characterization, and description of small molecules and biomolecular complexes. In contrast to the conventional persistent homology, multi-component persistent homology retains critical chemical and biological information during the topological simplification of biomolecular geometric complexity. Multi-level persistent homology enables a tailored topological description of inter- and/or intra-molecular interactions of interest. Electrostatic persistence incorporates partial charge information into topological invariants. These topological methods are paired with Wasserstein distance to characterize similarities between molecules and are further integrated with a variety of machine learning algorithms, including k-nearest neighbors, ensemble of trees, and deep convolutional neural networks, to manifest their descriptive and predictive powers for protein-ligand binding analysis and virtual screening of small molecules. Extensive numerical experiments involving 4,414 protein-ligand complexes from the PDBBind database and 128,374 ligand-target and decoy-target pairs in the DUD database are performed to test respectively the scoring power and the discriminatory power of the proposed topological learning strategies. It is demonstrated that the present topological learning outperforms other existing methods in protein-ligand binding affinity prediction and ligand-decoy discrimination.

摘要

这项工作介绍了一些代数拓扑方法,包括多分量持久同调、多层次持久同调和静电持久,用于小分子和生物分子复合物的表示、特征描述和描述。与传统的持久同调相比,多分量持久同调在生物分子几何复杂性的拓扑简化过程中保留了关键的化学和生物学信息。多层次持久同调能够对感兴趣的分子内和/或分子间相互作用进行定制化的拓扑描述。静电持久将部分电荷信息纳入拓扑不变量中。这些拓扑方法与 Wasserstein 距离相结合,用于表征分子之间的相似性,并进一步与各种机器学习算法(包括 k-最近邻、树集成和深度卷积神经网络)集成,以展示它们在蛋白质-配体结合分析和小分子虚拟筛选中的描述和预测能力。进行了广泛的数值实验,涉及来自 PDBBind 数据库的 4414 个蛋白质-配体复合物和来自 DUD 数据库的 128374 个配体-靶标和诱饵-靶标对,分别测试了所提出的拓扑学习策略的评分能力和区分能力。结果表明,与其他现有方法相比,本拓扑学习方法在蛋白质-配体结合亲和力预测和配体-诱饵区分方面表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/52be845e4e9a/pcbi.1005929.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/5676e9866d38/pcbi.1005929.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/f956abb151ac/pcbi.1005929.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/470efd540994/pcbi.1005929.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/51b15f0ea208/pcbi.1005929.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/d7189929ae8f/pcbi.1005929.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/8acd29892efc/pcbi.1005929.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/9e7c89d7d46d/pcbi.1005929.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/191d3cdd0004/pcbi.1005929.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/0363c6dfdda7/pcbi.1005929.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/dcf0b2a91fbb/pcbi.1005929.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/425b9d362926/pcbi.1005929.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/52be845e4e9a/pcbi.1005929.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/5676e9866d38/pcbi.1005929.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/f956abb151ac/pcbi.1005929.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/470efd540994/pcbi.1005929.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/51b15f0ea208/pcbi.1005929.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/d7189929ae8f/pcbi.1005929.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/8acd29892efc/pcbi.1005929.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/9e7c89d7d46d/pcbi.1005929.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/191d3cdd0004/pcbi.1005929.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/0363c6dfdda7/pcbi.1005929.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/dcf0b2a91fbb/pcbi.1005929.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/425b9d362926/pcbi.1005929.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c35/5774846/52be845e4e9a/pcbi.1005929.g012.jpg

相似文献

1
Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening.基于机器学习打分和虚拟筛选的生物分子的代数拓扑表示。
PLoS Comput Biol. 2018 Jan 8;14(1):e1005929. doi: 10.1371/journal.pcbi.1005929. eCollection 2018 Jan.
2
TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions.拓扑网络:用于生物分子性质预测的基于拓扑的深度卷积和多任务神经网络。
PLoS Comput Biol. 2017 Jul 27;13(7):e1005690. doi: 10.1371/journal.pcbi.1005690. eCollection 2017 Jul.
3
Integration of element specific persistent homology and machine learning for protein-ligand binding affinity prediction.用于蛋白质-配体结合亲和力预测的元素特异性持久同调与机器学习的整合
Int J Numer Method Biomed Eng. 2018 Feb;34(2). doi: 10.1002/cnm.2914. Epub 2017 Aug 16.
4
AGL-Score: Algebraic Graph Learning Score for Protein-Ligand Binding Scoring, Ranking, Docking, and Screening.AGL-Score:用于蛋白质-配体结合评分、排序、对接和筛选的代数图学习评分。
J Chem Inf Model. 2019 Jul 22;59(7):3291-3304. doi: 10.1021/acs.jcim.9b00334. Epub 2019 Jul 1.
5
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction.基于持久谱超图的机器学习(PSH-ML)用于蛋白质-配体结合亲和力预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab127.
6
Quantitative Toxicity Prediction Using Topology Based Multitask Deep Neural Networks.基于拓扑的多任务深度神经网络的定量毒性预测。
J Chem Inf Model. 2018 Feb 26;58(2):520-531. doi: 10.1021/acs.jcim.7b00558. Epub 2018 Jan 31.
7
Machine learning in computational docking.计算对接中的机器学习。
Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.
8
TopP-S: Persistent homology-based multi-task deep neural networks for simultaneous predictions of partition coefficient and aqueous solubility.TopP-S:基于持久同调的多任务深度神经网络,用于同时预测分配系数和水溶解度。
J Comput Chem. 2018 Jul 30;39(20):1444-1454. doi: 10.1002/jcc.25213. Epub 2018 Apr 6.
9
Atom-specific persistent homology and its application to protein flexibility analysis.原子特异性持久同调及其在蛋白质柔性分析中的应用。
Comput Math Biophys. 2020 Jan;8(1):1-35. doi: 10.1515/cmb-2020-0001. Epub 2020 Feb 17.
10
Persistent Cohomology for Data With Multicomponent Heterogeneous Information.具有多组分异构信息的数据的持久上同调
SIAM J Math Data Sci. 2020;2(2):396-418. doi: 10.1137/19m1272226. Epub 2020 May 19.

引用本文的文献

1
Multiscale topology-enabled structure-to-sequence transformer for protein-ligand interaction predictions.用于蛋白质-配体相互作用预测的多尺度拓扑结构到序列变压器
Nat Mach Intell. 2024 Jul;6(7):799-810. doi: 10.1038/s42256-024-00855-1. Epub 2024 Jun 21.
2
Predicting Affinity Through Homology (PATH): Interpretable binding affinity prediction with persistent homology.通过同源性预测亲和力(PATH):利用持久同源性进行可解释的结合亲和力预测。
PLoS Comput Biol. 2025 Jun 27;21(6):e1013216. doi: 10.1371/journal.pcbi.1013216. eCollection 2025 Jun.
3
PLAIG: Protein-Ligand Binding Affinity Prediction Using a Novel Interaction-Based Graph Neural Network Framework.

本文引用的文献

1
MoleculeNet: a benchmark for molecular machine learning.分子网络:分子机器学习的一个基准
Chem Sci. 2017 Oct 31;9(2):513-530. doi: 10.1039/c7sc02664a. eCollection 2018 Jan 14.
2
Analysis and prediction of protein folding energy changes upon mutation by element specific persistent homology.通过元素特定的持久同调分析和预测突变时蛋白质折叠能量的变化。
Bioinformatics. 2017 Nov 15;33(22):3549-3557. doi: 10.1093/bioinformatics/btx460.
3
TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions.
PLAIG:使用基于新型相互作用的图神经网络框架进行蛋白质-配体结合亲和力预测。
ACS Bio Med Chem Au. 2025 Apr 29;5(3):447-463. doi: 10.1021/acsbiomedchemau.5c00053. eCollection 2025 Jun 18.
4
CAML: Commutative Algebra Machine Learning─A Case Study on Protein-Ligand Binding Affinity Prediction.CAML:交换代数机器学习——蛋白质-配体结合亲和力预测的案例研究
J Chem Inf Model. 2025 Jul 14;65(13):6732-6743. doi: 10.1021/acs.jcim.5c00940. Epub 2025 Jun 15.
5
Machine Learning for Toxicity Prediction Using Chemical Structures: Pillars for Success in the Real World.利用化学结构进行毒性预测的机器学习:在现实世界中取得成功的支柱。
Chem Res Toxicol. 2025 May 19;38(5):759-807. doi: 10.1021/acs.chemrestox.5c00033. Epub 2025 May 2.
6
Position: Topological Deep Learning is the New Frontier for Relational Learning.观点:拓扑深度学习是关系学习的新前沿。
Proc Mach Learn Res. 2024 Jul;235:39529-39555.
7
Persistent Directed Flag Laplacian (PDFL)-Based Machine Learning for Protein-Ligand Binding Affinity Prediction.基于持久定向旗拉普拉斯算子(PDFL)的机器学习用于蛋白质-配体结合亲和力预测
J Chem Theory Comput. 2025 Apr 22;21(8):4276-4285. doi: 10.1021/acs.jctc.5c00074. Epub 2025 Apr 5.
8
GNNSeq: A Sequence-Based Graph Neural Network for Predicting Protein-Ligand Binding Affinity.GNNSeq:一种基于序列的图神经网络,用于预测蛋白质-配体结合亲和力。
Pharmaceuticals (Basel). 2025 Feb 26;18(3):329. doi: 10.3390/ph18030329.
9
GraphBAN: An inductive graph-based approach for enhanced prediction of compound-protein interactions.GraphBAN:一种基于归纳图的方法,用于增强对化合物-蛋白质相互作用的预测。
Nat Commun. 2025 Mar 18;16(1):2541. doi: 10.1038/s41467-025-57536-9.
10
Natural Language Processing Methods for the Study of Protein-Ligand Interactions.用于蛋白质-配体相互作用研究的自然语言处理方法
J Chem Inf Model. 2025 Mar 10;65(5):2191-2213. doi: 10.1021/acs.jcim.4c01907. Epub 2025 Feb 24.
拓扑网络:用于生物分子性质预测的基于拓扑的深度卷积和多任务神经网络。
PLoS Comput Biol. 2017 Jul 27;13(7):e1005690. doi: 10.1371/journal.pcbi.1005690. eCollection 2017 Jul.
4
Integration of element specific persistent homology and machine learning for protein-ligand binding affinity prediction.用于蛋白质-配体结合亲和力预测的元素特异性持久同调与机器学习的整合
Int J Numer Method Biomed Eng. 2018 Feb;34(2). doi: 10.1002/cnm.2914. Epub 2017 Aug 16.
5
Rigidity Strengthening: A Mechanism for Protein-Ligand Binding.刚性强化:蛋白质-配体结合的一种机制。
J Chem Inf Model. 2017 Jul 24;57(7):1715-1721. doi: 10.1021/acs.jcim.7b00226. Epub 2017 Jul 12.
6
ESES: Software for Eulerian solvent excluded surface.ESES:用于欧拉溶剂排除表面的软件。
J Comput Chem. 2017 Mar 15;38(7):446-466. doi: 10.1002/jcc.24682. Epub 2017 Jan 4.
7
Boosting Docking-Based Virtual Screening with Deep Learning.深度学习增强基于对接的虚拟筛选。
J Chem Inf Model. 2016 Dec 27;56(12):2495-2506. doi: 10.1021/acs.jcim.6b00355. Epub 2016 Nov 29.
8
The impact of surface area, volume, curvature, and Lennard-Jones potential to solvation modeling.表面积、体积、曲率和 Lennard-Jones 势对溶剂化建模的影响。
J Comput Chem. 2017 Jan 5;38(1):24-36. doi: 10.1002/jcc.24512. Epub 2016 Oct 8.
9
Improving AutoDock Vina Using Random Forest: The Growing Accuracy of Binding Affinity Prediction by the Effective Exploitation of Larger Data Sets.使用随机森林改进AutoDock Vina:通过有效利用更大数据集提高结合亲和力预测的准确性。
Mol Inform. 2015 Feb;34(2-3):115-26. doi: 10.1002/minf.201400132. Epub 2015 Feb 12.
10
Modeling Epoxidation of Drug-like Molecules with a Deep Machine Learning Network.基于深度机器学习网络对类药物分子环氧化反应的建模。
ACS Cent Sci. 2015 Jul 22;1(4):168-80. doi: 10.1021/acscentsci.5b00131. Epub 2015 Jun 9.