• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 Dowker 复合物的机器学习 (DCML) 模型用于预测蛋白质-配体结合亲和力。

Dowker complex based machine learning (DCML) models for protein-ligand binding affinity prediction.

机构信息

Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore.

Chern Institute of Mathematics and LPMC, Nankai University, Tianjin, China.

出版信息

PLoS Comput Biol. 2022 Apr 6;18(4):e1009943. doi: 10.1371/journal.pcbi.1009943. eCollection 2022 Apr.

DOI:10.1371/journal.pcbi.1009943
PMID:35385478
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8985993/
Abstract

With the great advancements in experimental data, computational power and learning algorithms, artificial intelligence (AI) based drug design has begun to gain momentum recently. AI-based drug design has great promise to revolutionize pharmaceutical industries by significantly reducing the time and cost in drug discovery processes. However, a major issue remains for all AI-based learning model that is efficient molecular representations. Here we propose Dowker complex (DC) based molecular interaction representations and Riemann Zeta function based molecular featurization, for the first time. Molecular interactions between proteins and ligands (or others) are modeled as Dowker complexes. A multiscale representation is generated by using a filtration process, during which a series of DCs are generated at different scales. Combinatorial (Hodge) Laplacian matrices are constructed from these DCs, and the Riemann zeta functions from their spectral information can be used as molecular descriptors. To validate our models, we consider protein-ligand binding affinity prediction. Our DC-based machine learning (DCML) models, in particular, DC-based gradient boosting tree (DC-GBT), are tested on three most-commonly used datasets, i.e., including PDBbind-2007, PDBbind-2013 and PDBbind-2016, and extensively compared with other existing state-of-the-art models. It has been found that our DC-based descriptors can achieve the state-of-the-art results and have better performance than all machine learning models with traditional molecular descriptors. Our Dowker complex based machine learning models can be used in other tasks in AI-based drug design and molecular data analysis.

摘要

随着实验数据、计算能力和学习算法的巨大进步,基于人工智能(AI)的药物设计最近开始获得动力。基于人工智能的药物设计有望通过大大缩短药物发现过程中的时间和成本,彻底改变制药行业。然而,对于所有基于人工智能的学习模型来说,一个主要问题仍然是有效的分子表示。在这里,我们首次提出了基于道克复合物(DC)的分子相互作用表示法和基于黎曼 ζ 函数的分子特征化方法。蛋白质和配体(或其他)之间的分子相互作用被建模为道克复合物。通过使用过滤过程生成多尺度表示,在此过程中,在不同尺度上生成一系列 DC。从这些 DC 构建组合(Hodge)拉普拉斯矩阵,并从其光谱信息中使用黎曼 ζ 函数作为分子描述符。为了验证我们的模型,我们考虑了蛋白质-配体结合亲和力预测。我们的基于 DC 的机器学习(DCML)模型,特别是基于 DC 的梯度提升树(DC-GBT),在三个最常用的数据集上进行了测试,即包括 PDBbind-2007、PDBbind-2013 和 PDBbind-2016,并与其他现有的最先进的模型进行了广泛比较。结果发现,我们的基于 DC 的描述符可以达到最先进的结果,并且比所有使用传统分子描述符的机器学习模型性能更好。我们的基于道克复合物的机器学习模型可以用于基于人工智能的药物设计和分子数据分析中的其他任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/e0b9727edddb/pcbi.1009943.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/9ee6925e76bb/pcbi.1009943.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/e5c96b13b6ca/pcbi.1009943.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/4c627da6949b/pcbi.1009943.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/e0b9727edddb/pcbi.1009943.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/9ee6925e76bb/pcbi.1009943.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/e5c96b13b6ca/pcbi.1009943.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/4c627da6949b/pcbi.1009943.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0326/8985993/e0b9727edddb/pcbi.1009943.g004.jpg

相似文献

1
Dowker complex based machine learning (DCML) models for protein-ligand binding affinity prediction.基于 Dowker 复合物的机器学习 (DCML) 模型用于预测蛋白质-配体结合亲和力。
PLoS Comput Biol. 2022 Apr 6;18(4):e1009943. doi: 10.1371/journal.pcbi.1009943. eCollection 2022 Apr.
2
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction.基于持久谱超图的机器学习(PSH-ML)用于蛋白质-配体结合亲和力预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab127.
3
Forman persistent Ricci curvature (FPRC)-based machine learning models for protein-ligand binding affinity prediction.基于 Forman 持续 Ricci 曲率 (FPRC) 的蛋白质-配体结合亲和力预测机器学习模型。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab136.
4
Persistent Path-Spectral (PPS) Based Machine Learning for Protein-Ligand Binding Affinity Prediction.基于持续路径谱(PPS)的机器学习的蛋白质-配体结合亲和力预测。
J Chem Inf Model. 2023 Feb 13;63(3):1066-1075. doi: 10.1021/acs.jcim.2c01251. Epub 2023 Jan 16.
5
Ollivier Persistent Ricci Curvature-Based Machine Learning for the Protein-Ligand Binding Affinity Prediction.基于奥利维尔持续 Ricci 曲率的机器学习在蛋白质-配体结合亲和力预测中的应用。
J Chem Inf Model. 2021 Apr 26;61(4):1617-1626. doi: 10.1021/acs.jcim.0c01415. Epub 2021 Mar 16.
6
-Complex-Based Machine Learning (HCML) for the Prediction of Protein-Protein Binding Affinity Changes upon Mutation.基于复合体的机器学习(HCML)预测突变后蛋白质-蛋白质结合亲和力的变化。
J Chem Inf Model. 2022 Sep 12;62(17):3961-3969. doi: 10.1021/acs.jcim.2c00580. Epub 2022 Aug 30.
7
Molecular persistent spectral image (Mol-PSI) representation for machine learning models in drug design.用于药物设计中机器学习模型的分子持久谱图像(Mol-PSI)表示。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab527.
8
Hypergraph-based persistent cohomology (HPC) for molecular representations in drug design.基于超图的持久同调(HPC)在药物设计中的分子表示。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa411.
9
Persistent spectral based ensemble learning (PerSpect-EL) for protein-protein binding affinity prediction.基于持续谱的集成学习(PerSpect-EL)在蛋白质-蛋白质结合亲和力预测中的应用。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac024.
10
Multi-Cover Persistence (MCP)-based machine learning for polymer property prediction.基于多覆盖持久性(MCP)的机器学习在聚合物性能预测中的应用。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae465.

引用本文的文献

1
Multiscale topology-enabled structure-to-sequence transformer for protein-ligand interaction predictions.用于蛋白质-配体相互作用预测的多尺度拓扑结构到序列变压器
Nat Mach Intell. 2024 Jul;6(7):799-810. doi: 10.1038/s42256-024-00855-1. Epub 2024 Jun 21.
2
Predicting Affinity Through Homology (PATH): Interpretable binding affinity prediction with persistent homology.通过同源性预测亲和力(PATH):利用持久同源性进行可解释的结合亲和力预测。
PLoS Comput Biol. 2025 Jun 27;21(6):e1013216. doi: 10.1371/journal.pcbi.1013216. eCollection 2025 Jun.
3
ERL-ProLiGraph: Enhanced representation learning on protein-ligand graph structured data for binding affinity prediction.

本文引用的文献

1
OnionNet-2: A Convolutional Neural Network Model for Predicting Protein-Ligand Binding Affinity Based on Residue-Atom Contacting Shells.洋葱网络-2:一种基于残基-原子接触壳预测蛋白质-配体结合亲和力的卷积神经网络模型。
Front Chem. 2021 Oct 27;9:753002. doi: 10.3389/fchem.2021.753002. eCollection 2021.
2
HERMES: PERSISTENT SPECTRAL GRAPH SOFTWARE.赫尔墨斯:持久光谱图软件。
Found Data Sci. 2021 Mar;3(1):67-97. doi: 10.3934/fods.2021006.
3
Generative network complex (GNC) for drug discovery.用于药物发现的生成网络复合体(GNC)
ERL-ProLiGraph:用于结合亲和力预测的蛋白质-配体图结构数据的增强表示学习
Mol Inform. 2024 Dec;43(12):e202400044. doi: 10.1002/minf.202400044. Epub 2024 Oct 15.
4
Relational Persistent Homology for Multispecies Data with Application to the Tumor Microenvironment.关系型持久同调在多物种数据中的应用——以肿瘤微环境为例。
Bull Math Biol. 2024 Sep 17;86(11):128. doi: 10.1007/s11538-024-01353-6.
5
TopoFormer: Multiscale Topology-enabled Structure-to-Sequence Transformer for Protein-Ligand Interaction Predictions.TopoFormer:用于蛋白质-配体相互作用预测的多尺度拓扑结构序列Transformer
Res Sq. 2024 Feb 9:rs.3.rs-3640878. doi: 10.21203/rs.3.rs-3640878/v1.
6
EGGNet, a Generalizable Geometric Deep Learning Framework for Protein Complex Pose Scoring.EGGNet,一种用于蛋白质复合物姿态评分的通用几何深度学习框架。
ACS Omega. 2024 Feb 9;9(7):7471-7479. doi: 10.1021/acsomega.3c04889. eCollection 2024 Feb 20.
7
SS-GNN: A Simple-Structured Graph Neural Network for Affinity Prediction.SS-GNN:一种用于亲和力预测的结构简单的图神经网络。
ACS Omega. 2023 Jun 15;8(25):22496-22507. doi: 10.1021/acsomega.3c00085. eCollection 2023 Jun 27.
8
CAPLA: improved prediction of protein-ligand binding affinity by a deep learning approach based on a cross-attention mechanism.CAPLA:基于交叉注意力机制的深度学习方法提高了蛋白质配体结合亲和力的预测能力。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad049.
Commun Inf Syst. 2019;19(3):241-277. doi: 10.4310/cis.2019.v19.n3.a2.
4
DeepBSP-a Machine Learning Method for Accurate Prediction of Protein-Ligand Docking Structures.DeepBSP:一种用于准确预测蛋白质-配体对接结构的机器学习方法。
J Chem Inf Model. 2021 May 24;61(5):2231-2240. doi: 10.1021/acs.jcim.1c00334. Epub 2021 May 12.
5
Persistent spectral-based machine learning (PerSpect ML) for protein-ligand binding affinity prediction.用于蛋白质-配体结合亲和力预测的基于持久光谱的机器学习(PerSpect ML)。
Sci Adv. 2021 May 7;7(19). doi: 10.1126/sciadv.abc5329. Print 2021 May.
6
Forman persistent Ricci curvature (FPRC)-based machine learning models for protein-ligand binding affinity prediction.基于 Forman 持续 Ricci 曲率 (FPRC) 的蛋白质-配体结合亲和力预测机器学习模型。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab136.
7
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction.基于持久谱超图的机器学习(PSH-ML)用于蛋白质-配体结合亲和力预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab127.
8
DeepDTAF: a deep learning method to predict protein-ligand binding affinity.DeepDTAF:一种用于预测蛋白-配体结合亲和力的深度学习方法。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab072.
9
Improved Protein-Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference.基于结构的深度融合推理提高蛋白-配体结合亲和力预测。
J Chem Inf Model. 2021 Apr 26;61(4):1583-1592. doi: 10.1021/acs.jcim.0c01306. Epub 2021 Mar 23.
10
SE-OnionNet: A Convolution Neural Network for Protein-Ligand Binding Affinity Prediction.SE-OnionNet:一种用于蛋白质-配体结合亲和力预测的卷积神经网络。
Front Genet. 2021 Feb 19;11:607824. doi: 10.3389/fgene.2020.607824. eCollection 2020.