• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于蛋白质-化合物相互作用预测的分子和相互作用组数据的深度学习整合。

Deep learning integration of molecular and interactome data for protein-compound interaction prediction.

作者信息

Watanabe Narumi, Ohnuki Yuuto, Sakakibara Yasubumi

机构信息

Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Kanagawa, 223-8522, Japan.

出版信息

J Cheminform. 2021 May 1;13(1):36. doi: 10.1186/s13321-021-00513-3.

DOI:10.1186/s13321-021-00513-3
PMID:33933121
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8088618/
Abstract

MOTIVATION

Virtual screening, which can computationally predict the presence or absence of protein-compound interactions, has attracted attention as a large-scale, low-cost, and short-term search method for seed compounds. Existing machine learning methods for predicting protein-compound interactions are largely divided into those based on molecular structure data and those based on network data. The former utilize information on proteins and compounds, such as amino acid sequences and chemical structures; the latter rely on interaction network data, such as protein-protein interactions and compound-compound interactions. However, there have been few attempts to combine both types of data in molecular information and interaction networks.

RESULTS

We developed a deep learning-based method that integrates protein features, compound features, and multiple types of interactome data to predict protein-compound interactions. We designed three benchmark datasets with different difficulties and applied them to evaluate the prediction method. The performance evaluations show that our deep learning framework for integrating molecular structure data and interactome data outperforms state-of-the-art machine learning methods for protein-compound interaction prediction tasks. The performance improvement is statistically significant according to the Wilcoxon signed-rank test. This finding reveals that the multi-interactome data captures perspectives other than amino acid sequence homology and chemical structure similarity and that both types of data synergistically improve the prediction accuracy. Furthermore, experiments on the three benchmark datasets show that our method is more robust than existing methods in accurately predicting interactions between proteins and compounds that are unseen in training samples.

摘要

动机

虚拟筛选能够通过计算预测蛋白质与化合物之间相互作用的存在与否,作为一种大规模、低成本且短期的先导化合物搜索方法而备受关注。现有的用于预测蛋白质与化合物相互作用的机器学习方法主要分为基于分子结构数据的方法和基于网络数据的方法。前者利用蛋白质和化合物的信息,如氨基酸序列和化学结构;后者则依赖于相互作用网络数据,如蛋白质 - 蛋白质相互作用和化合物 - 化合物相互作用。然而,在分子信息和相互作用网络中结合这两种类型的数据的尝试却很少。

结果

我们开发了一种基于深度学习的方法,该方法整合了蛋白质特征、化合物特征和多种类型的相互作用组数据来预测蛋白质与化合物之间的相互作用。我们设计了三个具有不同难度的基准数据集,并将其应用于评估预测方法。性能评估表明,我们用于整合分子结构数据和相互作用组数据的深度学习框架在蛋白质 - 化合物相互作用预测任务上优于现有的机器学习方法。根据威尔科克森符号秩检验,性能提升具有统计学意义。这一发现揭示了多相互作用组数据捕捉到了除氨基酸序列同源性和化学结构相似性之外的其他视角,并且这两种类型的数据协同提高了预测准确性。此外,在三个基准数据集上的实验表明,我们的方法在准确预测训练样本中未出现的蛋白质与化合物之间的相互作用方面比现有方法更稳健。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/1511b5f7d312/13321_2021_513_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/52084e2060b9/13321_2021_513_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/cd38d90ebcc4/13321_2021_513_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/51594b08c3e1/13321_2021_513_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/1511b5f7d312/13321_2021_513_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/52084e2060b9/13321_2021_513_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/cd38d90ebcc4/13321_2021_513_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/51594b08c3e1/13321_2021_513_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92d/8088618/1511b5f7d312/13321_2021_513_Fig4_HTML.jpg

相似文献

1
Deep learning integration of molecular and interactome data for protein-compound interaction prediction.用于蛋白质-化合物相互作用预测的分子和相互作用组数据的深度学习整合。
J Cheminform. 2021 May 1;13(1):36. doi: 10.1186/s13321-021-00513-3.
2
DeepCompoundNet: enhancing compound-protein interaction prediction with multimodal convolutional neural networks.深度化合物网络:利用多模态卷积神经网络增强化合物-蛋白质相互作用预测
J Biomol Struct Dyn. 2025 Feb;43(3):1414-1423. doi: 10.1080/07391102.2023.2291829. Epub 2023 Dec 12.
3
A deep learning method for predicting molecular properties and compound-protein interactions.一种用于预测分子性质和化合物-蛋白质相互作用的深度学习方法。
J Mol Graph Model. 2022 Dec;117:108283. doi: 10.1016/j.jmgm.2022.108283. Epub 2022 Aug 17.
4
How to approach machine learning-based prediction of drug/compound-target interactions.如何进行基于机器学习的药物/化合物-靶点相互作用预测。
J Cheminform. 2023 Feb 6;15(1):16. doi: 10.1186/s13321-023-00689-w.
5
Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences.基于图和序列神经网络端到端学习的化合物-蛋白质相互作用预测。
Bioinformatics. 2019 Jan 15;35(2):309-318. doi: 10.1093/bioinformatics/bty535.
6
FMGNN: A Method to Predict Compound-Protein Interaction With Pharmacophore Features and Physicochemical Properties of Amino Acids.FMGNN:一种利用药效团特征和氨基酸理化性质预测化合物-蛋白质相互作用的方法。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1030-1040. doi: 10.1109/TCBB.2022.3172340. Epub 2023 Apr 3.
7
Structure-Aware Multimodal Deep Learning for Drug-Protein Interaction Prediction.用于药物-蛋白质相互作用预测的结构感知多模态深度学习
J Chem Inf Model. 2022 Mar 14;62(5):1308-1317. doi: 10.1021/acs.jcim.2c00060. Epub 2022 Feb 24.
8
Boosting compound-protein interaction prediction by deep learning.通过深度学习增强化合物-蛋白质相互作用预测
Methods. 2016 Nov 1;110:64-72. doi: 10.1016/j.ymeth.2016.06.024. Epub 2016 Jul 1.
9
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
10
Evaluation of deep and shallow learning methods in chemogenomics for the prediction of drugs specificity.化学基因组学中用于预测药物特异性的深度学习和浅度学习方法评估。
J Cheminform. 2020 Feb 10;12(1):11. doi: 10.1186/s13321-020-0413-0.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
DerivaPredict: A User-Friendly Tool for Predicting and Evaluating Active Derivatives of Natural Products.DerivaPredict:一种用于预测和评估天然产物活性衍生物的用户友好型工具。
Molecules. 2025 Apr 9;30(8):1683. doi: 10.3390/molecules30081683.
3
Recent Advances in Mass Spectrometry-Based Protein Interactome Studies.

本文引用的文献

1
Multifaceted protein-protein interaction prediction based on Siamese residual RCNN.基于孪生残差 RCNN 的多功能蛋白质-蛋白质相互作用预测。
Bioinformatics. 2019 Jul 15;35(14):i305-i314. doi: 10.1093/bioinformatics/btz328.
2
DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences.DeepConv-DTI:基于蛋白质序列卷积的深度学习预测药物-靶标相互作用
PLoS Comput Biol. 2019 Jun 14;15(6):e1007129. doi: 10.1371/journal.pcbi.1007129. eCollection 2019 Jun.
3
A Massively Parallel Selection of Small Molecule-RNA Motif Binding Partners Informs Design of an Antiviral from Sequence.
基于质谱的蛋白质相互作用组研究的最新进展
Mol Cell Proteomics. 2025 Jan;24(1):100887. doi: 10.1016/j.mcpro.2024.100887. Epub 2024 Nov 27.
4
Deep learning of multimodal networks with topological regularization for drug repositioning.用于药物重新定位的具有拓扑正则化的多模态网络的深度学习。
J Cheminform. 2024 Aug 23;16(1):103. doi: 10.1186/s13321-024-00897-y.
5
FOTF-CPI: A compound-protein interaction prediction transformer based on the fusion of optimal transport fragments.FOTF-CPI:一种基于最优传输片段融合的复合蛋白相互作用预测变压器。
iScience. 2023 Dec 15;27(1):108756. doi: 10.1016/j.isci.2023.108756. eCollection 2024 Jan 19.
6
LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening.LDS-CNN:一种基于大规模药物筛选的用于药物-靶点相互作用预测的深度学习框架。
Health Inf Sci Syst. 2023 Sep 2;11(1):42. doi: 10.1007/s13755-023-00243-w. eCollection 2023 Dec.
7
A multitask GNN-based interpretable model for discovery of selective JAK inhibitors.一种基于多任务图神经网络的可解释模型,用于发现选择性JAK抑制剂。
J Cheminform. 2022 Mar 15;14(1):16. doi: 10.1186/s13321-022-00593-9.
小分子-RNA 基序结合伴侣的大规模平行筛选为基于序列的抗病毒药物设计提供信息。
Chem. 2018 Oct 11;4(10):2384-2404. doi: 10.1016/j.chempr.2018.08.003. Epub 2018 Sep 13.
4
NeoDTI: neural integration of neighbor information from a heterogeneous network for discovering new drug-target interactions.NeoDTI:来自异构网络的邻居信息的神经整合,用于发现新的药物-靶标相互作用。
Bioinformatics. 2019 Jan 1;35(1):104-111. doi: 10.1093/bioinformatics/bty543.
5
DeepDTA: deep drug-target binding affinity prediction.深度 DTA:深度药物-靶标结合亲和力预测。
Bioinformatics. 2018 Sep 1;34(17):i821-i829. doi: 10.1093/bioinformatics/bty593.
6
End-to-End Representation Learning for Chemical-Chemical Interaction Prediction.端到端化学-化学相互作用预测的表示学习。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1436-1447. doi: 10.1109/TCBB.2018.2864149. Epub 2018 Aug 7.
7
Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences.基于图和序列神经网络端到端学习的化合物-蛋白质相互作用预测。
Bioinformatics. 2019 Jan 15;35(2):309-318. doi: 10.1093/bioinformatics/bty535.
8
Modeling polypharmacy side effects with graph convolutional networks.基于图卷积网络的药物滥用副作用建模。
Bioinformatics. 2018 Jul 1;34(13):i457-i466. doi: 10.1093/bioinformatics/bty294.
9
A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information.一种基于异构信息的药物-靶点相互作用预测及计算药物重新定位的网络集成方法。
Nat Commun. 2017 Sep 18;8(1):573. doi: 10.1038/s41467-017-00680-8.
10
Data-driven prediction of adverse drug reactions induced by drug-drug interactions.药物相互作用引起的药物不良反应的数据驱动预测。
BMC Pharmacol Toxicol. 2017 Jun 8;18(1):44. doi: 10.1186/s40360-017-0153-6.