• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多通道物理信息神经网络:探索用于药物发现的可扩展和可转移神经网络。

Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery.

作者信息

Lee Munhwan, Kim Hyeyeon, Joe Hyunwhan, Kim Hong-Gee

机构信息

Biomedical Knowledge Engineering Laboratory, Seoul National University, 1 Gwanak-ro, Seoul, Republic of Korea.

出版信息

J Cheminform. 2019 Jul 9;11(1):46. doi: 10.1186/s13321-019-0368-1.

DOI:10.1186/s13321-019-0368-1
PMID:31289963
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6617572/
Abstract

Analysis of compound-protein interactions (CPIs) has become a crucial prerequisite for drug discovery and drug repositioning. In vitro experiments are commonly used in identifying CPIs, but it is not feasible to discover the molecular and proteomic space only through experimental approaches. Machine learning's advances in predicting CPIs have made significant contributions to drug discovery. Deep neural networks (DNNs), which have recently been applied to predict CPIs, performed better than other shallow classifiers. However, such techniques commonly require a considerable volume of dense data for each training target. Although the number of publicly available CPI data has grown rapidly, public data is still sparse and has a large number of measurement errors. In this paper, we propose a novel method, Multi-channel PINN, to fully utilize sparse data in terms of representation learning. With representation learning, Multi-channel PINN can utilize three approaches of DNNs which are a classifier, a feature extractor, and an end-to-end learner. Multi-channel PINN can be fed with both low and high levels of representations and incorporates each of them by utilizing all approaches within a single model. To fully utilize sparse public data, we additionally explore the potential of transferring representations from training tasks to test tasks. As a proof of concept, Multi-channel PINN was evaluated on fifteen combinations of feature pairs to investigate how they affect the performance in terms of highest performance, initial performance, and convergence speed. The experimental results obtained indicate that the multi-channel models using protein features performed better than single-channel models or multi-channel models using compound features. Therefore, Multi-channel PINN can be advantageous when used with appropriate representations. Additionally, we pretrained models on a training task then finetuned them on a test task to figure out whether Multi-channel PINN can capture general representations for compounds and proteins. We found that there were significant differences in performance between pretrained models and non-pretrained models.

摘要

复合蛋白相互作用(CPI)分析已成为药物发现和药物重新定位的关键前提。体外实验常用于识别CPI,但仅通过实验方法发现分子和蛋白质组空间是不可行的。机器学习在预测CPI方面的进展为药物发现做出了重大贡献。最近应用于预测CPI的深度神经网络(DNN)比其他浅层分类器表现更好。然而,此类技术通常每个训练目标都需要大量密集数据。尽管公开可用的CPI数据数量迅速增长,但公共数据仍然稀疏且存在大量测量误差。在本文中,我们提出了一种新颖的方法——多通道物理信息神经网络(Multi-channel PINN),以在表示学习方面充分利用稀疏数据。通过表示学习,多通道物理信息神经网络可以利用DNN的三种方法,即分类器、特征提取器和端到端学习者。多通道物理信息神经网络可以输入低层次和高层次的表示,并通过在单个模型中利用所有方法将它们结合起来。为了充分利用稀疏的公共数据,我们还探索了将表示从训练任务转移到测试任务的潜力。作为概念验证,对多通道物理信息神经网络在十五个特征对组合上进行了评估,以研究它们如何在最高性能、初始性能和收敛速度方面影响性能。获得的实验结果表明,使用蛋白质特征的多通道模型比使用化合物特征的单通道模型或多通道模型表现更好。因此,多通道物理信息神经网络在与适当的表示一起使用时可能具有优势。此外,我们在训练任务上对模型进行预训练,然后在测试任务上对其进行微调,以弄清楚多通道物理信息神经网络是否可以捕获化合物和蛋白质的通用表示。我们发现预训练模型和未预训练模型在性能上存在显著差异。

相似文献

1
Multi-channel PINN: investigating scalable and transferable neural networks for drug discovery.多通道物理信息神经网络:探索用于药物发现的可扩展和可转移神经网络。
J Cheminform. 2019 Jul 9;11(1):46. doi: 10.1186/s13321-019-0368-1.
2
Effectively Identifying Compound-Protein Interactions by Learning from Positive and Unlabeled Examples.通过从正例和无标签样例中学习来有效识别化合物-蛋白质相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1832-1843. doi: 10.1109/TCBB.2016.2570211. Epub 2016 May 18.
3
Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences.基于图和序列神经网络端到端学习的化合物-蛋白质相互作用预测。
Bioinformatics. 2019 Jan 15;35(2):309-318. doi: 10.1093/bioinformatics/bty535.
4
BACPI: a bi-directional attention neural network for compound-protein interaction and binding affinity prediction.BACPI:一种用于化合物-蛋白质相互作用和结合亲和力预测的双向注意力神经网络。
Bioinformatics. 2022 Mar 28;38(7):1995-2002. doi: 10.1093/bioinformatics/btac035.
5
MMCL-CPI: A multi-modal compound-protein interaction prediction model incorporating contrastive learning pre-training.MMCL-CPI:一种结合对比学习预训练的多模态化合物-蛋白质相互作用预测模型。
Comput Biol Chem. 2024 Oct;112:108137. doi: 10.1016/j.compbiolchem.2024.108137. Epub 2024 Jul 25.
6
Boosting compound-protein interaction prediction by deep learning.通过深度学习增强化合物-蛋白质相互作用预测
Methods. 2016 Nov 1;110:64-72. doi: 10.1016/j.ymeth.2016.06.024. Epub 2016 Jul 1.
7
Strategies for multi-case physics-informed neural networks for tube flows: a study using 2D flow scenarios.用于管流的多案例物理信息神经网络策略:一项使用二维流场情景的研究
Sci Rep. 2024 May 21;14(1):11577. doi: 10.1038/s41598-024-62117-9.
8
Physics-informed neural networks based on adaptive weighted loss functions for Hamilton-Jacobi equations.基于自适应加权损失函数的用于哈密顿-雅可比方程的物理信息神经网络。
Math Biosci Eng. 2022 Sep 5;19(12):12866-12896. doi: 10.3934/mbe.2022601.
9
Physics-informed neural networks for predicting gas flow dynamics and unknown parameters in diesel engines.用于预测柴油机内气体流动动力学和未知参数的物理信息神经网络。
Sci Rep. 2023 Aug 22;13(1):13683. doi: 10.1038/s41598-023-39989-4.
10
MDeePred: novel multi-channel protein featurization for deep learning-based binding affinity prediction in drug discovery.MDeePred:用于药物发现中基于深度学习的结合亲和力预测的新型多通道蛋白质特征化。
Bioinformatics. 2021 May 5;37(5):693-704. doi: 10.1093/bioinformatics/btaa858.

引用本文的文献

1
Transfer learning for drug-target interaction prediction.药物-靶标相互作用预测的迁移学习。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i103-i110. doi: 10.1093/bioinformatics/btad234.
2
Improving Compound Activity Classification via Deep Transfer and Representation Learning.通过深度迁移和表征学习改进化合物活性分类
ACS Omega. 2022 Mar 11;7(11):9465-9483. doi: 10.1021/acsomega.1c06805. eCollection 2022 Mar 22.
3
A review on machine learning approaches and trends in drug discovery.关于药物发现中机器学习方法与趋势的综述。

本文引用的文献

1
Machine Learning for Drug-Target Interaction Prediction.机器学习在药物-靶标相互作用预测中的应用。
Molecules. 2018 Aug 31;23(9):2208. doi: 10.3390/molecules23092208.
2
Virtual Chemical Libraries.虚拟化学文库。
J Med Chem. 2019 Feb 14;62(3):1116-1124. doi: 10.1021/acs.jmedchem.8b01048. Epub 2018 Sep 10.
3
Recent applications of deep learning and machine intelligence on in silico drug discovery: methods, tools and databases.深度学习和机器智能在计算机药物发现中的最新应用:方法、工具和数据库。
Comput Struct Biotechnol J. 2021 Aug 12;19:4538-4558. doi: 10.1016/j.csbj.2021.08.011. eCollection 2021.
4
Artificial intelligence and machine learning-aided drug discovery in central nervous system diseases: State-of-the-arts and future directions.人工智能和机器学习辅助中枢神经系统疾病药物发现:现状与未来方向。
Med Res Rev. 2021 May;41(3):1427-1473. doi: 10.1002/med.21764. Epub 2020 Dec 9.
Brief Bioinform. 2019 Sep 27;20(5):1878-1912. doi: 10.1093/bib/bby061.
4
MoleculeNet: a benchmark for molecular machine learning.分子网络:分子机器学习的一个基准
Chem Sci. 2017 Oct 31;9(2):513-530. doi: 10.1039/c7sc02664a. eCollection 2018 Jan 14.
5
Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules.使用数据驱动的分子连续表示法进行自动化学设计。
ACS Cent Sci. 2018 Feb 28;4(2):268-276. doi: 10.1021/acscentsci.7b00572. Epub 2018 Jan 12.
6
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2018 Mar 16;46(5):2699. doi: 10.1093/nar/gky092.
7
Mol2vec: Unsupervised Machine Learning Approach with Chemical Intuition.Mol2vec:具有化学直觉的无监督机器学习方法。
J Chem Inf Model. 2018 Jan 22;58(1):27-35. doi: 10.1021/acs.jcim.7b00616. Epub 2018 Jan 10.
8
Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set.超越炒作:使用ChEMBL生物活性基准集,深度神经网络优于现有方法。
J Cheminform. 2017 Aug 14;9(1):45. doi: 10.1186/s13321-017-0232-0.
9
Deep-learning: investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data.深度学习:研究深度神经网络超参数以及将其性能与用于生物活性数据建模的浅层方法进行比较。
J Cheminform. 2017 Jun 28;9(1):42. doi: 10.1186/s13321-017-0226-y.
10
In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences.基于药物化学结构和蛋白质序列的药物-靶标相互作用网络的计算预测。
Sci Rep. 2017 Sep 11;7(1):11174. doi: 10.1038/s41598-017-10724-0.