• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GraphEGFR:基于分子图注意力机制和指纹的多任务和迁移学习,在数据稀缺的情况下提高了针对 EGFR 家族蛋白的抑制剂生物活性预测。

GraphEGFR: Multi-task and transfer learning based on molecular graph attention mechanism and fingerprints improving inhibitor bioactivity prediction for EGFR family proteins on data scarcity.

机构信息

School of Information Science and Technology, Vidyasirimedhi Institute of Science and Technology, Rayong, Thailand.

School of Information, Computer, and Communication Technology, Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani, Thailand.

出版信息

J Comput Chem. 2024 Sep 5;45(23):2001-2023. doi: 10.1002/jcc.27388. Epub 2024 May 7.

DOI:10.1002/jcc.27388
PMID:38713612
Abstract

The proteins within the human epidermal growth factor receptor (EGFR) family, members of the tyrosine kinase receptor family, play a pivotal role in the molecular mechanisms driving the development of various tumors. Tyrosine kinase inhibitors, key compounds in targeted therapy, encounter challenges in cancer treatment due to emerging drug resistance mutations. Consequently, machine learning has undergone significant evolution to address the challenges of cancer drug discovery related to EGFR family proteins. However, the application of deep learning in this area is hindered by inherent difficulties associated with small-scale data, particularly the risk of overfitting. Moreover, the design of a model architecture that facilitates learning through multi-task and transfer learning, coupled with appropriate molecular representation, poses substantial challenges. In this study, we introduce GraphEGFR, a deep learning regression model designed to enhance molecular representation and model architecture for predicting the bioactivity of inhibitors against both wild-type and mutant EGFR family proteins. GraphEGFR integrates a graph attention mechanism for molecular graphs with deep and convolutional neural networks for molecular fingerprints. We observed that GraphEGFR models employing multi-task and transfer learning strategies generally achieve predictive performance comparable to existing competitive methods. The integration of molecular graphs and fingerprints adeptly captures relationships between atoms and enables both global and local pattern recognition. We further validated potential multi-targeted inhibitors for wild-type and mutant HER1 kinases, exploring key amino acid residues through molecular dynamics simulations to understand molecular interactions. This predictive model offers a robust strategy that could significantly contribute to overcoming the challenges of developing deep learning models for drug discovery with limited data and exploring new frontiers in multi-targeted kinase drug discovery for EGFR family proteins.

摘要

人类表皮生长因子受体(EGFR)家族中的蛋白质是酪氨酸激酶受体家族的成员,在驱动各种肿瘤发展的分子机制中起着关键作用。酪氨酸激酶抑制剂是靶向治疗的关键化合物,但由于出现耐药性突变,它们在癌症治疗中遇到了挑战。因此,机器学习在解决与 EGFR 家族蛋白相关的癌症药物发现挑战方面经历了重大发展。然而,深度学习在该领域的应用受到与小数据集相关的固有困难的阻碍,特别是过拟合的风险。此外,设计一个便于通过多任务和迁移学习进行学习的模型架构,并结合适当的分子表示形式,也面临着巨大的挑战。在这项研究中,我们引入了 GraphEGFR,这是一种深度学习回归模型,旨在增强分子表示和模型架构,以预测针对野生型和突变型 EGFR 家族蛋白的抑制剂的生物活性。GraphEGFR 将用于分子图的图注意机制与用于分子指纹的深度和卷积神经网络集成在一起。我们观察到,采用多任务和迁移学习策略的 GraphEGFR 模型通常可以实现与现有竞争方法相当的预测性能。分子图和指纹的集成巧妙地捕捉了原子之间的关系,能够进行全局和局部模式识别。我们进一步验证了针对野生型和突变型 HER1 激酶的潜在多靶向抑制剂,通过分子动力学模拟探索关键氨基酸残基,以了解分子相互作用。这种预测模型提供了一种强大的策略,可以为克服数据有限的情况下开发深度学习模型以进行药物发现的挑战以及探索针对 EGFR 家族蛋白的多靶向激酶药物发现的新前沿做出重大贡献。

相似文献

1
GraphEGFR: Multi-task and transfer learning based on molecular graph attention mechanism and fingerprints improving inhibitor bioactivity prediction for EGFR family proteins on data scarcity.GraphEGFR:基于分子图注意力机制和指纹的多任务和迁移学习,在数据稀缺的情况下提高了针对 EGFR 家族蛋白的抑制剂生物活性预测。
J Comput Chem. 2024 Sep 5;45(23):2001-2023. doi: 10.1002/jcc.27388. Epub 2024 May 7.
2
GeneralizedDTA: combining pre-training and multi-task learning to predict drug-target binding affinity for unknown drug discovery.通用 DTA:结合预训练和多任务学习,预测未知药物发现的药物-靶标结合亲和力。
BMC Bioinformatics. 2022 Sep 7;23(1):367. doi: 10.1186/s12859-022-04905-6.
3
Small molecule inhibitors targeting the EGFR/ErbB family of protein-tyrosine kinases in human cancers.针对人类癌症中 EGFR/ErbB 家族蛋白酪氨酸激酶的小分子抑制剂。
Pharmacol Res. 2019 Jan;139:395-411. doi: 10.1016/j.phrs.2018.11.014. Epub 2018 Nov 27.
4
Drug-target affinity prediction with extended graph learning-convolutional networks.基于扩展图学习卷积网络的药物-靶标亲和力预测。
BMC Bioinformatics. 2024 Feb 16;25(1):75. doi: 10.1186/s12859-024-05698-6.
5
FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction.FP-GNN:一种用于增强分子性质预测的多功能深度学习架构。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac408.
6
A multi-task deep learning model for EGFR genotyping prediction and GTV segmentation of brain metastasis.用于 EGFR 基因分型预测和脑转移 GTV 分割的多任务深度学习模型。
J Transl Med. 2023 Nov 7;21(1):788. doi: 10.1186/s12967-023-04681-8.
7
Quantitative Structure-Mutation-Activity Relationship Tests (QSMART) model for protein kinase inhibitor response prediction.定量构效突变活性关系测试(QSMART)模型用于预测蛋白激酶抑制剂反应。
BMC Bioinformatics. 2020 Nov 12;21(1):520. doi: 10.1186/s12859-020-03842-6.
8
KGE-UNIT: toward the unification of molecular interactions prediction based on knowledge graph and multi-task learning on drug discovery.KGE-UNIT:基于知识图和药物发现的多任务学习的分子相互作用预测的统一方法。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae043.
9
Employing Molecular Conformations for Ligand-Based Virtual Screening with Equivariant Graph Neural Network and Deep Multiple Instance Learning.利用基于分子构象的等价图神经网络和深度多重实例学习进行配体虚拟筛选。
Molecules. 2023 Aug 9;28(16):5982. doi: 10.3390/molecules28165982.
10
Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.利用机器学习进展进行药物发现和分子生物学中的数据整合
Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7.