• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CoGT:集成机器学习方法及其在JAK抑制剂发现中的应用

CoGT: Ensemble Machine Learning Method and Its Application on JAK Inhibitor Discovery.

作者信息

Bu Yingzi, Gao Ruoxi, Zhang Bohan, Zhang Luchen, Sun Duxin

机构信息

Department of Pharmaceutical Sciences, College of Pharmacy, University of Michigan, Ann Arbor, Michigan 48109, United States.

Department of Electrical Engineering and Computer Science, University of MichiganAnn Arbor, Michigan 48109, United States.

出版信息

ACS Omega. 2023 Mar 27;8(14):13232-13242. doi: 10.1021/acsomega.3c00160. eCollection 2023 Apr 11.

DOI:10.1021/acsomega.3c00160
PMID:37065046
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10099439/
Abstract

The discovery of new drug candidates to inhibit an intended target is a complex and resource-consuming process. A machine learning (ML) method for predicting drug-target interactions (DTI) is a potential solution to improve the efficiency. However, traditional ML approaches have limitations in accuracy. In this study, we developed a novel ensemble model CoGT for DTI prediction using multilayer perceptron (MLP), which integrated graph-based models to extract non-Euclidean molecular structures and large pretrained models, specifically chemBERTa, to process simplified molecular input line entry systems (SMILES). The performance of CoGT was evaluated using compounds inhibiting four Janus kinases (JAKs). Results showed that the large pretrained model, chemBERTa, was better than other conventional ML models in predicting DTI across multiple evaluation metrics, while the graph neural network (GNN) was effective for prediction on imbalanced data sets. To take full advantage of the strengths of these different models, we developed an ensemble model, CoGT, which outperformed other individual ML models in predicting compounds' inhibition on different isoforms of JAKs. Our data suggest that the ensemble model CoGT has the potential to accelerate the process of drug discovery.

摘要

发现用于抑制目标靶点的新药候选物是一个复杂且耗费资源的过程。一种用于预测药物 - 靶点相互作用(DTI)的机器学习(ML)方法是提高效率的潜在解决方案。然而,传统的ML方法在准确性方面存在局限性。在本研究中,我们使用多层感知器(MLP)开发了一种用于DTI预测的新型集成模型CoGT,该模型整合了基于图的模型以提取非欧几里得分子结构,并使用大型预训练模型,特别是chemBERTa,来处理简化分子输入线性表系统(SMILES)。使用抑制四种 Janus 激酶(JAKs)的化合物对CoGT的性能进行了评估。结果表明,在多个评估指标上,大型预训练模型chemBERTa在预测DTI方面优于其他传统ML模型,而图神经网络(GNN)在不平衡数据集的预测上很有效。为了充分利用这些不同模型的优势,我们开发了一个集成模型CoGT,它在预测化合物对不同JAK异构体的抑制作用方面优于其他单个ML模型。我们的数据表明,集成模型CoGT有加速药物发现过程的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/cbe3d98e9a97/ao3c00160_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/9c479fa5e550/ao3c00160_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/35896dbe3258/ao3c00160_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/c69c03466eda/ao3c00160_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/24fdd584947f/ao3c00160_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/59331dad6039/ao3c00160_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/c0ffa180aa94/ao3c00160_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/cbe3d98e9a97/ao3c00160_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/9c479fa5e550/ao3c00160_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/35896dbe3258/ao3c00160_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/c69c03466eda/ao3c00160_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/24fdd584947f/ao3c00160_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/59331dad6039/ao3c00160_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/c0ffa180aa94/ao3c00160_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/893b/10099439/cbe3d98e9a97/ao3c00160_0007.jpg

相似文献

1
CoGT: Ensemble Machine Learning Method and Its Application on JAK Inhibitor Discovery.CoGT:集成机器学习方法及其在JAK抑制剂发现中的应用
ACS Omega. 2023 Mar 27;8(14):13232-13242. doi: 10.1021/acsomega.3c00160. eCollection 2023 Apr 11.
2
Fine-tuning of BERT Model to Accurately Predict Drug-Target Interactions.微调BERT模型以准确预测药物-靶点相互作用。
Pharmaceutics. 2022 Aug 16;14(8):1710. doi: 10.3390/pharmaceutics14081710.
3
Comprehensive ensemble in QSAR prediction for drug discovery.用于药物发现的 QSAR 预测的综合集成。
BMC Bioinformatics. 2019 Oct 26;20(1):521. doi: 10.1186/s12859-019-3135-4.
4
An Ensemble Spectral Prediction (ESP) model for metabolite annotation.用于代谢物注释的集成谱预测 (ESP) 模型。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae490.
5
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.
6
Multiphysical graph neural network (MP-GNN) for COVID-19 drug design.多物理图神经网络(MP-GNN)在 COVID-19 药物设计中的应用。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac231.
7
Drug-target interaction prediction with tree-ensemble learning and output space reconstruction.基于树集成学习和输出空间重构的药物-靶标相互作用预测。
BMC Bioinformatics. 2020 Feb 7;21(1):49. doi: 10.1186/s12859-020-3379-z.
8
DTI-CDF: a cascade deep forest model towards the prediction of drug-target interactions based on hybrid features.DTI-CDF:一种基于混合特征的药物-靶标相互作用预测的级联深度森林模型。
Brief Bioinform. 2021 Jan 18;22(1):451-462. doi: 10.1093/bib/bbz152.
9
A development of a graph-based ensemble machine learning model for skin sensitization hazard and potency assessment.基于图的集成机器学习模型在皮肤致敏危害和效力评估中的开发。
J Appl Toxicol. 2022 Nov;42(11):1832-1842. doi: 10.1002/jat.4361. Epub 2022 Jul 20.
10
BE-DTI': Ensemble framework for drug target interaction prediction using dimensionality reduction and active learning.BE-DTI': 基于降维和主动学习的药物靶点相互作用预测集成框架。
Comput Methods Programs Biomed. 2018 Oct;165:151-162. doi: 10.1016/j.cmpb.2018.08.011. Epub 2018 Aug 22.

引用本文的文献

1
A gastrointestinal locally activating Janus kinase inhibitor to treat ulcerative colitis.一种用于治疗溃疡性结肠炎的胃肠道局部激活型 Janus 激酶抑制剂。
J Biol Chem. 2023 Dec;299(12):105467. doi: 10.1016/j.jbc.2023.105467. Epub 2023 Nov 17.

本文引用的文献

1
COVID-19 Multi-Targeted Drug Repurposing Using Few-Shot Learning.利用少样本学习进行COVID-19多靶点药物重新利用
Front Bioinform. 2021 Jun 15;1:693177. doi: 10.3389/fbinf.2021.693177. eCollection 2021.
2
First de novo deuterated drug poised for approval.首款有望获批的从头开始氘代药物。
Nat Rev Drug Discov. 2022 Sep;21(9):623-625. doi: 10.1038/d41573-022-00139-6.
3
Identification of novel off targets of baricitinib and tofacitinib by machine learning with a focus on thrombosis and viral infection.通过机器学习识别巴利昔替尼和托法替尼的新脱靶药物,重点关注血栓形成和病毒感染。
Sci Rep. 2022 May 12;12(1):7843. doi: 10.1038/s41598-022-11879-1.
4
A multitask GNN-based interpretable model for discovery of selective JAK inhibitors.一种基于多任务图神经网络的可解释模型,用于发现选择性JAK抑制剂。
J Cheminform. 2022 Mar 15;14(1):16. doi: 10.1186/s13321-022-00593-9.
5
Graph neural network approaches for drug-target interactions.图神经网络方法在药物-靶标相互作用中的应用。
Curr Opin Struct Biol. 2022 Apr;73:102327. doi: 10.1016/j.sbi.2021.102327. Epub 2022 Jan 21.
6
The JAK/STAT signaling pathway: from bench to clinic.JAK/STAT 信号通路:从基础到临床。
Signal Transduct Target Ther. 2021 Nov 26;6(1):402. doi: 10.1038/s41392-021-00791-1.
7
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
8
Novel Development of Predictive Feature Fingerprints to Identify Chemistry-Based Features for the Effective Drug Design of SARS-CoV-2 Target Antagonists and Inhibitors Using Machine Learning.利用机器学习开发预测特征指纹以识别基于化学的特征,用于严重急性呼吸综合征冠状病毒2(SARS-CoV-2)靶点拮抗剂和抑制剂的有效药物设计
ACS Omega. 2021 Feb 5;6(7):4857-4877. doi: 10.1021/acsomega.0c05303. eCollection 2021 Feb 23.
9
Machine learning identifies candidates for drug repurposing in Alzheimer's disease.机器学习确定阿尔茨海默病药物再利用的候选者。
Nat Commun. 2021 Feb 15;12(1):1033. doi: 10.1038/s41467-021-21330-0.
10
JAK-STAT pathway targeting for the treatment of inflammatory bowel disease.JAK-STAT 通路靶向治疗炎症性肠病。
Nat Rev Gastroenterol Hepatol. 2020 Jun;17(6):323-337. doi: 10.1038/s41575-020-0273-0. Epub 2020 Mar 19.