高效的多任务化学生物基因组学药物特异性预测。

Efficient multi-task chemogenomics for drug specificity prediction.

机构信息

Center for Computational Biology, Mines ParisTech, PSL Research University, Paris, France.

Institut Curie F-75248, Paris, France.

出版信息

PLoS One. 2018 Oct 4;13(10):e0204999. doi: 10.1371/journal.pone.0204999. eCollection 2018.

DOI:10.1371/journal.pone.0204999

PMID:30286165

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6171913/

Abstract

Adverse drug reactions, also called side effects, range from mild to fatal clinical events and significantly affect the quality of care. Among other causes, side effects occur when drugs bind to proteins other than their intended target. As experimentally testing drug specificity against the entire proteome is out of reach, we investigate the application of chemogenomics approaches. We formulate the study of drug specificity as a problem of predicting interactions between drugs and proteins at the proteome scale. We build several benchmark datasets, and propose NN-MT, a multi-task Support Vector Machine (SVM) algorithm that is trained on a limited number of data points, in order to solve the computational issues or proteome-wide SVM for chemogenomics. We compare NN-MT to different state-of-the-art methods, and show that its prediction performances are similar or better, at an efficient calculation cost. Compared to its competitors, the proposed method is particularly efficient to predict (protein, ligand) interactions in the difficult double-orphan case, i.e. when no interactions are previously known for the protein nor for the ligand. The NN-MT algorithm appears to be a good default method providing state-of-the-art or better performances, in a wide range of prediction scenario that are considered in the present study: proteome-wide prediction, protein family prediction, test (protein, ligand) pairs dissimilar to pairs in the train set, and orphan cases.

摘要

药物不良反应，也称副作用，范围从轻度到致命的临床事件，并显著影响医疗质量。除其他原因外，当药物与除预期靶标以外的蛋白质结合时，就会发生副作用。由于实验测试药物针对整个蛋白质组的特异性是无法实现的，我们研究了化学生物组学方法的应用。我们将药物特异性的研究表述为预测药物与蛋白质组范围内蛋白质之间相互作用的问题。我们构建了几个基准数据集，并提出了 NN-MT，这是一种多任务支持向量机（SVM）算法，它可以在有限数量的数据点上进行训练，以解决计算问题或蛋白质组范围的化学生物组学 SVM。我们将 NN-MT 与不同的最先进方法进行比较，并表明其预测性能相似或更好，计算成本效率更高。与竞争对手相比，该方法在预测困难的双重孤儿案例（即蛋白质和配体均无先前已知相互作用）中的（蛋白质，配体）相互作用时特别有效。NN-MT 算法似乎是一种很好的默认方法，可以在本研究中考虑的广泛预测场景中提供最先进或更好的性能：蛋白质组范围的预测、蛋白质家族预测、测试（蛋白质，配体）对与训练集中的对不相似的情况，以及孤儿案例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e27/6171913/4139c3679a38/pone.0204999.g001.jpg

相似文献

Efficient multi-task chemogenomics for drug specificity prediction.高效的多任务化学生物基因组学药物特异性预测。

PLoS One. 2018 Oct 4;13(10):e0204999. doi: 10.1371/journal.pone.0204999. eCollection 2018.

Evaluation of deep and shallow learning methods in chemogenomics for the prediction of drugs specificity.化学基因组学中用于预测药物特异性的深度学习和浅度学习方法评估。

J Cheminform. 2020 Feb 10;12(1):11. doi: 10.1186/s13321-020-0413-0.

Prediction of protein-protein interactions from amino acid sequences using a novel multi-scale continuous and discontinuous feature set.利用新型多尺度连续和非连续特征集从氨基酸序列预测蛋白质-蛋白质相互作用。

BMC Bioinformatics. 2014;15 Suppl 15(Suppl 15):S9. doi: 10.1186/1471-2105-15-S15-S9. Epub 2014 Dec 3.

Predicting drug side effects by multi-label learning and ensemble learning.通过多标签学习和集成学习预测药物副作用。

BMC Bioinformatics. 2015 Nov 4;16:365. doi: 10.1186/s12859-015-0774-y.

SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法：一种用于判别式多类别蛋白质折叠和超家族识别的工具。

BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.

Computational probing protein-protein interactions targeting small molecules.针对小分子的蛋白质-蛋白质相互作用的计算探测

Bioinformatics. 2016 Jan 15;32(2):226-34. doi: 10.1093/bioinformatics/btv528. Epub 2015 Sep 28.

Comprehensive prediction of drug-protein interactions and side effects for the human proteome.对人类蛋白质组的药物-蛋白质相互作用和副作用进行全面预测。

Sci Rep. 2015 Jun 9;5:11090. doi: 10.1038/srep11090.

Quantitative prediction of drug side effects based on drug-related features.基于药物相关特征的药物副作用定量预测

Interdiscip Sci. 2017 Sep;9(3):434-444. doi: 10.1007/s12539-017-0236-5. Epub 2017 May 17.

Drug-target interaction prediction via class imbalance-aware ensemble learning.通过类不平衡感知集成学习进行药物-靶点相互作用预测。

BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):509. doi: 10.1186/s12859-016-1377-y.

Protein subcellular localization prediction using multiple kernel learning based support vector machine.基于多核学习支持向量机的蛋白质亚细胞定位预测

Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.

引用本文的文献

Drug-Target Interactions Prediction at Scale: The Komet Algorithm with the LCIdb Dataset.大规模药物-靶点相互作用预测：Komet 算法与 LCIdb 数据集。

J Chem Inf Model. 2024 Sep 23;64(18):6938-6956. doi: 10.1021/acs.jcim.4c00422. Epub 2024 Sep 5.

Optimizing peptide inhibitors of SARS-Cov-2 nsp10/nsp16 methyltransferase predicted through molecular simulation and machine learning.通过分子模拟和机器学习预测优化严重急性呼吸综合征冠状病毒2（SARS-CoV-2）nsp10/nsp16甲基转移酶的肽抑制剂。

Inform Med Unlocked. 2022;29:100886. doi: 10.1016/j.imu.2022.100886. Epub 2022 Feb 28.

Drug Target Identification with Machine Learning: How to Choose Negative Examples.基于机器学习的药物靶点识别：如何选择负例。

Int J Mol Sci. 2021 May 12;22(10):5118. doi: 10.3390/ijms22105118.

Evaluation of deep and shallow learning methods in chemogenomics for the prediction of drugs specificity.化学基因组学中用于预测药物特异性的深度学习和浅度学习方法评估。

J Cheminform. 2020 Feb 10;12(1):11. doi: 10.1186/s13321-020-0413-0.

A Multi-Label Learning Framework for Drug Repurposing.一种用于药物再利用的多标签学习框架。

Pharmaceutics. 2019 Sep 9;11(9):466. doi: 10.3390/pharmaceutics11090466.

本文引用的文献

Docking-based inverse virtual screening: methods, applications, and challenges.基于对接的反向虚拟筛选：方法、应用及挑战

Biophys Rep. 2018;4(1):1-16. doi: 10.1007/s41048-017-0045-8. Epub 2018 Feb 1.

State of the Art Review and Report of New Tool for Drug Discovery.药物发现新工具的技术现状综述与报告

Curr Top Med Chem. 2017;17(26):2957-2976. doi: 10.2174/1568026617666170821123856.

Inferring Chemogenomic Features from Drug-Target Interaction Networks.从药物-靶点相互作用网络推断化学基因组学特征。

Mol Inform. 2013 Dec;32(11-12):991-9. doi: 10.1002/minf.201300079. Epub 2013 Dec 10.

DrugE-Rank: improving drug-target interaction prediction of new candidate drugs or targets by ensemble learning to rank.DrugE-Rank：通过集成学习排序改进新候选药物或靶点的药物-靶点相互作用预测。

Bioinformatics. 2016 Jun 15;32(12):i18-i27. doi: 10.1093/bioinformatics/btw244.

Innovation in the pharmaceutical industry: New estimates of R&D costs.制药行业的创新：研发成本的新估计

J Health Econ. 2016 May;47:20-33. doi: 10.1016/j.jhealeco.2016.01.012. Epub 2016 Feb 12.

Neighborhood Regularized Logistic Matrix Factorization for Drug-Target Interaction Prediction.用于药物-靶点相互作用预测的邻域正则化逻辑矩阵分解

PLoS Comput Biol. 2016 Feb 12;12(2):e1004760. doi: 10.1371/journal.pcbi.1004760. eCollection 2016 Feb.

Post-marketing withdrawal of 462 medicinal products because of adverse drug reactions: a systematic review of the world literature.因药物不良反应导致462种药品上市后撤市：对世界文献的系统评价

BMC Med. 2016 Feb 4;14:10. doi: 10.1186/s12916-016-0553-2.

A multiple kernel learning algorithm for drug-target interaction prediction.一种用于药物-靶点相互作用预测的多核学习算法。

BMC Bioinformatics. 2016 Jan 22;17:46. doi: 10.1186/s12859-016-0890-3.

Predicting target proteins for drug candidate compounds based on drug-induced gene expression data in a chemical structure-independent manner.基于药物诱导的基因表达数据以化学结构无关的方式预测候选药物化合物的靶蛋白。

BMC Med Genomics. 2015 Dec 18;8:82. doi: 10.1186/s12920-015-0158-1.

Toward more realistic drug-target interaction predictions.迈向更现实的药物-靶点相互作用预测。

Brief Bioinform. 2015 Mar;16(2):325-37. doi: 10.1093/bib/bbu010. Epub 2014 Apr 9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

高效的多任务化学生物基因组学药物特异性预测。

Efficient multi-task chemogenomics for drug specificity prediction.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献