使用多种成对核函数进行药物生物活性预测。

Learning with multiple pairwise kernels for drug bioactivity prediction.

机构信息

Department of Computer Science, Helsinki Institute for Information Technology HIIT, Aalto University, Espoo, Finland.

Institute for Molecular Medicine Finland FIMM, University of Helsinki, Helsinki, Finland.

出版信息

Bioinformatics. 2018 Jul 1;34(13):i509-i518. doi: 10.1093/bioinformatics/bty277.

DOI:10.1093/bioinformatics/bty277

PMID:29949975

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6022556/

Abstract

MOTIVATION

Many inference problems in bioinformatics, including drug bioactivity prediction, can be formulated as pairwise learning problems, in which one is interested in making predictions for pairs of objects, e.g. drugs and their targets. Kernel-based approaches have emerged as powerful tools for solving problems of that kind, and especially multiple kernel learning (MKL) offers promising benefits as it enables integrating various types of complex biomedical information sources in the form of kernels, along with learning their importance for the prediction task. However, the immense size of pairwise kernel spaces remains a major bottleneck, making the existing MKL algorithms computationally infeasible even for small number of input pairs.

RESULTS

We introduce pairwiseMKL, the first method for time- and memory-efficient learning with multiple pairwise kernels. pairwiseMKL first determines the mixture weights of the input pairwise kernels, and then learns the pairwise prediction function. Both steps are performed efficiently without explicit computation of the massive pairwise matrices, therefore making the method applicable to solving large pairwise learning problems. We demonstrate the performance of pairwiseMKL in two related tasks of quantitative drug bioactivity prediction using up to 167 995 bioactivity measurements and 3120 pairwise kernels: (i) prediction of anticancer efficacy of drug compounds across a large panel of cancer cell lines; and (ii) prediction of target profiles of anticancer compounds across their kinome-wide target spaces. We show that pairwiseMKL provides accurate predictions using sparse solutions in terms of selected kernels, and therefore it automatically identifies also data sources relevant for the prediction problem.

AVAILABILITY AND IMPLEMENTATION

Code is available at https://github.com/aalto-ics-kepaco.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

许多生物信息学中的推理问题，包括药物生物活性预测，可以被表述为成对学习问题，人们有兴趣对对象对（例如药物与其靶标）进行预测。基于核的方法已经成为解决这类问题的强大工具，尤其是多核学习（MKL）提供了有希望的好处，因为它能够以核的形式整合各种类型的复杂生物医学信息源，并学习它们对预测任务的重要性。然而，成对核空间的巨大规模仍然是一个主要的瓶颈，使得现有的 MKL 算法即使对于少量的输入对也在计算上不可行。

结果

我们引入了 pairwiseMKL，这是一种用于具有多个成对核的时间和内存高效学习的第一个方法。pairwiseMKL 首先确定输入成对核的混合权重，然后学习成对预测函数。这两个步骤都是在不明确计算大规模成对矩阵的情况下高效地执行的，因此使得该方法适用于解决大型成对学习问题。我们在使用多达 167995 个生物活性测量值和 3120 个成对核的两个相关任务中展示了 pairwiseMKL 的性能：（i）在大量癌细胞系中预测药物化合物的抗癌疗效；（ii）在激酶组范围内预测抗癌化合物的靶标谱。我们表明，pairwiseMKL 使用所选核的稀疏解决方案提供了准确的预测，因此它自动识别了与预测问题相关的数据来源。

可用性和实现

代码可在 https://github.com/aalto-ics-kepaco 获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8823/6022556/d350a1ae6dd2/bty277f1.jpg

相似文献

Learning with multiple pairwise kernels for drug bioactivity prediction.

Bioinformatics. 2018 Jul 1;34(13):i509-i518. doi: 10.1093/bioinformatics/bty277.

Metabolic network prediction through pairwise rational kernels.

BMC Bioinformatics. 2014 Sep 26;15(1):318. doi: 10.1186/1471-2105-15-318.

L2-norm multiple kernel learning and its application to biomedical data fusion.

BMC Bioinformatics. 2010 Jun 8;11:309. doi: 10.1186/1471-2105-11-309.

DeepDRK: a deep learning framework for drug repurposing through kernel-based multi-omics integration.

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab048.

A Drug-Target Network-Based Supervised Machine Learning Repurposing Method Allowing the Use of Multiple Heterogeneous Information Sources.

Methods Mol Biol. 2019;1903:281-289. doi: 10.1007/978-1-4939-8955-3_17.

LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.

Bioinformatics. 2018 Oct 1;34(19):3281-3288. doi: 10.1093/bioinformatics/bty349.

Systematic identification of feature combinations for predicting drug response with Bayesian multi-view multi-task linear regression.

Bioinformatics. 2017 Jul 15;33(14):i359-i368. doi: 10.1093/bioinformatics/btx266.

A multiple kernel learning algorithm for drug-target interaction prediction.

BMC Bioinformatics. 2016 Jan 22;17:46. doi: 10.1186/s12859-016-0890-3.

Fast and interpretable genomic data analysis using multiple approximate kernel learning.

Bioinformatics. 2022 Jun 24;38(Suppl 1):i77-i83. doi: 10.1093/bioinformatics/btac241.

Modeling drug combination effects via latent tensor reconstruction.

Bioinformatics. 2021 Jul 12;37(Suppl_1):i93-i101. doi: 10.1093/bioinformatics/btab308.

引用本文的文献

A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.

GS-DTA: integrating graph and sequence models for predicting drug-target binding affinity.

BMC Genomics. 2025 Feb 4;26(1):105. doi: 10.1186/s12864-025-11234-4.

Drug-target binding affinity prediction based on power graph and word2vec.

BMC Med Genomics. 2025 Jan 13;18(Suppl 1):9. doi: 10.1186/s12920-024-02073-5.

Integrating molecular, biochemical, and immunohistochemical features as predictors of hepatocellular carcinoma drug response using machine-learning algorithms.

Front Mol Biosci. 2024 Oct 16;11:1430794. doi: 10.3389/fmolb.2024.1430794. eCollection 2024.

MolMVC: Enhancing molecular representations for drug-related tasks through multi-view contrastive learning.

Bioinformatics. 2024 Sep 1;40(Suppl 2):ii190-ii197. doi: 10.1093/bioinformatics/btae386.

Leveraging multiple data types for improved compound-kinase bioactivity prediction.

Nat Commun. 2024 Aug 31;15(1):7596. doi: 10.1038/s41467-024-52055-5.

A review of deep learning methods for ligand based drug virtual screening.

Fundam Res. 2024 Mar 11;4(4):715-737. doi: 10.1016/j.fmre.2024.02.011. eCollection 2024 Jul.

GEFormerDTA: drug target affinity prediction based on transformer graph for early fusion.

Sci Rep. 2024 Mar 28;14(1):7416. doi: 10.1038/s41598-024-57879-1.

DGDTA: dynamic graph attention network for predicting drug-target binding affinity.

BMC Bioinformatics. 2023 Sep 30;24(1):367. doi: 10.1186/s12859-023-05497-5.

Dose-response prediction for in-vitro drug combination datasets: a probabilistic approach.

BMC Bioinformatics. 2023 Apr 21;24(1):161. doi: 10.1186/s12859-023-05256-6.

本文引用的文献

PharmacoDB: an integrative database for mining in vitro anticancer drug screening studies.

Nucleic Acids Res. 2018 Jan 4;46(D1):D994-D1002. doi: 10.1093/nar/gkx911.

Global proteomics profiling improves drug sensitivity prediction: results from a multi-omics, pan-cancer modeling approach.

Bioinformatics. 2018 Apr 15;34(8):1353-1362. doi: 10.1093/bioinformatics/btx766.

Computational-experimental approach to drug-target interaction mapping: A case study on kinase inhibitors.

PLoS Comput Biol. 2017 Aug 7;13(8):e1005678. doi: 10.1371/journal.pcbi.1005678. eCollection 2017 Aug.

Fast Kronecker Product Kernel Methods via Generalized Vec Trick.

IEEE Trans Neural Netw Learn Syst. 2018 Aug;29(8):3374-3387. doi: 10.1109/TNNLS.2017.2727545. Epub 2017 Aug 1.

Kinome-Wide Profiling Prediction of Small Molecules.

ChemMedChem. 2018 Mar 20;13(6):495-499. doi: 10.1002/cmdc.201700180. Epub 2017 Jun 26.

Profiling Prediction of Kinase Inhibitors: Toward the Virtual Assay.

J Med Chem. 2017 Jan 12;60(1):474-485. doi: 10.1021/acs.jmedchem.6b01611. Epub 2016 Dec 14.

Multi-omic data integration enables discovery of hidden biological regularities.

Nat Commun. 2016 Oct 26;7:13091. doi: 10.1038/ncomms13091.

Drug response prediction by inferring pathway-response associations with kernelized Bayesian matrix factorization.

Bioinformatics. 2016 Sep 1;32(17):i455-i463. doi: 10.1093/bioinformatics/btw433.

Computational models for predicting drug responses in cancer research.

Brief Bioinform. 2017 Sep 1;18(5):820-829. doi: 10.1093/bib/bbw065.

Machine Learning of Protein Interactions in Fungal Secretory Pathways.

PLoS One. 2016 Jul 21;11(7):e0159302. doi: 10.1371/journal.pone.0159302. eCollection 2016.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用多种成对核函数进行药物生物活性预测。

Learning with multiple pairwise kernels for drug bioactivity prediction.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献