核化贝叶斯矩阵分解。

Kernelized Bayesian Matrix Factorization.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2014 Oct;36(10):2047-60. doi: 10.1109/TPAMI.2014.2313125.

DOI:10.1109/TPAMI.2014.2313125

Abstract

We extend kernelized matrix factorization with a full-Bayesian treatment and with an ability to work with multiple side information sources expressed as different kernels. Kernels have been introduced to integrate side information about the rows and columns, which is necessary for making out-of-matrix predictions. We discuss specifically binary output matrices but extensions to realvalued matrices are straightforward. We extend the state of the art in two key aspects: (i) A full-conjugate probabilistic formulation of the kernelized matrix factorization enables an efficient variational approximation, whereas full-Bayesian treatments are not computationally feasible in the earlier approaches. (ii) Multiple side information sources are included, treated as different kernels in multiple kernel learning which additionally reveals which side sources are informative. We then show that the framework can also be used for supervised and semi-supervised multilabel classification and multi-output regression, by considering samples and outputs as the domains where matrix factorization operates. Our method outperforms alternatives in predicting drug-protein interactions on two data sets. On multilabel classification, our algorithm obtains the lowest Hamming losses on 10 out of 14 data sets compared to five state-of-the-art multilabel classification algorithms. We finally show that the proposed approach outperforms alternatives in multi-output regression experiments on a yeast cell cycle data set.

摘要

我们通过全贝叶斯处理和处理多个边信息源的能力扩展了核矩阵分解，这些边信息源表示为不同的核。核被引入到矩阵外预测中，以整合关于行和列的边信息，这是必要的。我们特别讨论了二进制输出矩阵，但对实值矩阵的扩展是直接的。我们在两个关键方面扩展了现有技术：（i）核矩阵分解的全共轭概率公式化使有效的变分逼近成为可能，而早期方法中的全贝叶斯处理在计算上是不可行的。（ii）包括多个边信息源，将其视为多内核学习中的不同核，这进一步揭示了哪些边源是信息丰富的。然后，我们通过将样本和输出视为矩阵分解操作的域，表明该框架也可用于监督和半监督多标签分类和多输出回归。我们的方法在两个数据集上预测药物-蛋白质相互作用方面优于其他方法。在多标签分类方面，与五种最先进的多标签分类算法相比，我们的算法在 14 个数据集的 10 个数据集中获得了最低的汉明损失。最后，我们表明，在酵母细胞周期数据集的多输出回归实验中，所提出的方法优于其他方法。

相似文献

Kernelized Bayesian Matrix Factorization.

IEEE Trans Pattern Anal Mach Intell. 2014 Oct;36(10):2047-60. doi: 10.1109/TPAMI.2014.2313125.

Bayesian supervised dimensionality reduction.

IEEE Trans Cybern. 2013 Dec;43(6):2179-89. doi: 10.1109/TCYB.2013.2245321.

Sparse Bayesian modeling with adaptive kernel learning.

IEEE Trans Neural Netw. 2009 Jun;20(6):926-37. doi: 10.1109/TNN.2009.2014060. Epub 2009 May 5.

Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection.

Bioinformatics. 2008 May 15;24(10):1264-70. doi: 10.1093/bioinformatics/btn112. Epub 2008 Mar 31.

Kernelized Sparse Bayesian Matrix Factorization.

IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):391-404. doi: 10.1109/TNNLS.2020.2978761. Epub 2021 Jan 4.

Efficient Kernelized prototype based classification.

Int J Neural Syst. 2011 Dec;21(6):443-57. doi: 10.1142/S012906571100295X.

VB-MK-LMF: fusion of drugs, targets and interactions using variational Bayesian multiple kernel logistic matrix factorization.

BMC Bioinformatics. 2017 Oct 4;18(1):440. doi: 10.1186/s12859-017-1845-z.

Drug response prediction by inferring pathway-response associations with kernelized Bayesian matrix factorization.

Bioinformatics. 2016 Sep 1;32(17):i455-i463. doi: 10.1093/bioinformatics/btw433.

Bayesian methods for predicting interacting protein pairs using domain information.

Biometrics. 2007 Sep;63(3):824-33. doi: 10.1111/j.1541-0420.2007.00755.x.

Automatic relevance determination in nonnegative matrix factorization with the β-divergence.

IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1592-605. doi: 10.1109/TPAMI.2012.240.

引用本文的文献

Advances and challenges in drug repurposing in precision therapeutics of colorectal cancer.

World J Gastrointest Oncol. 2025 Jul 15;17(7):107681. doi: 10.4251/wjgo.v17.i7.107681.

Kernel Bayesian nonlinear matrix factorization based on variational inference for human-virus protein-protein interaction prediction.

Sci Rep. 2024 Mar 8;14(1):5693. doi: 10.1038/s41598-024-56208-w.

Predicting non-small cell lung cancer-related genes by a new network-based machine learning method.

Front Oncol. 2022 Sep 20;12:981154. doi: 10.3389/fonc.2022.981154. eCollection 2022.

Predicting Drug-Disease Association Based on Ensemble Strategy.

Front Genet. 2021 May 3;12:666575. doi: 10.3389/fgene.2021.666575. eCollection 2021.

Evaluation of deep and shallow learning methods in chemogenomics for the prediction of drugs specificity.

J Cheminform. 2020 Feb 10;12(1):11. doi: 10.1186/s13321-020-0413-0.

Modelling G×E with historical weather information improves genomic prediction in new environments.

Bioinformatics. 2019 Oct 15;35(20):4045-4052. doi: 10.1093/bioinformatics/btz197.

FKL-Spa-LapRLS: an accurate method for identifying human microRNA-disease association.

BMC Genomics. 2018 Dec 31;19(Suppl 10):911. doi: 10.1186/s12864-018-5273-x.

SimBoost: a read-across approach for predicting drug-target binding affinities using gradient boosting machines.

J Cheminform. 2017 Apr 18;9(1):24. doi: 10.1186/s13321-017-0209-z.

VB-MK-LMF: fusion of drugs, targets and interactions using variational Bayesian multiple kernel logistic matrix factorization.

BMC Bioinformatics. 2017 Oct 4;18(1):440. doi: 10.1186/s12859-017-1845-z.

Predicting drug-target interactions by dual-network integrated logistic matrix factorization.

Sci Rep. 2017 Jan 12;7:40376. doi: 10.1038/srep40376.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

核化贝叶斯矩阵分解。

Kernelized Bayesian Matrix Factorization.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献