Kernels for generalized multiple-instance learning.

Author information

Tao Qingping, Scott Stephen D, Vinodchandran N V, Osugi Thomas Takeo, Mueller Brandon

Affiliation

GC Image, LLC, Lincoln, NE 68505, USA.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2008 Dec;30(12):2084-98. doi: 10.1109/TPAMI.2007.70846.

DOI:10.1109/TPAMI.2007.70846

PMID:18988944

Abstract

The multiple-instance learning (MIL) model has been successful in numerous application areas. Recently, a generalization of this model and an algorithm for it were introduced, showing significant advantages over the conventional MIL model in certain application areas. Unfortunately, that algorithm is not scalable to high dimensions. We adapt that algorithm to one using a support vector machine with our new kernel $k_{\wedge}$. This reduces the time complexity from exponential in the dimension to polynomial. Computing our new kernel is equivalent to counting the number of boxes in a discrete, bounded space that contain at least one point from each of two multisets. We show that this problem is #P-complete, but then give a fully polynomial randomized approximation scheme (FPRAS) for it. We then extend $k_{\wedge}$ by enriching its representation into a new kernel $k_{\min}$, and also consider a normalized version of $k_{\wedge}$ that we call $k_{\wedge/\vee}$ (which may or may not be a kernel, but whose approximation yielded positive semidefinite Gram matrices in practice). We then empirically evaluate all three measures on data from content-based image retrieval, biological sequence analysis, and the musk data sets. We found that our kernels performed well on all data sets relative to algorithms in the conventional MIL model.
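
The counting problem described in the abstract can be illustrated with a brute-force sketch. This is not the authors' algorithm or their FPRAS; it simply enumerates every box, so its cost is exponential in the dimension. The function name `count_boxes`, the integer grid `{1, ..., s}^d`, and the reading of "box" as a closed axis-parallel box are assumptions made for illustration only.

```python
from itertools import combinations_with_replacement, product


def count_boxes(P, Q, s):
    """Count axis-parallel boxes in {1..s}^d that contain at least one
    point of P and at least one point of Q (hypothetical helper).

    P and Q are lists of d-dimensional integer tuples (multisets of points).
    """
    d = len(P[0])
    # All closed intervals [lo, hi] with 1 <= lo <= hi <= s along one axis.
    intervals = list(combinations_with_replacement(range(1, s + 1), 2))

    def contains(box, point):
        # A box is a tuple of d intervals; a point lies inside it when
        # every coordinate falls within the matching interval.
        return all(lo <= x <= hi for (lo, hi), x in zip(box, point))

    count = 0
    # Enumerate every box: one interval per dimension.
    for box in product(intervals, repeat=d):
        hits_p = any(contains(box, p) for p in P)
        hits_q = any(contains(box, q) for q in Q)
        if hits_p and hits_q:
            count += 1
    return count
```

For example, `count_boxes([(1, 1)], [(2, 2)], 2)` enumerates the 9 boxes of a 2 x 2 grid and counts those covering both points. The number of boxes grows as `(s(s+1)/2)^d`, which is why exact enumeration does not scale and an approximation scheme is of interest.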

Similar articles

Kernel discriminant analysis for positive definite and indefinite kernels.

IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1017-32. doi: 10.1109/TPAMI.2008.290.

A scalable kernel-based semisupervised metric learning algorithm with out-of-sample generalization ability.

Neural Comput. 2008 Nov;20(11):2839-61. doi: 10.1162/neco.2008.05-07-528.

Sparse multiple kernel learning for signal processing applications.

IEEE Trans Pattern Anal Mach Intell. 2010 May;32(5):788-98. doi: 10.1109/TPAMI.2009.98.

Density-weighted Nyström method for computing large kernel eigensystems.

Neural Comput. 2009 Jan;21(1):121-46. doi: 10.1162/neco.2008.11-07-651.

Learning kernels from biological networks by maximizing entropy.

Bioinformatics. 2004 Aug 4;20 Suppl 1:i326-33. doi: 10.1093/bioinformatics/bth906.

Multisurface proximal support vector machine classification via generalized eigenvalues.

IEEE Trans Pattern Anal Mach Intell. 2006 Jan;28(1):69-74. doi: 10.1109/TPAMI.2006.17.

Efficient tracking of the dominant eigenspace of a normalized kernel matrix.

Neural Comput. 2008 Feb;20(2):523-54. doi: 10.1162/neco.2007.05-06-213.

Efficient recognition of highly similar 3D objects in range images.

IEEE Trans Pattern Anal Mach Intell. 2009 Jan;31(1):172-9. doi: 10.1109/TPAMI.2008.176.

SemiBoost: boosting for semi-supervised learning.

IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2000-14. doi: 10.1109/TPAMI.2008.235.

Cited by

Predicting MHC-II binding affinity using multiple instance regression.

IEEE/ACM Trans Comput Biol Bioinform. 2011 Jul-Aug;8(4):1067-79. doi: 10.1109/TCBB.2010.94.
