• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不确定邻近学习:综述

Indefinite Proximity Learning: A Review.

作者信息

Schleif Frank-Michael, Tino Peter

机构信息

University of Birmingham, School of Computer Science, B15 2TT, Birmingham, U.K.

出版信息

Neural Comput. 2015 Oct;27(10):2039-96. doi: 10.1162/NECO_a_00770. Epub 2015 Aug 27.

DOI:10.1162/NECO_a_00770
PMID:26313601
Abstract

Efficient learning of a data analysis task strongly depends on the data representation. Most methods rely on (symmetric) similarity or dissimilarity representations by means of metric inner products or distances, providing easy access to powerful mathematical formalisms like kernel or branch-and-bound approaches. Similarities and dissimilarities are, however, often naturally obtained by nonmetric proximity measures that cannot easily be handled by classical learning algorithms. Major efforts have been undertaken to provide approaches that can either directly be used for such data or to make standard methods available for these types of data. We provide a comprehensive survey for the field of learning with nonmetric proximities. First, we introduce the formalism used in nonmetric spaces and motivate specific treatments for nonmetric proximity data. Second, we provide a systematization of the various approaches. For each category of approaches, we provide a comparative discussion of the individual algorithms and address complexity issues and generalization properties. In a summarizing section, we provide a larger experimental study for the majority of the algorithms on standard data sets. We also address the problem of large-scale proximity learning, which is often overlooked in this context and of major importance to make the method relevant in practice. The algorithms we discuss are in general applicable for proximity-based clustering, one-class classification, classification, regression, and embedding approaches. In the experimental part, we focus on classification tasks.

摘要

高效学习数据分析任务很大程度上依赖于数据表示。大多数方法借助度量内积或距离依赖于(对称的)相似性或不相似性表示,这便于使用强大的数学形式体系,如核方法或分支定界法。然而,相似性和不相似性通常是通过非度量接近度度量自然获得的,而经典学习算法难以处理这些度量。人们已经做出了重大努力来提供可以直接用于此类数据的方法,或者使标准方法适用于这些类型的数据。我们对非度量接近度学习领域进行了全面综述。首先,我们介绍非度量空间中使用的形式体系,并阐述对非度量接近度数据的特定处理方法。其次,我们对各种方法进行了系统化整理。对于每一类方法,我们对各个算法进行了比较讨论,并探讨了复杂度问题和泛化特性。在总结部分,我们针对标准数据集上的大多数算法进行了规模更大的实验研究。我们还讨论了大规模接近度学习问题,这个问题在此背景下常常被忽视,但对于使该方法在实际中具有相关性至关重要。我们讨论的算法通常适用于基于接近度的聚类、单类分类、分类、回归和嵌入方法。在实验部分,我们重点关注分类任务。

相似文献

1
Indefinite Proximity Learning: A Review.不确定邻近学习:综述
Neural Comput. 2015 Oct;27(10):2039-96. doi: 10.1162/NECO_a_00770. Epub 2015 Aug 27.
2
Protein Sequence Analysis by Proximities.基于邻近性的蛋白质序列分析
Methods Mol Biol. 2016;1362:185-95. doi: 10.1007/978-1-4939-3106-4_12.
3
Topographic mapping of large dissimilarity data sets.大型差异数据集的地形制图。
Neural Comput. 2010 Sep 1;22(9):2229-84. doi: 10.1162/NECO_a_00012.
4
Kernel discriminant analysis for positive definite and indefinite kernels.用于正定和不定核的核判别分析。
IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1017-32. doi: 10.1109/TPAMI.2008.290.
5
A scalable kernel-based semisupervised metric learning algorithm with out-of-sample generalization ability.一种具有样本外泛化能力的可扩展的基于核的半监督度量学习算法。
Neural Comput. 2008 Nov;20(11):2839-61. doi: 10.1162/neco.2008.05-07-528.
6
Automated induction of heterogeneous proximity measures for supervised spectral embedding.自动化诱导监督谱嵌入的异质邻近度测度。
IEEE Trans Neural Netw Learn Syst. 2013 Oct;24(10):1575-87. doi: 10.1109/TNNLS.2013.2261613.
7
On component-wise dissimilarity measures and metric properties in pattern recognition.关于模式识别中的逐分量差异度量和度量性质
PeerJ Comput Sci. 2022 Oct 10;8:e1106. doi: 10.7717/peerj-cs.1106. eCollection 2022.
8
Online multiple kernel similarity learning for visual search.在线多核相似性学习的视觉搜索。
IEEE Trans Pattern Anal Mach Intell. 2014 Mar;36(3):536-49. doi: 10.1109/TPAMI.2013.149.
9
Ties in proximity and clustering compounds.与邻近性和聚类化合物相关。
J Chem Inf Comput Sci. 2001 Jan-Feb;41(1):134-46. doi: 10.1021/ci000069q.
10
Calibration by correlation using metric embedding from nonmetric similarities.使用非度量相似性的度量嵌入进行关联校准。
IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2357-70. doi: 10.1109/TPAMI.2013.34.

引用本文的文献

1
A unified framework for the integration of multiple hierarchical clusterings or networks from multi-source data.一种用于整合多源数据中多个层次聚类或网络的统一框架。
BMC Bioinformatics. 2021 Aug 4;22(1):392. doi: 10.1186/s12859-021-04303-4.
2
Generalized Term Similarity for Feature Selection in Text Classification Using Quadratic Programming.基于二次规划的文本分类特征选择中的广义术语相似度
Entropy (Basel). 2020 Mar 30;22(4):395. doi: 10.3390/e22040395.