• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于差异数据的半监督学习。

Semisupervised learning from dissimilarity data.

作者信息

Trosset Michael W, Priebe Carey E, Park Youngser, Miller Michael I

机构信息

Department of Statistics, Indiana University, Bloomington, IN 47405, USA.

出版信息

Comput Stat Data Anal. 2008 Jun 15;52(10):4643-4657. doi: 10.1016/j.csda.2008.02.030.

DOI:10.1016/j.csda.2008.02.030
PMID:20407600
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2856100/
Abstract

The following two-stage approach to learning from dissimilarity data is described: (1) embed both labeled and unlabeled objects in a Euclidean space; then (2) train a classifier on the labeled objects. The use of linear discriminant analysis for (2), which naturally invites the use of classical multidimensional scaling for (1), is emphasized. The choice of the dimension of the Euclidean space in (1) is a model selection problem; too few or too many dimensions can degrade classifier performance. The question of how the inclusion of unlabeled objects in (1) affects classifier performance is investigated. In the case of spherical covariances, including unlabeled objects in (1) is demonstrably superior. Several examples are presented.

摘要

本文描述了一种从差异数据中学习的两阶段方法

(1) 将有标签和无标签的对象嵌入欧几里得空间;然后 (2) 在有标签的对象上训练分类器。强调在 (2) 中使用线性判别分析,这自然会促使在 (1) 中使用经典多维缩放。(1) 中欧几里得空间维度的选择是一个模型选择问题;维度太少或太多都会降低分类器性能。研究了在 (1) 中包含无标签对象如何影响分类器性能的问题。在球形协方差的情况下,在 (1) 中包含无标签对象明显更优。给出了几个例子。

相似文献

1
Semisupervised learning from dissimilarity data.基于差异数据的半监督学习。
Comput Stat Data Anal. 2008 Jun 15;52(10):4643-4657. doi: 10.1016/j.csda.2008.02.030.
2
Spherical and Hyperbolic Embeddings of Data.球面和双曲嵌入数据。
IEEE Trans Pattern Anal Mach Intell. 2014 Nov;36(11):2255-69. doi: 10.1109/TPAMI.2014.2316836.
3
Enhancement of breast CADx with unlabeled data.利用无标注数据增强乳腺 CADx。
Med Phys. 2010 Aug;37(8):4155-72. doi: 10.1118/1.3455704.
4
Semisupervised Feature Selection via Structured Manifold Learning.基于结构流形学习的半监督特征选择。
IEEE Trans Cybern. 2022 Jul;52(7):5756-5766. doi: 10.1109/TCYB.2021.3052847. Epub 2022 Jul 4.
5
Efficient Active Learning by Querying Discriminative and Representative Samples and Fully Exploiting Unlabeled Data.通过查询有判别力和代表性的样本并充分利用未标记数据实现高效主动学习。
IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):4111-4122. doi: 10.1109/TNNLS.2020.3016928. Epub 2021 Aug 31.
6
Semisupervised generalized discriminant analysis.半监督广义判别分析
IEEE Trans Neural Netw. 2011 Aug;22(8):1207-17. doi: 10.1109/TNN.2011.2156808. Epub 2011 Jun 30.
7
Semisupervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle.基于最大熵原理的混合生成/判别式分类器的半监督学习
IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):424-37. doi: 10.1109/TPAMI.2007.70710.
8
Joint Label Inference and Discriminant Embedding.联合标签推理与判别式嵌入
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4413-4423. doi: 10.1109/TNNLS.2021.3057270. Epub 2022 Aug 31.
9
How to Trust Unlabeled Data? Instance Credibility Inference for Few-Shot Learning.如何信任未标记的数据?小样本学习中的实例可信度推断。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6240-6253. doi: 10.1109/TPAMI.2021.3086140. Epub 2022 Sep 14.
10
On component-wise dissimilarity measures and metric properties in pattern recognition.关于模式识别中的逐分量差异度量和度量性质
PeerJ Comput Sci. 2022 Oct 10;8:e1106. doi: 10.7717/peerj-cs.1106. eCollection 2022.

引用本文的文献

1
Matchability of heterogeneous networks pairs.异构网络对的匹配性
Inf inference. 2020 Dec;9(4):749-783. doi: 10.1093/imaiai/iaz031. Epub 2020 Jan 6.
2
A multivariate distance-based analytic framework for connectome-wide association studies.一种基于多元距离的连接组学全基因组关联研究分析框架。
Neuroimage. 2014 Jun;93 Pt 1(0 1):74-94. doi: 10.1016/j.neuroimage.2014.02.024. Epub 2014 Feb 28.
3
Bios2mds: an R package for comparing orthologous protein families by metric multidimensional scaling.Bios2mds:一个通过度量多维尺度比较直系同源蛋白家族的 R 包。
BMC Bioinformatics. 2012 Jun 15;13:133. doi: 10.1186/1471-2105-13-133.
4
Metric distances between hippocampal shapes indicate different rates of change over time in nondemented and demented subjects.海马形状的度量距离表明非痴呆和痴呆受试者随时间变化的速率不同。
Curr Alzheimer Res. 2012 Oct;9(8):972-81. doi: 10.2174/156720512803251138.
5
Multidimensional scaling reveals the main evolutionary pathways of class A G-protein-coupled receptors.多维标度揭示了 A 类 G 蛋白偶联受体的主要进化途径。
PLoS One. 2011 Apr 22;6(4):e19094. doi: 10.1371/journal.pone.0019094.

本文引用的文献

1
Multidimensional scaling.多维尺度分析
Annu Rev Psychol. 1980;31:607-49. doi: 10.1146/annurev.ps.31.020180.003135.
2
Collaborative computational anatomy: an MRI morphometry study of the human brain via diffeomorphic metric mapping.协作计算解剖学:通过微分同胚度量映射对人类大脑进行的磁共振成像形态测量研究。
Hum Brain Mapp. 2009 Jul;30(7):2132-41. doi: 10.1002/hbm.20655.
3
Multidimensional scaling of similarity.相似度的多维缩放
Psychometrika. 1965 Dec;30(4):379-93. doi: 10.1007/BF02289530.