• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高维判别分析中的未观测类别与额外变量。

Unobserved classes and extra variables in high-dimensional discriminant analysis.

作者信息

Fop Michael, Mattei Pierre-Alexandre, Bouveyron Charles, Murphy Thomas Brendan

机构信息

School of Mathematics & Statistics, University College Dublin, Dublin, Ireland.

Université Côte d'Azur, Inria, CNRS, Laboratoire J.A. Dieudonné, Maasai team, Nice, France.

出版信息

Adv Data Anal Classif. 2022;16(1):55-92. doi: 10.1007/s11634-021-00474-3. Epub 2022 Mar 1.

DOI:10.1007/s11634-021-00474-3
PMID:35308632
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8924148/
Abstract

In supervised classification problems, the test set may contain data points belonging to classes not observed in the learning phase. Moreover, the same units in the test data may be measured on a set of additional variables recorded at a subsequent stage with respect to when the learning sample was collected. In this situation, the classifier built in the learning phase needs to adapt to handle potential unknown classes and the extra dimensions. We introduce a model-based discriminant approach, Dimension-Adaptive Mixture Discriminant Analysis (D-AMDA), which can detect unobserved classes and adapt to the increasing dimensionality. Model estimation is carried out via a full inductive approach based on an EM algorithm. The method is then embedded in a more general framework for adaptive variable selection and classification suitable for data of large dimensions. A simulation study and an artificial experiment related to classification of adulterated honey samples are used to validate the ability of the proposed framework to deal with complex situations.

摘要

在监督分类问题中,测试集可能包含属于学习阶段未观察到的类别的数据点。此外,测试数据中的相同单元可能是在相对于学习样本收集时间的后续阶段记录的一组附加变量上进行测量的。在这种情况下,在学习阶段构建的分类器需要进行调整,以处理潜在的未知类别和额外的维度。我们引入了一种基于模型的判别方法,即维度自适应混合判别分析(D-AMDA),它可以检测未观察到的类别并适应不断增加的维度。模型估计是通过基于期望最大化(EM)算法的全归纳方法进行的。然后,该方法被嵌入到一个更通用的框架中,用于适合大维度数据的自适应变量选择和分类。一项模拟研究以及与掺假蜂蜜样本分类相关的人工实验被用于验证所提出框架处理复杂情况的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/d34ad01b4557/11634_2021_474_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/157a5accfe26/11634_2021_474_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/b331ac3f4b70/11634_2021_474_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/b0f5cd4a69bb/11634_2021_474_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/1e8eb48109de/11634_2021_474_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/6e66940e23a2/11634_2021_474_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/b2cd92f18e33/11634_2021_474_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/d6a01647c15d/11634_2021_474_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/edfb1efe264c/11634_2021_474_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/376d73bc39b1/11634_2021_474_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/bfb8de306ca8/11634_2021_474_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/cd18c896b58b/11634_2021_474_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/666b8d211aca/11634_2021_474_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/d34ad01b4557/11634_2021_474_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/157a5accfe26/11634_2021_474_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/b331ac3f4b70/11634_2021_474_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/b0f5cd4a69bb/11634_2021_474_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/1e8eb48109de/11634_2021_474_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/6e66940e23a2/11634_2021_474_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/b2cd92f18e33/11634_2021_474_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/d6a01647c15d/11634_2021_474_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/edfb1efe264c/11634_2021_474_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/376d73bc39b1/11634_2021_474_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/bfb8de306ca8/11634_2021_474_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/cd18c896b58b/11634_2021_474_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/666b8d211aca/11634_2021_474_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb50/8924148/d34ad01b4557/11634_2021_474_Fig13_HTML.jpg

相似文献

1
Unobserved classes and extra variables in high-dimensional discriminant analysis.高维判别分析中的未观测类别与额外变量。
Adv Data Anal Classif. 2022;16(1):55-92. doi: 10.1007/s11634-021-00474-3. Epub 2022 Mar 1.
2
Machine learning models based on the dimensionality reduction of standard automated perimetry data for glaucoma diagnosis.基于标准自动视野计数据降维的青光眼诊断机器学习模型。
Artif Intell Med. 2019 Mar;94:110-116. doi: 10.1016/j.artmed.2019.02.006. Epub 2019 Feb 25.
3
Multilinear discriminant analysis for face recognition.用于人脸识别的多线性判别分析。
IEEE Trans Image Process. 2007 Jan;16(1):212-20. doi: 10.1109/tip.2006.884929.
4
Coupled dimensionality reduction and classification for supervised and semi-supervised multilabel learning.用于监督和半监督多标签学习的耦合降维和分类
Pattern Recognit Lett. 2014 Mar 1;38:132-141. doi: 10.1016/j.patrec.2013.11.021.
5
Active learning for solving the incomplete data problem in facial age classification by the furthest nearest-neighbor criterion.基于最远距离最近邻准则的主动学习在解决面部年龄分类中不完全数据问题的应用。
IEEE Trans Image Process. 2011 Jul;20(7):2049-62. doi: 10.1109/TIP.2011.2106794. Epub 2011 Jan 17.
6
Robust Face Recognition Based on a New Supervised Kernel Subspace Learning Method.基于新监督核子空间学习方法的稳健人脸识别。
Sensors (Basel). 2019 Apr 6;19(7):1643. doi: 10.3390/s19071643.
7
Tumor classification based on orthogonal linear discriminant analysis.基于正交线性判别分析的肿瘤分类
Biomed Mater Eng. 2014;24(1):1399-405. doi: 10.3233/BME-130944.
8
A general soft label based linear discriminant analysis for semi-supervised dimensionality reduction.基于通用软标签的半监督降维线性判别分析。
Neural Netw. 2014 Jul;55:83-97. doi: 10.1016/j.neunet.2014.03.005. Epub 2014 Apr 13.
9
Infinite Bayesian Max-Margin Discriminant Projection.无限贝叶斯最大间隔判别投影
IEEE Trans Cybern. 2022 May;52(5):3936-3946. doi: 10.1109/TCYB.2020.3017638. Epub 2022 May 19.
10
An Adaptive Deep Learning Framework for Dynamic Image Classification in the Internet of Things Environment.一种适用于物联网环境中动态图像分类的自适应深度学习框架。
Sensors (Basel). 2020 Oct 14;20(20):5811. doi: 10.3390/s20205811.

本文引用的文献

1
The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances.伟大的时间序列分类竞赛:对近期算法进展的综述与实验评估
Data Min Knowl Discov. 2017;31(3):606-660. doi: 10.1007/s10618-016-0483-9. Epub 2016 Nov 23.
2
clustvarsel: A Package Implementing Variable Selection for Gaussian Model-Based Clustering in R.clustvarsel:一个在R语言中为基于高斯模型的聚类实现变量选择的程序包。
J Stat Softw. 2018 Apr;84. doi: 10.18637/jss.v084.i01. Epub 2018 Apr 17.
3
mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models.
mclust 5:使用高斯有限混合模型进行聚类、分类和密度估计
R J. 2016 Aug;8(1):289-317.
4
Improved initialisation of model-based clustering using Gaussian hierarchical partitions.使用高斯层次划分改进基于模型的聚类初始化。
Adv Data Anal Classif. 2015 Dec;9(4):447-460. doi: 10.1007/s11634-015-0220-z. Epub 2015 Oct 26.
5
The application of sparse estimation of covariance matrix to quadratic discriminant analysis.协方差矩阵的稀疏估计在二次判别分析中的应用。
BMC Bioinformatics. 2015 Feb 18;16:48. doi: 10.1186/s12859-014-0443-6.
6
General subspace learning with corrupted training data via graph embedding.基于图嵌入的带损坏训练数据的广义子空间学习。
IEEE Trans Image Process. 2013 Nov;22(11):4380-93. doi: 10.1109/TIP.2013.2273665. Epub 2013 Jul 22.
7
Multifrequency species classification of acoustic-trawl survey data using semi-supervised learning with class discovery.基于类发现的半监督学习对声纳拖网调查数据进行多频物种分类。
J Acoust Soc Am. 2012 Feb;131(2):EL184-90. doi: 10.1121/1.3678685.
8
Variable Selection and Updating In Model-Based Discriminant Analysis for High Dimensional Data with Food Authenticity Applications.基于模型的判别分析中高维数据的变量选择与更新及其在食品真实性应用中的研究
Ann Appl Stat. 2010 Mar 1;4(1):396-421. doi: 10.1214/09-AOAS279.
9
Variable selection for clustering with Gaussian mixture models.用于高斯混合模型聚类的变量选择
Biometrics. 2009 Sep;65(3):701-9. doi: 10.1111/j.1541-0420.2008.01160.x. Epub 2009 Feb 4.
10
Application of Fourier transform midinfrared spectroscopy to the discrimination between Irish artisanal honey and such honey adulterated with various sugar syrups.傅里叶变换中红外光谱法在鉴别爱尔兰手工蜂蜜与掺有各种糖浆的掺假蜂蜜中的应用。
J Agric Food Chem. 2006 Aug 23;54(17):6166-71. doi: 10.1021/jf0613785.