• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于马氏距离-田口法的高维小样本数据分类优化方法。

Optimized Mahalanobis-Taguchi System for High-Dimensional Small Sample Data Classification.

机构信息

School of Science, Wuhan University of Technology, Wuhan 430070, China.

出版信息

Comput Intell Neurosci. 2020 Apr 26;2020:4609423. doi: 10.1155/2020/4609423. eCollection 2020.

DOI:10.1155/2020/4609423
PMID:32405295
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7199641/
Abstract

The Mahalanobis-Taguchi system (MTS) is a multivariate data diagnosis and prediction technology, which is widely used to optimize large sample data or unbalanced data, but it is rarely used for high-dimensional small sample data. In this paper, the optimized MTS for the classification of high-dimensional small sample data is discussed from two aspects, namely, the inverse matrix instability of the covariance matrix and the instability of feature selection. Firstly, based on regularization and smoothing techniques, this paper proposes a modified Mahalanobis metric to calculate the Mahalanobis distance, which is aimed at reducing the influence of the inverse matrix instability under small sample conditions. Secondly, the minimum redundancy-maximum relevance (mRMR) algorithm is introduced into the MTS for the instability problem of feature selection. By using the mRMR algorithm and signal-to-noise ratio (SNR), a two-stage feature selection method is proposed: the mRMR algorithm is first used to remove noise and redundant variables; the orthogonal table and SNR are then used to screen the combination of variables that make great contribution to classification. Then, the feasibility and simplicity of the optimized MTS are shown in five datasets from the UCI database. The Mahalanobis distance based on regularization and smoothing techniques (RS-MD) is more robust than the traditional Mahalanobis distance. The two-stage feature selection method improves the effectiveness of feature selection for MTS. Finally, the optimized MTS is applied to email classification of the Spambase dataset. The results show that the optimized MTS outperforms the classical MTS and the other 3 machine learning algorithms.

摘要

马氏距离-田口系统(MTS)是一种多元数据分析和预测技术,广泛应用于优化大样本数据或不平衡数据,但很少用于高维小样本数据。本文从协方差矩阵逆矩阵不稳定性和特征选择不稳定性两个方面讨论了用于高维小样本数据分类的优化 MTS。首先,基于正则化和平滑技术,本文提出了一种改进的马氏度量来计算马氏距离,旨在减少小样本条件下逆矩阵不稳定性的影响。其次,将最小冗余最大相关性(mRMR)算法引入 MTS 中,以解决特征选择的不稳定性问题。通过使用 mRMR 算法和信噪比(SNR),提出了一种两阶段特征选择方法:首先使用 mRMR 算法去除噪声和冗余变量;然后使用正交表和 SNR 筛选对分类有较大贡献的变量组合。然后,从 UCI 数据库中的五个数据集展示了优化 MTS 的可行性和简单性。基于正则化和平滑技术的马氏距离(RS-MD)比传统的马氏距离更稳健。两阶段特征选择方法提高了 MTS 的特征选择有效性。最后,将优化 MTS 应用于 Spambase 数据集的电子邮件分类。结果表明,优化 MTS 优于经典 MTS 和其他 3 种机器学习算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/e9038b370927/CIN2020-4609423.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/5fa2a49fca4e/CIN2020-4609423.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/0e45eac18702/CIN2020-4609423.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/e688b1a37d88/CIN2020-4609423.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/bc8833d784dc/CIN2020-4609423.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/f23d65440855/CIN2020-4609423.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/209c12d68845/CIN2020-4609423.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/e9038b370927/CIN2020-4609423.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/5fa2a49fca4e/CIN2020-4609423.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/0e45eac18702/CIN2020-4609423.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/e688b1a37d88/CIN2020-4609423.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/bc8833d784dc/CIN2020-4609423.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/f23d65440855/CIN2020-4609423.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/209c12d68845/CIN2020-4609423.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6086/7199641/e9038b370927/CIN2020-4609423.alg.001.jpg

相似文献

1
Optimized Mahalanobis-Taguchi System for High-Dimensional Small Sample Data Classification.基于马氏距离-田口法的高维小样本数据分类优化方法。
Comput Intell Neurosci. 2020 Apr 26;2020:4609423. doi: 10.1155/2020/4609423. eCollection 2020.
2
Modified Mahalanobis-Taguchi System based on proper orthogonal decomposition for high-dimensional-small-sample-size data classification.基于正则正交分解的改进马氏距离-田口系统在高维小样本数据分类中的应用。
Math Biosci Eng. 2020 Dec 7;18(1):426-444. doi: 10.3934/mbe.2021023.
3
Modified Mahalanobis Taguchi System for Imbalance Data Classification.用于不平衡数据分类的改进马氏田口系统
Comput Intell Neurosci. 2017;2017:5874896. doi: 10.1155/2017/5874896. Epub 2017 Jul 24.
4
Learning a Mahalanobis Distance-Based Dynamic Time Warping Measure for Multivariate Time Series Classification.学习基于马氏距离的动态时间规整度量方法进行多元时间序列分类。
IEEE Trans Cybern. 2016 Jun;46(6):1363-74. doi: 10.1109/TCYB.2015.2426723. Epub 2015 May 8.
5
Hybrid Feature Selection Algorithm mRMR-ICA for Cancer Classification from Microarray Gene Expression Data.用于从微阵列基因表达数据进行癌症分类的混合特征选择算法mRMR-ICA
Comb Chem High Throughput Screen. 2018;21(6):420-430. doi: 10.2174/1386207321666180601074349.
6
The Hybrid Feature Selection Algorithm Based on Maximum Minimum Backward Selection Search Strategy for Liver Tissue Pathological Image Classification.
Comput Math Methods Med. 2016;2016:7369137. doi: 10.1155/2016/7369137. Epub 2016 Jul 31.
7
Two-stage feature selection for classification of gene expression data based on an improved Salp Swarm Algorithm.基于改进的鹽蝽群算法的基因表达数据分类的两阶段特征选择
Math Biosci Eng. 2022 Sep 19;19(12):13747-13781. doi: 10.3934/mbe.2022641.
8
Adaptive Multiclass Mahalanobis Taguchi System for Bearing Fault Diagnosis under Variable Conditions.变工况下基于自适应多类马氏田口系统的滚动轴承故障诊断
Sensors (Basel). 2018 Dec 21;19(1):26. doi: 10.3390/s19010026.
9
Appearance and characterization of fruit image textures for quality sorting using wavelet transform and genetic algorithms.利用小波变换和遗传算法对水果图像纹理进行外观和特征描述,以实现品质分选。
J Texture Stud. 2018 Feb;49(1):65-83. doi: 10.1111/jtxs.12284. Epub 2017 Aug 6.
10
Classification complexity in myoelectric pattern recognition.肌电模式识别中的分类复杂性
J Neuroeng Rehabil. 2017 Jul 10;14(1):68. doi: 10.1186/s12984-017-0283-5.

引用本文的文献

1
Few-shot pulse wave contour classification based on multi-scale feature extraction.基于多尺度特征提取的少脉冲波轮廓分类。
Sci Rep. 2021 Feb 12;11(1):3762. doi: 10.1038/s41598-021-83134-y.

本文引用的文献

1
Priority Setting for the Management of Chemicals Using the Globally Harmonized System and Multivariate Analysis: Use of the Mahalanobis-Taguchi System.利用全球协调系统和多元分析进行化学品管理的优先级设定:马哈拉诺比斯-田口系统的使用。
Int J Environ Res Public Health. 2019 Aug 27;16(17):3119. doi: 10.3390/ijerph16173119.
2
Regional energy security performance evaluation in China using MTGS and SPA-TOPSIS.基于 MTGS 和 SPA-TOPSIS 的中国区域能源安全绩效评价
Sci Total Environ. 2019 Dec 15;696:133817. doi: 10.1016/j.scitotenv.2019.133817. Epub 2019 Aug 8.
3
Adaptive Multiclass Mahalanobis Taguchi System for Bearing Fault Diagnosis under Variable Conditions.
变工况下基于自适应多类马氏田口系统的滚动轴承故障诊断
Sensors (Basel). 2018 Dec 21;19(1):26. doi: 10.3390/s19010026.
4
Modified Mahalanobis Taguchi System for Imbalance Data Classification.用于不平衡数据分类的改进马氏田口系统
Comput Intell Neurosci. 2017;2017:5874896. doi: 10.1155/2017/5874896. Epub 2017 Jul 24.
5
A feature selection method based on multiple kernel learning with expression profiles of different types.一种基于多内核学习和不同类型表达谱的特征选择方法。
BioData Min. 2017 Feb 2;10:4. doi: 10.1186/s13040-017-0124-x. eCollection 2017.
6
Evaluating feature-selection stability in next-generation proteomics.评估新一代蛋白质组学中特征选择的稳定性。
J Bioinform Comput Biol. 2016 Oct;14(5):1650029. doi: 10.1142/S0219720016500293. Epub 2016 Aug 3.
7
Person Re-Identification by Dual-Regularized KISS Metric Learning.
IEEE Trans Image Process. 2016 Jun;25(6):2726-2738. doi: 10.1109/TIP.2016.2553446. Epub 2016 Apr 12.
8
A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data.应用于微阵列数据的特征选择与特征提取方法综述
Adv Bioinformatics. 2015;2015:198363. doi: 10.1155/2015/198363. Epub 2015 Jun 11.
9
Person reidentification by minimum classification error-based KISS metric learning.基于最小分类错误的 KISS 度量学习的行人再识别。
IEEE Trans Cybern. 2015 Feb;45(2):242-52. doi: 10.1109/TCYB.2014.2323992. Epub 2014 Jun 3.
10
Modified quadratic discriminant functions and the application to chinese character recognition.修正二次判别函数及其在汉字识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 1987 Jan;9(1):149-53. doi: 10.1109/tpami.1987.4767881.