IEEE Trans Cybern. 2020 Feb;50(2):739-752. doi: 10.1109/TCYB.2018.2872800. Epub 2018 Oct 15.
The performance of a classifier might greatly deteriorate due to missing data, and many different techniques have been developed to handle this problem. In this paper, we address missing data from a novel transfer learning perspective and show that when an additive least squares support vector machine (LS-SVM) is adopted, model transfer learning can enhance classification performance on incomplete training datasets. A novel transfer-based additive LS-SVM classifier is accordingly proposed. The method also simultaneously estimates the influence of the classification errors caused by each incomplete sample using a fast leave-one-out cross-validation strategy, which serves as an alternative way to clean the training data and further improve data quality. The proposed method has been applied to seven public datasets. The experimental results indicate that it achieves at least comparable, if not better, performance than case deletion, mean imputation, and k-nearest neighbor imputation combined with standard LS-SVM and support vector machine classifiers. Moreover, a detailed case study on a community healthcare dataset is presented, which particularly highlights the contributions and benefits of the proposed method in this real-world application.
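For reference, the standard binary LS-SVM at the core of the paper's method is trained by solving a single linear system rather than a quadratic program. The sketch below is a minimal illustration of that baseline classifier only, not the paper's transfer-based additive variant or its fast leave-one-out strategy; the function names, RBF kernel choice, and hyperparameter values are illustrative assumptions.

```python
import numpy as np

def rbf_kernel(A, B, sigma=1.0):
    # Pairwise RBF kernel K(a, b) = exp(-||a - b||^2 / (2 sigma^2)).
    d2 = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2.0 * A @ B.T
    return np.exp(-d2 / (2.0 * sigma**2))

def lssvm_train(X, y, gamma=10.0, sigma=1.0):
    # Solve the LS-SVM dual linear system (Suykens-style formulation):
    #   [ 0   y^T            ] [ b     ]   [ 0 ]
    #   [ y   Omega + I/gamma] [ alpha ] = [ 1 ]
    # where Omega_ij = y_i * y_j * K(x_i, x_j).
    n = len(y)
    Omega = np.outer(y, y) * rbf_kernel(X, X, sigma)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = y
    A[1:, 0] = y
    A[1:, 1:] = Omega + np.eye(n) / gamma
    rhs = np.concatenate([[0.0], np.ones(n)])
    sol = np.linalg.solve(A, rhs)
    return sol[0], sol[1:]          # bias b, dual coefficients alpha

def lssvm_predict(X_train, y_train, b, alpha, X_test, sigma=1.0):
    # Decision function: sign( sum_i alpha_i y_i K(x, x_i) + b ).
    K = rbf_kernel(X_test, X_train, sigma)
    return np.sign(K @ (alpha * y_train) + b)

# Hypothetical toy data: two well-separated clusters with labels -1 / +1.
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
              [5.0, 5.0], [5.0, 6.0], [6.0, 5.0]])
y = np.array([-1.0, -1.0, -1.0, 1.0, 1.0, 1.0])
b, alpha = lssvm_train(X, y)
preds = lssvm_predict(X, y, b, alpha, X)
```

Because all constraints are equalities, every training point gets a nonzero `alpha_i`, which is what makes closed-form leave-one-out formulas (as exploited by the paper) tractable for LS-SVMs.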