• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于线性和基于非线性核的判别函数的边际最大化特征消除方法。

Margin-maximizing feature elimination methods for linear and nonlinear kernel-based discriminant functions.

作者信息

Aksu Yaman, Miller David J, Kesidis George, Yang Qing X

机构信息

Electrical Engineering Department, Pennsylvania State University, University Park, PA 16802, USA.

出版信息

IEEE Trans Neural Netw. 2010 May;21(5):701-17. doi: 10.1109/TNN.2010.2041069. Epub 2010 Feb 25.

DOI:10.1109/TNN.2010.2041069
PMID:20194055
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3580203/
Abstract

Feature selection for classification in high-dimensional spaces can improve generalization, reduce classifier complexity, and identify important, discriminating feature "markers." For support vector machine (SVM) classification, a widely used technique is recursive feature elimination (RFE). We demonstrate that RFE is not consistent with margin maximization, central to the SVM learning approach. We thus propose explicit margin-based feature elimination (MFE) for SVMs and demonstrate both improved margin and improved generalization, compared with RFE. Moreover, for the case of a nonlinear kernel, we show that RFE assumes that the squared weight vector 2-norm is strictly decreasing as features are eliminated. We demonstrate this is not true for the Gaussian kernel and, consequently, RFE may give poor results in this case. MFE for nonlinear kernels gives better margin and generalization. We also present an extension which achieves further margin gains, by optimizing only two degrees of freedom--the hyperplane's intercept and its squared 2-norm--with the weight vector orientation fixed. We finally introduce an extension that allows margin slackness. We compare against several alternatives, including RFE and a linear programming method that embeds feature selection within the classifier design. On high-dimensional gene microarray data sets, University of California at Irvine (UCI) repository data sets, and Alzheimer's disease brain image data, MFE methods give promising results.

摘要

高维空间中用于分类的特征选择可以提高泛化能力、降低分类器复杂度,并识别重要的、具有区分性的特征“标记”。对于支持向量机(SVM)分类,一种广泛使用的技术是递归特征消除(RFE)。我们证明RFE与SVM学习方法的核心——最大化间隔不一致。因此,我们为支持向量机提出了基于间隔的显式特征消除(MFE),并证明与RFE相比,它在间隔和泛化能力方面都有所提高。此外,对于非线性核的情况,我们表明RFE假设随着特征的消除,平方权重向量2-范数严格递减。我们证明对于高斯核并非如此,因此,在这种情况下RFE可能会给出较差的结果。非线性核的MFE能提供更好的间隔和泛化能力。我们还提出了一种扩展方法,通过仅优化两个自由度——超平面的截距及其平方2-范数——同时固定权重向量方向,从而实现进一步的间隔增益。我们最终引入了一种允许间隔松弛的扩展方法。我们与几种替代方法进行了比较,包括RFE和一种在分类器设计中嵌入特征选择的线性规划方法。在高维基因微阵列数据集、加州大学欧文分校(UCI)存储库数据集以及阿尔茨海默病脑图像数据上,MFE方法取得了有前景的结果。

相似文献

1
Margin-maximizing feature elimination methods for linear and nonlinear kernel-based discriminant functions.用于线性和基于非线性核的判别函数的边际最大化特征消除方法。
IEEE Trans Neural Netw. 2010 May;21(5):701-17. doi: 10.1109/TNN.2010.2041069. Epub 2010 Feb 25.
2
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.基于最大间隔准则的递归基因选择:与支持向量机递归特征消除法的比较
BMC Bioinformatics. 2006 Dec 25;7:543. doi: 10.1186/1471-2105-7-543.
3
Development of two-stage SVM-RFE gene selection strategy for microarray expression data analysis.用于微阵列表达数据分析的两阶段支持向量机-递归特征消除基因选择策略的开发。
IEEE/ACM Trans Comput Biol Bioinform. 2007 Jul-Sep;4(3):365-81. doi: 10.1109/TCBB.2007.70224.
4
Improving the computational efficiency of recursive cluster elimination for gene selection.提高递归聚类消除基因选择的计算效率。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Jan-Mar;8(1):122-9. doi: 10.1109/TCBB.2010.44.
5
MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data.MSVM-RFE:用于DNA微阵列数据多类基因选择的SVM-RFE扩展方法
Bioinformatics. 2007 May 1;23(9):1106-14. doi: 10.1093/bioinformatics/btm036.
6
SVM-RFE: selection and visualization of the most relevant features through non-linear kernels.SVM-RFE:通过非线性核选择和可视化最相关特征。
BMC Bioinformatics. 2018 Nov 19;19(1):432. doi: 10.1186/s12859-018-2451-4.
7
Multiple SVM-RFE for gene selection in cancer classification with expression data.用于基于表达数据的癌症分类中基因选择的多重支持向量机递归特征消除法
IEEE Trans Nanobioscience. 2005 Sep;4(3):228-34. doi: 10.1109/tnb.2005.853657.
8
Recursive cluster elimination (RCE) for classification and feature selection from gene expression data.用于从基因表达数据中进行分类和特征选择的递归聚类消除法(RCE)
BMC Bioinformatics. 2007 May 2;8:144. doi: 10.1186/1471-2105-8-144.
9
An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.基于基因表达数据的多支持向量机技术的高效特征选择策略。
Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018.
10
An efficient model selection for linear discriminant function-based recursive feature elimination.基于线性判别函数的递归特征消除的有效模型选择。
J Biomed Inform. 2022 May;129:104070. doi: 10.1016/j.jbi.2022.104070. Epub 2022 Apr 15.

引用本文的文献

1
BIGFormer: A Graph Transformer With Local Structure Awareness for Diagnosis and Pathogenesis Identification of Alzheimer's Disease Using Imaging Genetic Data.BIGFormer:一种具有局部结构感知能力的图变换器,用于利用影像遗传学数据诊断和识别阿尔茨海默病的发病机制
IEEE J Biomed Health Inform. 2025 Jan;29(1):495-506. doi: 10.1109/JBHI.2024.3442468. Epub 2025 Jan 7.
2
ASmiR: a machine learning framework for prediction of abiotic stress-specific miRNAs in plants.ASmiR:一种用于预测植物非生物胁迫特异性微小RNA的机器学习框架。
Funct Integr Genomics. 2023 Mar 20;23(2):92. doi: 10.1007/s10142-023-01014-2.
3
Machine learning-based prediction of cognitive outcomes in de novo Parkinson's disease.

本文引用的文献

1
A problem of dimensionality: a simple example.维度问题:一个简单的例子。
IEEE Trans Pattern Anal Mach Intell. 1979 Mar;1(3):306-7. doi: 10.1109/tpami.1979.4766926.
2
Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.基于互信息的特征选择:最大依赖、最大相关和最小冗余准则。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1226-38. doi: 10.1109/TPAMI.2005.159.
3
HAMMER: hierarchical attribute matching mechanism for elastic registration.HAMMER:用于弹性配准的分层属性匹配机制
基于机器学习对新发帕金森病认知结局的预测
NPJ Parkinsons Dis. 2022 Nov 7;8(1):150. doi: 10.1038/s41531-022-00409-5.
4
ASRmiRNA: Abiotic Stress-Responsive miRNA Prediction in Plants by Using Machine Learning Algorithms with Pseudo -Tuple Nucleotide Compositional Features.ASRmiRNA:基于伪元组核苷酸组成特征的机器学习算法预测植物非生物胁迫响应 miRNA
Int J Mol Sci. 2022 Jan 30;23(3):1612. doi: 10.3390/ijms23031612.
5
Identifying discriminative features for diagnosis of Kashin-Beck disease among adolescents.鉴别青少年大骨节病的诊断特征。
BMC Musculoskelet Disord. 2021 Sep 18;22(1):801. doi: 10.1186/s12891-021-04514-z.
6
Callosal thickness profiles for prognosticating conversion from mild cognitive impairment to Alzheimer's disease: A classification approach.胼胝体厚度谱预测轻度认知障碍向阿尔茨海默病的转化:一种分类方法。
Brain Behav. 2018 Dec;8(12):e01142. doi: 10.1002/brb3.1142. Epub 2018 Nov 22.
7
FEATURE ELIMINATION IN KERNEL MACHINES IN MODERATELY HIGH DIMENSIONS.适度高维核机器中的特征消除
Ann Stat. 2019 Feb;47(1):497-526. doi: 10.1214/18-AOS1696.
8
SVM-RFE based feature selection and Taguchi parameters optimization for multiclass SVM classifier.基于支持向量机-递归特征消除的多类支持向量机分类器特征选择及田口参数优化
ScientificWorldJournal. 2014;2014:795624. doi: 10.1155/2014/795624. Epub 2014 Sep 10.
9
HIV-associated distal neuropathic pain is associated with smaller total cerebral cortical gray matter.与HIV相关的远端神经性疼痛与全脑皮质灰质体积较小有关。
J Neurovirol. 2014 Jun;20(3):209-18. doi: 10.1007/s13365-014-0236-8. Epub 2014 Feb 19.
10
T2 texture index of cartilage can predict early symptomatic OA progression: data from the osteoarthritis initiative.软骨 T2 纹理指数可预测早期有症状 OA 的进展:来自骨关节炎倡议的数据。
Osteoarthritis Cartilage. 2013 Oct;21(10):1550-7. doi: 10.1016/j.joca.2013.06.007. Epub 2013 Jun 15.
IEEE Trans Med Imaging. 2002 Nov;21(11):1421-39. doi: 10.1109/TMI.2002.803111.
4
Multiclass cancer diagnosis using tumor gene expression signatures.利用肿瘤基因表达特征进行多类癌症诊断。
Proc Natl Acad Sci U S A. 2001 Dec 18;98(26):15149-54. doi: 10.1073/pnas.211566398. Epub 2001 Dec 11.
5
Support vector machine classification and validation of cancer tissue samples using microarray expression data.使用微阵列表达数据对癌组织样本进行支持向量机分类与验证。
Bioinformatics. 2000 Oct;16(10):906-14. doi: 10.1093/bioinformatics/16.10.906.