• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于距离的核,用于通过支持向量机进行分类。

A distance-based kernel for classification via Support Vector Machines.

作者信息

Amaya-Tejera Nazhir, Gamarra Margarita, Vélez Jorge I, Zurek Eduardo

机构信息

Department of Computer Science, Universidad del Norte, Barranquilla, Colombia.

Department of Industrial Engineering, Universidad del Norte, Barranquilla, Colombia.

出版信息

Front Artif Intell. 2024 Feb 26;7:1287875. doi: 10.3389/frai.2024.1287875. eCollection 2024.

DOI:10.3389/frai.2024.1287875
PMID:38469159
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10925654/
Abstract

Support Vector Machines (SVMs) are a type of supervised machine learning algorithm widely used for classification tasks. In contrast to traditional methods that split the data into separate training and testing sets, here we propose an innovative approach where subsets of the original data are randomly selected to train the model multiple times. This iterative training process aims to identify a representative data subset, leading to improved inferences about the population. Additionally, we introduce a novel distance-based kernel specifically designed for binary-type features based on a similarity matrix that efficiently handles both binary and multi-class classification problems. Computational experiments on publicly available datasets of varying sizes demonstrate that our proposed method significantly outperforms existing approaches in terms of classification accuracy. Furthermore, the distance-based kernel achieves superior performance compared to other well-known kernels from the literature and those used in previous studies on the same datasets. These findings validate the effectiveness of our proposed classification method and distance-based kernel for SVMs. By leveraging random subset selection and a unique kernel design, we achieve notable improvements in classification accuracy. These results have significant implications for diverse classification problems in Machine Learning and data analysis.

摘要

支持向量机(SVM)是一种广泛应用于分类任务的监督式机器学习算法。与将数据拆分为单独的训练集和测试集的传统方法不同,我们在此提出一种创新方法,即从原始数据子集中随机选择子集多次训练模型。这种迭代训练过程旨在识别具有代表性的数据子集,从而改进对总体的推断。此外,我们基于相似性矩阵引入了一种专门为二元类型特征设计的新型基于距离的核,该核能够有效处理二元和多类分类问题。对不同大小的公开可用数据集进行的计算实验表明,我们提出的方法在分类准确率方面显著优于现有方法。此外,与文献中其他知名核以及之前在相同数据集上的研究中使用的核相比,基于距离的核具有更优的性能。这些发现验证了我们为支持向量机提出的分类方法和基于距离的核的有效性。通过利用随机子集选择和独特的核设计,我们在分类准确率方面取得了显著提高。这些结果对机器学习和数据分析中的各种分类问题具有重要意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a156/10925654/a7b596129e82/frai-07-1287875-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a156/10925654/9e6dcfdd4fd9/frai-07-1287875-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a156/10925654/d7d62bc516f8/frai-07-1287875-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a156/10925654/a7b596129e82/frai-07-1287875-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a156/10925654/9e6dcfdd4fd9/frai-07-1287875-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a156/10925654/d7d62bc516f8/frai-07-1287875-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a156/10925654/a7b596129e82/frai-07-1287875-g0002.jpg

相似文献

1
A distance-based kernel for classification via Support Vector Machines.一种基于距离的核,用于通过支持向量机进行分类。
Front Artif Intell. 2024 Feb 26;7:1287875. doi: 10.3389/frai.2024.1287875. eCollection 2024.
2
Vicinal support vector classifier using supervised kernel-based clustering.基于监督核聚类的邻接支持向量分类器。
Artif Intell Med. 2014 Mar;60(3):189-96. doi: 10.1016/j.artmed.2014.01.003. Epub 2014 Feb 7.
3
Stochastic subset selection for learning with kernel machines.用于核机器学习的随机子集选择
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):616-26. doi: 10.1109/TSMCB.2011.2171680. Epub 2011 Oct 27.
4
Efficient $\chi ^{2}$ Kernel Linearization via Random Feature Maps.通过随机特征映射实现高效的 $\chi ^{2}$ 核线性化。
IEEE Trans Neural Netw Learn Syst. 2016 Nov;27(11):2448-2453. doi: 10.1109/TNNLS.2015.2476659. Epub 2015 Sep 23.
5
Correlation kernels for support vector machines classification with applications in cancer data.支持向量机分类的相关核函数及其在癌症数据中的应用。
Comput Math Methods Med. 2012;2012:205025. doi: 10.1155/2012/205025. Epub 2012 Aug 7.
6
Nonlinear Deep Kernel Learning for Image Annotation.用于图像标注的非线性深度核学习
IEEE Trans Image Process. 2017 Apr;26(4):1820-1832. doi: 10.1109/TIP.2017.2666038. Epub 2017 Feb 8.
7
Kernel design for RNA classification using Support Vector Machines.使用支持向量机进行RNA分类的内核设计
Int J Data Min Bioinform. 2006;1(1):57-76. doi: 10.1504/ijdmb.2006.009921.
8
Nonparallel support vector machines for pattern classification.用于模式分类的非平行支持向量机。
IEEE Trans Cybern. 2014 Jul;44(7):1067-79. doi: 10.1109/TCYB.2013.2279167. Epub 2013 Sep 5.
9
Gene Classification Based on Multi-Class SVMs with Systematic Sampling and Hierarchical Clustering (SSHC) Algorithm.基于系统抽样和层次聚类 (SSHC) 算法的多类 SVM 基因分类。
Adv Exp Med Biol. 2021;1338:231-237. doi: 10.1007/978-3-030-78775-2_28.
10
LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.LZW-Kernel:快速内核,利用 LZW 压缩器中的变长码块对蛋白质序列进行分类。
Bioinformatics. 2018 Oct 1;34(19):3281-3288. doi: 10.1093/bioinformatics/bty349.

引用本文的文献

1
Detection of Aspergilloma Disease Using Feature-Selection-Based Vision Transformers.基于特征选择的视觉变换器检测曲菌球病
Diagnostics (Basel). 2024 Dec 26;15(1):26. doi: 10.3390/diagnostics15010026.
2
Machine Learning Driven by Magnetic Resonance Imaging for the Classification of Alzheimer Disease Progression: Systematic Review and Meta-Analysis.磁共振成像驱动的机器学习用于阿尔茨海默病进展分类:系统评价与荟萃分析
JMIR Aging. 2024 Dec 23;7:e59370. doi: 10.2196/59370.
3
Detection of Thymoma Disease Using mRMR Feature Selection and Transformer Models.

本文引用的文献

1
Sparse SVM for Sufficient Data Reduction.用于充分数据约简的稀疏支持向量机
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5560-5571. doi: 10.1109/TPAMI.2021.3075339. Epub 2022 Aug 4.
2
A Hybrid Data Mining Model to Predict Coronary Artery Disease Cases Using Non-Invasive Clinical Data.一种使用非侵入性临床数据预测冠状动脉疾病病例的混合数据挖掘模型。
J Med Syst. 2016 Jul;40(7):178. doi: 10.1007/s10916-016-0536-z. Epub 2016 Jun 11.
3
A Computer Program for Classifying Plants.一个用于植物分类的计算机程序。
基于最小冗余最大相关(mRMR)特征选择和Transformer模型的胸腺瘤疾病检测
Diagnostics (Basel). 2024 Sep 29;14(19):2169. doi: 10.3390/diagnostics14192169.
4
Long Non-Coding RNAs and Alzheimer's Disease: Towards Personalized Diagnosis.长非编码 RNA 与阿尔茨海默病:迈向个体化诊断。
Int J Mol Sci. 2024 Jul 11;25(14):7641. doi: 10.3390/ijms25147641.
Science. 1960 Oct 21;132(3434):1115-8. doi: 10.1126/science.132.3434.1115.
4
International application of a new probability algorithm for the diagnosis of coronary artery disease.一种用于诊断冠状动脉疾病的新概率算法的国际应用。
Am J Cardiol. 1989 Aug 1;64(5):304-10. doi: 10.1016/0002-9149(89)90524-9.