• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

概率分类向量机

Probabilistic classification vector machines.

作者信息

Chen Huanhuan, Tino Peter, Yao Xin

机构信息

Centre of Excellence for Research in Computational Intelligence and Applications (CERCIA), School of Computer Science, University of Birmingham, Birmingham B15 2TT, UK.

出版信息

IEEE Trans Neural Netw. 2009 Jun;20(6):901-14. doi: 10.1109/TNN.2009.2014161. Epub 2009 Apr 24.

DOI:10.1109/TNN.2009.2014161
PMID:19398403
Abstract

In this paper, a sparse learning algorithm, probabilistic classification vector machines (PCVMs), is proposed. We analyze relevance vector machines (RVMs) for classification problems and observe that adopting the same prior for different classes may lead to unstable solutions. In order to tackle this problem, a signed and truncated Gaussian prior is adopted over every weight in PCVMs, where the sign of prior is determined by the class label, i.e., +1 or -1. The truncated Gaussian prior not only restricts the sign of weights but also leads to a sparse estimation of weight vectors, and thus controls the complexity of the model. In PCVMs, the kernel parameters can be optimized simultaneously within the training algorithm. The performance of PCVMs is extensively evaluated on four synthetic data sets and 13 benchmark data sets using three performance metrics, error rate (ERR), area under the curve of receiver operating characteristic (AUC), and root mean squared error (RMSE). We compare PCVMs with soft-margin support vector machines (SVM(Soft)), hard-margin support vector machines (SVM(Hard)), SVM with the kernel parameters optimized by PCVMs (SVM(PCVM)), relevance vector machines (RVMs), and some other baseline classifiers. Through five replications of twofold cross-validation F test, i.e., 5 x 2 cross-validation F test, over single data sets and Friedman test with the corresponding post-hoc test to compare these algorithms over multiple data sets, we notice that PCVMs outperform other algorithms, including SVM(Soft), SVM(Hard), RVM, and SVM(PCVM), on most of the data sets under the three metrics, especially under AUC. Our results also reveal that the performance of SVM(PCVM) is slightly better than SVM(Soft), implying that the parameter optimization algorithm in PCVMs is better than cross validation in terms of performance and computational complexity. In this paper, we also discuss the superiority of PCVMs' formulation using maximum a posteriori (MAP) analysis and margin analysis, which explain the empirical success of PCVMs.

摘要

本文提出了一种稀疏学习算法——概率分类向量机(PCVMs)。我们分析了用于分类问题的相关向量机(RVMs),并观察到对不同类别采用相同的先验可能会导致不稳定的解。为了解决这个问题,PCVMs对每个权重采用了有符号截断高斯先验,其中先验的符号由类别标签(即 +1 或 -1)确定。截断高斯先验不仅限制了权重的符号,还导致权重向量的稀疏估计,从而控制了模型的复杂度。在PCVMs中,核参数可以在训练算法中同时进行优化。使用三种性能指标,即错误率(ERR)、接收者操作特征曲线下面积(AUC)和均方根误差(RMSE),在四个合成数据集和13个基准数据集上对PCVMs的性能进行了广泛评估。我们将PCVMs与软间隔支持向量机(SVM(Soft))、硬间隔支持向量机(SVM(Hard))、核参数由PCVMs优化的支持向量机(SVM(PCVM))、相关向量机(RVMs)以及其他一些基线分类器进行了比较。通过对单个数据集进行五次重复的二折交叉验证F检验,即5×2交叉验证F检验,并对多个数据集进行Friedman检验及相应的事后检验来比较这些算法,我们注意到在这三个指标下,尤其是在AUC指标下,PCVMs在大多数数据集上优于其他算法,包括SVM(Soft)、SVM(Hard)、RVM和SVM(PCVM)。我们的结果还表明,SVM(PCVM)的性能略优于SVM(Soft),这意味着PCVMs中的参数优化算法在性能和计算复杂度方面优于交叉验证。在本文中,我们还使用最大后验(MAP)分析和间隔分析讨论了PCVMs公式的优越性,这解释了PCVMs在经验上的成功。

相似文献

1
Probabilistic classification vector machines.概率分类向量机
IEEE Trans Neural Netw. 2009 Jun;20(6):901-14. doi: 10.1109/TNN.2009.2014161. Epub 2009 Apr 24.
2
Bias in error estimation when using cross-validation for model selection.在使用交叉验证进行模型选择时误差估计中的偏差。
BMC Bioinformatics. 2006 Feb 23;7:91. doi: 10.1186/1471-2105-7-91.
3
SVMs modeling for highly imbalanced classification.用于高度不平衡分类的支持向量机建模
IEEE Trans Syst Man Cybern B Cybern. 2009 Feb;39(1):281-8. doi: 10.1109/TSMCB.2008.2002909. Epub 2008 Dec 9.
4
Data classification with radial basis function networks based on a novel kernel density estimation algorithm.基于一种新型核密度估计算法的径向基函数网络数据分类
IEEE Trans Neural Netw. 2005 Jan;16(1):225-36. doi: 10.1109/TNN.2004.836229.
5
Two criteria for model selection in multiclass support vector machines.多类支持向量机中模型选择的两个标准。
IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1432-48. doi: 10.1109/TSMCB.2008.927272.
6
Face recognition using total margin-based adaptive fuzzy support vector machines.使用基于总余量的自适应模糊支持向量机的人脸识别。
IEEE Trans Neural Netw. 2007 Jan;18(1):178-92. doi: 10.1109/TNN.2006.883013.
7
Sparse Bayesian modeling with adaptive kernel learning.基于自适应核学习的稀疏贝叶斯建模
IEEE Trans Neural Netw. 2009 Jun;20(6):926-37. doi: 10.1109/TNN.2009.2014060. Epub 2009 May 5.
8
Posterior probability support vector machines for unbalanced data.用于不平衡数据的后验概率支持向量机
IEEE Trans Neural Netw. 2005 Nov;16(6):1561-73. doi: 10.1109/TNN.2005.857955.
9
A practical approach to model selection for support vector machines with a Gaussian kernel.一种用于具有高斯核的支持向量机的模型选择实用方法。
IEEE Trans Syst Man Cybern B Cybern. 2011 Apr;41(2):330-40. doi: 10.1109/TSMCB.2010.2053026. Epub 2010 Aug 9.
10
Training hard-margin support vector machines using greedy stagewise algorithm.使用贪婪逐阶段算法训练硬间隔支持向量机。
IEEE Trans Neural Netw. 2008 Aug;19(8):1446-55. doi: 10.1109/TNN.2008.2000576.

引用本文的文献

1
Assessment of fracture risk in diabetic patients.糖尿病患者骨折风险评估
J Diabetes Metab Disord. 2024 Aug 13;23(2):1653-1663. doi: 10.1007/s40200-024-01474-8. eCollection 2024 Dec.
2
Machine learning algorithms for predicting the risk of fracture in patients with diabetes in China.用于预测中国糖尿病患者骨折风险的机器学习算法
Heliyon. 2023 Jul 11;9(7):e18186. doi: 10.1016/j.heliyon.2023.e18186. eCollection 2023 Jul.
3
Implementation of Machine Learning on Human Frequency-Following Responses: A Tutorial.机器学习在人类频率跟随反应中的应用:教程
Semin Hear. 2022 Oct 26;43(3):251-274. doi: 10.1055/s-0042-1756219. eCollection 2022 Aug.
4
A Novel Ensemble Adaptive Sparse Bayesian Transfer Learning Machine for Nonlinear Large-Scale Process Monitoring.一种新颖的集成自适应稀疏贝叶斯迁移学习机,用于非线性大规模过程监测。
Sensors (Basel). 2020 Oct 28;20(21):6139. doi: 10.3390/s20216139.
5
Exploring the Potential of Spherical Harmonics and PCVM for Compounds Activity Prediction.探索球谐函数和 PCVM 在化合物活性预测中的应用潜力。
Int J Mol Sci. 2019 May 2;20(9):2175. doi: 10.3390/ijms20092175.
6
Improving Prediction of Self-interacting Proteins Using Stacked Sparse Auto-Encoder with PSSM profiles.利用 PSSM 特征的堆叠稀疏自编码器改进自相互作用蛋白的预测。
Int J Biol Sci. 2018 May 23;14(8):983-991. doi: 10.7150/ijbs.23817. eCollection 2018.
7
Detection of Interactions between Proteins by Using Legendre Moments Descriptor to Extract Discriminatory Information Embedded in PSSM.利用勒让德矩描述符提取PSSM中嵌入的鉴别信息来检测蛋白质之间的相互作用
Molecules. 2017 Aug 18;22(8):1366. doi: 10.3390/molecules22081366.
8
A Generic Transferable EEG Decoder for Online Detection of Error Potential in Target Selection.一种用于在线检测目标选择中错误电位的通用可转移脑电图解码器。
Front Neurosci. 2017 May 2;11:226. doi: 10.3389/fnins.2017.00226. eCollection 2017.
9
PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.PCVMZM:使用概率分类向量机模型结合泽尼克矩描述符从蛋白质序列预测蛋白质-蛋白质相互作用
Int J Mol Sci. 2017 May 11;18(5):1029. doi: 10.3390/ijms18051029.
10
Using Genetic Programming with Prior Formula Knowledge to Solve Symbolic Regression Problem.运用带有先验公式知识的遗传编程来解决符号回归问题。
Comput Intell Neurosci. 2016;2016:1021378. doi: 10.1155/2016/1021378. Epub 2015 Dec 24.