使用单类支持向量机进行异常值检测：在黑色素瘤预后中的应用

Outlier Detection with One-Class SVMs: An Application to Melanoma Prognosis.

作者信息

Dreiseitl Stephan, Osl Melanie, Scheibböck Christian, Binder Michael

机构信息

Dept. of Software Engineering, Upper Austria University of Applied Sciences, Hagenberg, Austria.

出版信息

AMIA Annu Symp Proc. 2010 Nov 13;2010:172-6.

PMID:21346963

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3041295/

Abstract

BACKGROUND

Medical diagnosis and prognosis using machine learning methods is usually represented as a supervised classification problem, where a model is built to distinguish "normal" from "abnormal" cases. If cases are available from only one class, this approach is not feasible.

OBJECTIVE

To evaluate the performance of classification via outlier detection by one-class support vector machines (SVMs) as a means of identifying abnormal cases in the domain of melanoma prognosis.

METHODS

Empirical evaluation of one-class SVMs on a data set for predicting the presence or absence of metastases in melanoma patients, and comparison with regular SVMs and artificial neural networks.

RESULTS

One-class SVMs achieve an area under the ROC curve (AUC) of 0.71; two-class algorithms achieve AUCs between 0.5 and 0.84, depending on the available number of cases from the minority class.

CONCLUSION

One-class SVMs offer a viable alternative to two-class classification algorithms if class distribution is heavily imbalanced.

摘要

背景

使用机器学习方法进行医学诊断和预后评估通常被表示为一个监督分类问题，即构建一个模型来区分“正常”和“异常”病例。如果仅能获取来自一个类别的病例，这种方法就不可行。

目的

评估通过单类支持向量机（SVM）进行异常检测的分类性能，以此作为识别黑色素瘤预后领域异常病例的一种手段。

方法

对用于预测黑色素瘤患者是否存在转移的数据集进行单类SVM的实证评估，并与常规SVM和人工神经网络进行比较。

结果

单类SVM的ROC曲线下面积（AUC）为0.71；二类算法的AUC在0.5至0.84之间波动，具体取决于少数类别的可用病例数量。

结论

如果类分布严重失衡，单类SVM为二类分类算法提供了一种可行的替代方案。

相似文献

Outlier Detection with One-Class SVMs: An Application to Melanoma Prognosis.使用单类支持向量机进行异常值检测：在黑色素瘤预后中的应用

AMIA Annu Symp Proc. 2010 Nov 13;2010:172-6.

A support vector machine using the lazy learning approach for multi-class classification.一种采用懒惰学习方法进行多类分类的支持向量机。

J Med Eng Technol. 2006 Mar-Apr;30(2):73-7. doi: 10.1080/03091900500095729.

Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management.基于拉普拉斯支持向量机的半监督临床文本分类：在癌症病例管理中的应用。

J Biomed Inform. 2013 Oct;46(5):869-75. doi: 10.1016/j.jbi.2013.06.014. Epub 2013 Jul 8.

SVMs modeling for highly imbalanced classification.用于高度不平衡分类的支持向量机建模

IEEE Trans Syst Man Cybern B Cybern. 2009 Feb;39(1):281-8. doi: 10.1109/TSMCB.2008.2002909. Epub 2008 Dec 9.

Hybrid neural network with cost-sensitive support vector machine for class-imbalanced multimodal data.基于代价敏感支持向量机的混合神经网络分类器用于处理类不平衡多模态数据。

Neural Netw. 2020 Oct;130:176-184. doi: 10.1016/j.neunet.2020.06.026. Epub 2020 Jul 3.

Application of Artificial Intelligence for Preoperative Diagnostic and Prognostic Prediction in Epithelial Ovarian Cancer Based on Blood Biomarkers.基于血液生物标志物的人工智能在卵巢上皮性癌术前诊断和预后预测中的应用。

Clin Cancer Res. 2019 May 15;25(10):3006-3015. doi: 10.1158/1078-0432.CCR-18-3378. Epub 2019 Apr 11.

Feature space interpretation of SVMs with indefinite kernels.具有不定核的支持向量机的特征空间解释

IEEE Trans Pattern Anal Mach Intell. 2005 Apr;27(4):482-492. doi: 10.1109/TPAMI.2005.78.

Near-Bayesian Support Vector Machines for imbalanced data classification with equal or unequal misclassification costs.带有同等或不等误分类代价的不平衡数据分类的近贝叶斯支持向量机。

Neural Netw. 2015 Oct;70:39-52. doi: 10.1016/j.neunet.2015.06.005. Epub 2015 Jul 8.

Training hard-margin support vector machines using greedy stagewise algorithm.使用贪婪逐阶段算法训练硬间隔支持向量机。

IEEE Trans Neural Netw. 2008 Aug;19(8):1446-55. doi: 10.1109/TNN.2008.2000576.

Expert guided natural language processing using one-class classification.使用单类分类的专家指导自然语言处理。

J Am Med Inform Assoc. 2015 Sep;22(5):962-6. doi: 10.1093/jamia/ocv010. Epub 2015 Jun 10.

引用本文的文献

One-class support vector machines for detecting population drift in deployed machine learning medical diagnostics.用于检测已部署机器学习医学诊断中群体漂移的单类支持向量机。

Sci Rep. 2025 Apr 9;15(1):12157. doi: 10.1038/s41598-025-94427-x.

SMN deficiency perturbs monoamine neurotransmitter metabolism in spinal muscular atrophy.运动神经元存活基因缺失会扰乱脊髓性肌萎缩症中的单胺神经递质代谢。

Commun Biol. 2023 Nov 13;6(1):1155. doi: 10.1038/s42003-023-05543-1.

Detecting outliers beyond tolerance limits derived from statistical process control in patient-specific quality assurance.在基于患者个体的质量保证的统计过程控制中，检测超出容忍限的离群值。

J Appl Clin Med Phys. 2024 Feb;25(2):e14154. doi: 10.1002/acm2.14154. Epub 2023 Sep 8.

Towards addressing unauthorized sharing of subscriptions.致力于解决订阅的未经授权共享问题。

Appl Intell (Dordr). 2022;52(15):17090-17102. doi: 10.1007/s10489-021-02812-6. Epub 2021 Oct 16.

Classification and Automated Interpretation of Spinal Posture Data Using a Pathology-Independent Classifier and Explainable Artificial Intelligence (XAI).使用与病理学无关的分类器和可解释人工智能（XAI）对脊柱姿势数据进行分类和自动解释。

Sensors (Basel). 2021 Sep 21;21(18):6323. doi: 10.3390/s21186323.

A micro-XRT image analysis and machine learning methodology for the characterisation of multi-particulate capsule formulations.一种用于多颗粒胶囊制剂表征的微X射线断层扫描图像分析和机器学习方法。

Int J Pharm X. 2020 Jan 15;2:100041. doi: 10.1016/j.ijpx.2020.100041. eCollection 2020 Dec.

Enhancement of hepatitis virus immunoassay outcome predictions in imbalanced routine pathology data by data balancing and feature selection before the application of support vector machines.通过数据平衡和特征选择，提高支持向量机应用前不平衡常规病理数据中肝炎病毒免疫测定结果预测的准确性。

BMC Med Inform Decis Mak. 2017 Aug 14;17(1):121. doi: 10.1186/s12911-017-0522-5.

本文引用的文献

Unsupervised spatiotemporal fMRI data analysis using support vector machines.使用支持向量机的无监督时空功能磁共振成像数据分析

Neuroimage. 2009 Aug 1;47(1):204-12. doi: 10.1016/j.neuroimage.2009.03.054. Epub 2009 Mar 31.

Diagnostic value of melanoma inhibitory activity serum marker in the follow-up of patients with stage I or II cutaneous melanoma.黑色素瘤抑制活性血清标志物在Ⅰ期或Ⅱ期皮肤黑色素瘤患者随访中的诊断价值

Melanoma Res. 2009 Feb;19(1):17-23. doi: 10.1097/CMR.0b013e32831bc78c.

3D reconstruction of head MRI based on one class support vector machine with immune algorithm.

Annu Int Conf IEEE Eng Med Biol Soc. 2007;2007:6016-9. doi: 10.1109/IEMBS.2007.4353719.

Melanoma inhibiting activity protein (MIA), beta-2 microglobulin and lactate dehydrogenase (LDH) in metastatic melanoma.转移性黑色素瘤中的黑色素瘤抑制活性蛋白（MIA）、β2微球蛋白和乳酸脱氢酶（LDH）

Anticancer Res. 2007 Jan-Feb;27(1B):595-9.

Extraction of brain tumor from MR images using one-class support vector machine.使用一类支持向量机从磁共振图像中提取脑肿瘤。

Conf Proc IEEE Eng Med Biol Soc. 2005;2005:6411-4. doi: 10.1109/IEMBS.2005.1615965.

S-100beta and MIA in advanced melanoma in relation to prognostic factors.晚期黑色素瘤中S-100β和黑色素瘤抑制活性与预后因素的关系。

Anticancer Res. 2005 May-Jun;25(3A):1779-82.

The revised American Joint Committee on Cancer staging system for melanoma.修订后的美国癌症联合委员会黑色素瘤分期系统。

Semin Oncol. 2002 Aug;29(4):361-9. doi: 10.1053/sonc.2002.34115.

Estimating the support of a high-dimensional distribution.估计高维分布的支撑集。

Neural Comput. 2001 Jul;13(7):1443-71. doi: 10.1162/089976601750264965.

The meaning and use of the area under a receiver operating characteristic (ROC) curve.接受者操作特征（ROC）曲线下面积的意义及应用。

Radiology. 1982 Apr;143(1):29-36. doi: 10.1148/radiology.143.1.7063747.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验