SVM and SVM Ensembles in Breast Cancer Prediction.

Authors

Huang Min-Wei, Chen Chih-Wen, Lin Wei-Chao, Ke Shih-Wen, Tsai Chih-Fong

Affiliations

Department of Psychiatry, Chiayi Branch, Taichung Veterans General Hospital, Chiayi, Taiwan.

Department of Pharmacy, Kaohsiung Municipal Chinese Medical Hospital, Kaohsiung, Taiwan.

Publication information

PLoS One. 2017 Jan 6;12(1):e0161501. doi: 10.1371/journal.pone.0161501. eCollection 2017.

DOI: 10.1371/journal.pone.0161501

PMID: 28060807

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5217832/

Abstract

Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been shown to outperform many related techniques. To construct the SVM classifier, it is first necessary to decide the kernel function, and different kernel functions can result in different prediction performance. However, there have been very few studies focused on examining the prediction performances of SVM based on different kernel functions. Moreover, it is unknown whether SVM classifier ensembles which have been proposed to improve the performance of single classifiers can outperform single SVM classifiers in terms of breast cancer prediction. Therefore, the aim of this paper is to fully assess the prediction performance of SVM and SVM ensembles over small and large scale breast cancer datasets. The classification accuracy, ROC, F-measure, and computational times of training SVM and SVM ensembles are compared. The experimental results show that linear kernel based SVM ensembles based on the bagging method and RBF kernel based SVM ensembles with the boosting method can be the better choices for a small scale dataset, where feature selection should be performed in the data pre-processing stage. For a large scale dataset, RBF kernel based SVM ensembles based on boosting perform better than the other classifiers.
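The comparison the abstract describes, a single SVM against a bagging-based SVM ensemble scored by accuracy and F-measure, can be illustrated with a small sketch. Everything below is an assumption made for illustration and not the authors' implementation: the data are synthetic two-dimensional points rather than a breast cancer dataset, the linear SVM is trained with a Pegasos-style stochastic subgradient method, and the helper names, the regularization value `lam`, and the ensemble size of 11 are hypothetical. RBF kernels, boosting, ROC analysis, feature selection, and timing are not covered.

```python
import random


def dot(w, x):
    return sum(wj * xj for wj, xj in zip(w, x))


def train_linear_svm(X, y, lam=0.01, epochs=10, seed=0):
    """Pegasos-style SGD on the hinge loss; labels must be -1 or +1.

    A constant 1.0 is appended to every sample so the last weight acts as a bias.
    """
    rng = random.Random(seed)
    Xa = [list(x) + [1.0] for x in X]
    w = [0.0] * len(Xa[0])
    t = 0
    for _ in range(epochs):
        order = list(range(len(Xa)))
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)                      # decaying step size
            violated = y[i] * dot(w, Xa[i]) < 1.0      # margin check with current w
            w = [(1.0 - eta * lam) * wj for wj in w]   # shrink (regularization)
            if violated:
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, Xa[i])]
    return w


def predict_one(w, x):
    return 1 if dot(w, list(x) + [1.0]) >= 0.0 else -1


def train_bagging(X, y, n_estimators=11, seed=0):
    """Train each SVM on a bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    n = len(X)
    models = []
    for k in range(n_estimators):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(
            train_linear_svm([X[i] for i in idx], [y[i] for i in idx], seed=seed + k)
        )
    return models


def predict_bagging(models, x):
    """Majority vote; an odd ensemble size avoids ties."""
    votes = sum(predict_one(w, x) for w in models)
    return 1 if votes >= 0 else -1


def accuracy(y_true, y_pred):
    return sum(a == b for a, b in zip(y_true, y_pred)) / len(y_true)


def f_measure(y_true, y_pred):
    """F1 score for the positive (+1) class."""
    tp = sum(a == 1 and b == 1 for a, b in zip(y_true, y_pred))
    fp = sum(a == -1 and b == 1 for a, b in zip(y_true, y_pred))
    fn = sum(a == 1 and b == -1 for a, b in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def make_blobs(n_per_class, seed):
    """Synthetic stand-in data: two Gaussian clusters, one per class."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (1, -1):
        for _ in range(n_per_class):
            X.append([rng.gauss(2.0 * label, 1.0), rng.gauss(2.0 * label, 1.0)])
            y.append(label)
    return X, y


X_train, y_train = make_blobs(100, seed=1)
X_test, y_test = make_blobs(50, seed=2)

single = train_linear_svm(X_train, y_train)
ensemble = train_bagging(X_train, y_train)

single_pred = [predict_one(single, x) for x in X_test]
ensemble_pred = [predict_bagging(ensemble, x) for x in X_test]

print("single SVM  accuracy / F-measure:",
      accuracy(y_test, single_pred), f_measure(y_test, single_pred))
print("bagged SVMs accuracy / F-measure:",
      accuracy(y_test, ensemble_pred), f_measure(y_test, ensemble_pred))
```

On data this easy the two models are likely to score almost identically; the sketch only shows how the pieces fit together and says nothing about the results reported in the paper.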

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7943/5217832/4cee23cc05b0/pone.0161501.g001.jpg

Similar articles

Vicinal support vector classifier using supervised kernel-based clustering.

Artif Intell Med. 2014 Mar;60(3):189-96. doi: 10.1016/j.artmed.2014.01.003. Epub 2014 Feb 7.

Computer-Aided Detection of Incidental Lumbar Spine Fractures from Routine Dual-Energy X-Ray Absorptiometry (DEXA) Studies Using a Support Vector Machine (SVM) Classifier.

J Digit Imaging. 2020 Feb;33(1):204-210. doi: 10.1007/s10278-019-00224-0.

Data-driven diagnosis of spinal abnormalities using feature selection and machine learning algorithms.

PLoS One. 2020 Feb 6;15(2):e0228422. doi: 10.1371/journal.pone.0228422. eCollection 2020.

White box radial basis function classifiers with component selection for clinical prediction models.

Artif Intell Med. 2014 Jan;60(1):53-64. doi: 10.1016/j.artmed.2013.10.001. Epub 2013 Oct 18.

A reliable method for colorectal cancer prediction based on feature selection and support vector machine.

Med Biol Eng Comput. 2019 Apr;57(4):901-912. doi: 10.1007/s11517-018-1930-0. Epub 2018 Nov 26.

Hadamard Kernel SVM with applications for breast cancer outcome predictions.

BMC Syst Biol. 2017 Dec 21;11(Suppl 7):138. doi: 10.1186/s12918-017-0514-1.

Protein subcellular localization prediction using multiple kernel learning based support vector machine.

Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.

Classification of Benign and Malignant Breast Masses on Mammograms for Large Datasets using Core Vector Machines.

Curr Med Imaging. 2020;16(6):703-710. doi: 10.2174/1573405615666190801121506.

Machine learning models in breast cancer survival prediction.

Technol Health Care. 2016;24(1):31-42. doi: 10.3233/THC-151071.

Cited by

Improving Hepatitis B outcome prediction with ensemble machine learning: A study on predictive models and interpretability.

Digit Health. 2025 Jun 16;11:20552076251350755. doi: 10.1177/20552076251350755. eCollection 2025 Jan-Dec.

Personalized predictions of neoadjuvant chemotherapy response in breast cancer using machine learning and full-field digital mammography radiomics.

Front Med (Lausanne). 2025 Apr 17;12:1582560. doi: 10.3389/fmed.2025.1582560. eCollection 2025.

Innovative approach towards early prediction of ovarian cancer: Machine learning-enabled XAI techniques.

Heliyon. 2024 Apr 15;10(9):e29197. doi: 10.1016/j.heliyon.2024.e29197. eCollection 2024 May 15.

Identification of novel diagnostic biomarkers associated with liver metastasis in colon adenocarcinoma by machine learning.

Discov Oncol. 2024 Oct 10;15(1):542. doi: 10.1007/s12672-024-01398-y.

Reconstruction of Protein-Protein Interaction Network Based on DGO-SVM Method.

Curr Issues Mol Biol. 2024 Jul 12;46(7):7353-7372. doi: 10.3390/cimb46070436.

Smart Biosensor for Breast Cancer Survival Prediction Based on Multi-View Multi-Way Graph Learning.

Sensors (Basel). 2024 May 21;24(11):3289. doi: 10.3390/s24113289.

Consumer electronics based smart technologies for enhanced terahertz healthcare having an integration of split learning with medical imaging.

Sci Rep. 2024 May 6;14(1):10412. doi: 10.1038/s41598-024-58741-0.

Exploring the potential of machine learning in gynecological care: a review.

Arch Gynecol Obstet. 2024 Jun;309(6):2347-2365. doi: 10.1007/s00404-024-07479-1. Epub 2024 Apr 16.

Machine learning models for predicting the onset of chronic kidney disease after surgery in patients with renal cell carcinoma.

BMC Med Inform Decis Mak. 2024 Mar 22;24(1):85. doi: 10.1186/s12911-024-02473-8.

Blood Biomarkers Panels for Screening of Colorectal Cancer and Adenoma on a Machine Learning-Assisted Detection Platform.

Cancer Control. 2023 Jan-Dec;30:10732748231222109. doi: 10.1177/10732748231222109.

References

Machine learning applications in cancer prognosis and prediction.

Comput Struct Biotechnol J. 2014 Nov 15;13:8-17. doi: 10.1016/j.csbj.2014.11.005. eCollection 2015.

Applications of machine learning in cancer prediction and prognosis.

Cancer Inform. 2007 Feb 11;2:59-77.

Splice site identification using probabilistic parameters and SVM classification.

BMC Bioinformatics. 2006 Dec 18;7 Suppl 5(Suppl 5):S15. doi: 10.1186/1471-2105-7-S5-S15.

Hybrid genetic algorithms for feature selection.

IEEE Trans Pattern Anal Mach Intell. 2004 Nov;26(11):1424-37. doi: 10.1109/TPAMI.2004.105.
