Principal component based support vector machine (PC-SVM): a hybrid technique for software defect detection.

Authors

Mustaqeem Mohd, Saqib Mohd

Affiliations

CSE Department, Institute of Technology & Management (A.K.T.U), Aligarh, U.P India.

Mathematic and Computing Department, Indian Institute of Technology (ISM), Dhanbad, Jharkhand India.

Publication information

Cluster Comput. 2021;24(3):2581-2595. doi: 10.1007/s10586-021-03282-8. Epub 2021 Apr 16.

DOI:10.1007/s10586-021-03282-8

PMID:33880074

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8050160/

Abstract

Software defects are a major problem, and predicting them is a difficult task. Researchers have developed many software defect prediction techniques to address this issue, but there is still a need for a method that predicts defects more accurately while reducing time and space complexity. Previous research conducted on the data without feature reduction suffers from the curse of dimensionality. We propose a hybrid machine learning approach that combines principal component analysis (PCA) and support vector machines (SVM) to address this problem. We used the PROMISE datasets from the NASA repository (CM1: 344 observations; KC1: 2109 observations) and split each into a training set (CM1: 240 observations; KC1: 1476 observations) and a testing set (CM1: 104 observations; KC1: 633 observations). Using PCA, we find the principal components for feature optimization, which reduces the time complexity. We then apply SVM for classification because of its advantages over traditional and conventional methods, and we use the GridSearchCV method for hyperparameter tuning. The proposed hybrid model achieved better accuracy (CM1: 95.2%, KC1: 86.6%) than other methods and also scored higher on other evaluation criteria. One limitation is that SVM offers no probabilistic explanation for its classifications, which can make it very rigid. In the future, other methods may be introduced that overcome this limitation and keep a soft, probability-based margin for classification on the optimal hyperplane.
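The workflow described in the abstract (train/test split, PCA for feature reduction, SVM for classification, GridSearchCV for tuning) can be sketched with scikit-learn roughly as follows. This is a minimal illustration and not the authors' implementation: the data are synthetic (only the row counts mirror the CM1 split sizes), and the feature count, the standardization step, the parameter grid, and the 5-fold cross-validation are all assumptions that the abstract does not specify.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for a defect dataset (NOT the real CM1 data).
# 344 rows mirror CM1's size; 21 features and the class imbalance are assumptions.
X, y = make_classification(
    n_samples=344, n_features=21, n_informative=8,
    weights=[0.85, 0.15], random_state=0,
)

# Hold out 104 rows for testing, leaving 240 for training (the CM1 split sizes).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=104, stratify=y, random_state=0
)

# Standardize, project onto principal components, then classify with an SVM.
# The scaling step is an assumption; PCA is sensitive to feature scale.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("pca", PCA()),
    ("svm", SVC()),
])

# Hypothetical search grid; the abstract does not list the values it tuned.
param_grid = {
    "pca__n_components": [5, 10],
    "svm__C": [0.1, 1, 10],
    "svm__kernel": ["linear", "rbf"],
}

# Cross-validated grid search on the training set only.
search = GridSearchCV(pipeline, param_grid, cv=5, scoring="accuracy")
search.fit(X_train, y_train)

# Accuracy of the refitted best pipeline on the held-out test set.
test_accuracy = search.score(X_test, y_test)
print("best parameters:", search.best_params_)
print("held-out accuracy:", round(test_accuracy, 3))
```

Wrapping PCA and the SVM in one pipeline means the principal components are re-estimated inside each cross-validation fold, so the tuning step never sees the validation rows. The accuracy printed here reflects the synthetic data only and says nothing about the figures reported in the abstract.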
