Feature Selection and Classification of Clinical Datasets Using Bioinspired Algorithms and Super Learner.

Affiliations

Ramanujan Computing Centre, Anna University, Chennai 600025, India.

Department of Computer Science and Engineering, Anna University, Chennai 600025, India.

Publication information

Comput Math Methods Med. 2021 May 17;2021:6662420. doi: 10.1155/2021/6662420. eCollection 2021.

DOI:10.1155/2021/6662420

PMID:34055041

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8149240/

Abstract

A computer-aided diagnosis (CAD) system that employs a super learner to diagnose the presence or absence of a disease has been developed. Each clinical dataset is preprocessed and split into a training set (60%) and a testing set (40%). Feature selection uses a wrapper approach built on three bioinspired algorithms, namely cat swarm optimization (CSO), krill herd (KH), and bacterial foraging optimization (BFO), with the classification accuracy of a support vector machine (SVM) as the fitness function. The features selected by each bioinspired algorithm are stored in three separate databases and are used to train three backpropagation neural networks (BPNNs) independently using the conjugate gradient algorithm (CGA). Each trained classifier is then run on the testing set, and the diagnostic results are used to evaluate its performance. The classification results that the three classifiers produce for each instance of the testing set, together with the class label of that instance, form the candidate instances for training and testing the super learner. Of these candidate instances, 80% make up the super learner's training set and 20% make up its testing set. Experiments were carried out on seven clinical datasets from the University of California Irvine (UCI) machine learning repository. The super learner achieved a classification accuracy of 96.83% on the Wisconsin diagnostic breast cancer dataset (WDBC), 86.36% on the Statlog heart disease dataset (SHD), 94.74% on the hepatocellular carcinoma dataset (HCC), 90.48% on the hepatitis dataset (HD), 81.82% on the vertebral column dataset (VCD), 84% on the Cleveland heart disease dataset (CHD), and 70% on the Indian liver patient dataset (ILP).
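The data flow into the super learner can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which model the super learner uses, so a simple lookup-table meta-learner stands in for it, and the three BPNN outputs are replaced by synthetic noisy predictions. All function names (`split`, `build_meta_instances`, `train_lookup_meta_learner`, `noisy_copy`) and the error rates are hypothetical choices made for illustration only.

```python
import random
from collections import Counter, defaultdict


def split(rows, train_fraction, seed):
    """Shuffle a copy of rows and split it into (train, test) parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = round(train_fraction * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


def build_meta_instances(base_predictions, labels):
    """Pair the base classifiers' predictions for each instance with its true label."""
    return [(pattern, y) for pattern, y in zip(zip(*base_predictions), labels)]


def train_lookup_meta_learner(meta_train):
    """Stand-in meta-learner (an assumption): most frequent label per prediction pattern."""
    counts = defaultdict(Counter)
    for pattern, y in meta_train:
        counts[pattern][y] += 1
    table = {pattern: c.most_common(1)[0][0] for pattern, c in counts.items()}

    def predict(pattern):
        # Patterns never seen in training fall back to a majority vote
        # over the base classifiers' predictions.
        if pattern in table:
            return table[pattern]
        return Counter(pattern).most_common(1)[0][0]

    return predict


def noisy_copy(labels, error_rate, seed):
    """Synthetic stand-in for one base classifier's output on the testing set."""
    rng = random.Random(seed)
    return [1 - y if rng.random() < error_rate else y for y in labels]


# Synthetic class labels for the instances of the original 40% testing set.
label_rng = random.Random(0)
labels = [label_rng.randint(0, 1) for _ in range(200)]

# One prediction list per base classifier (CSO-, KH-, and BFO-based features).
base_predictions = [
    noisy_copy(labels, 0.10, seed=1),
    noisy_copy(labels, 0.15, seed=2),
    noisy_copy(labels, 0.20, seed=3),
]

# Candidate instances for the super learner, split 80% / 20%.
meta = build_meta_instances(base_predictions, labels)
meta_train, meta_test = split(meta, 0.8, seed=42)

predict = train_lookup_meta_learner(meta_train)
accuracy = sum(predict(p) == y for p, y in meta_test) / len(meta_test)
print(f"super learner accuracy on synthetic data: {accuracy:.2%}")
```

Because the inputs are synthetic, the printed accuracy says nothing about the reported results. The sketch only shows how each meta-instance combines three base predictions with one class label before the 80/20 split.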

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b6/8149240/19bcc9d2b47b/CMMM2021-6662420.001.jpg

Similar articles

Correlation-Based Ensemble Feature Selection Using Bioinspired Algorithms and Classification Using Backpropagation Neural Network.

Comput Math Methods Med. 2019 Sep 23;2019:7398307. doi: 10.1155/2019/7398307. eCollection 2019.

Knowledge mining from clinical datasets using rough sets and backpropagation neural network.

Comput Math Methods Med. 2015;2015:460189. doi: 10.1155/2015/460189. Epub 2015 Mar 4.

Feature selection using binary particle swarm optimization and support vector machines for medical diagnosis.

Biomed Tech (Berl). 2012 Oct;57(5):395-402. doi: 10.1515/bmt-2012-0009.

Clinical data classification using an enhanced SMOTE and chaotic evolutionary feature selection.

Comput Biol Med. 2020 Nov;126:103991. doi: 10.1016/j.compbiomed.2020.103991. Epub 2020 Sep 18.

Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms.

Comput Math Methods Med. 2021 Dec 20;2021:8500314. doi: 10.1155/2021/8500314. eCollection 2021.

Classification of Benign and Malignant Breast Masses on Mammograms for Large Datasets using Core Vector Machines.

Curr Med Imaging. 2020;16(6):703-710. doi: 10.2174/1573405615666190801121506.

Support vector machine based diagnostic system for breast cancer using swarm intelligence.

J Med Syst. 2012 Aug;36(4):2505-19. doi: 10.1007/s10916-011-9723-0. Epub 2011 May 3.

A new machine learning technique for an accurate diagnosis of coronary artery disease.

Comput Methods Programs Biomed. 2019 Oct;179:104992. doi: 10.1016/j.cmpb.2019.104992. Epub 2019 Jul 24.

Reviewing ensemble classification methods in breast cancer.

Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20.

Cited by

A Deep Learning and Explainable Artificial Intelligence based Scheme for Breast Cancer Detection.

Sci Rep. 2025 Sep 1;15(1):32125. doi: 10.1038/s41598-024-80535-7.

CSA-DE-LR: enhancing cardiovascular disease diagnosis with a novel hybrid machine learning approach.

PeerJ Comput Sci. 2024 Jul 18;10:e2197. doi: 10.7717/peerj-cs.2197. eCollection 2024.

A voting-based machine learning approach for classifying biological and clinical datasets.

BMC Bioinformatics. 2023 Apr 11;24(1):140. doi: 10.1186/s12859-023-05274-4.

Integrating Internet multisource big data to predict the occurrence and development of COVID-19 cryptic transmission.

NPJ Digit Med. 2022 Oct 28;5(1):161. doi: 10.1038/s41746-022-00704-8.

Surface and Structural Studies of Age-Related Changes in Dental Enamel: An Animal Model.

Materials (Basel). 2022 Jun 3;15(11):3993. doi: 10.3390/ma15113993.
