• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于随机全局和局部最优搜索算法的癌症诊断子集生成。

Random Global and Local Optimal Search Algorithm Based Subset Generation for Diagnosis of Cancer.

机构信息

Department of Information Technology, Dr. Mahalingam College of Engineering and Technology, Pollachi, Tamilnadu, India.

出版信息

Curr Med Imaging. 2020;16(3):249-261. doi: 10.2174/1573405614666180720152838.

DOI:10.2174/1573405614666180720152838
PMID:32133955
Abstract

BACKGROUND

Data mining algorithms are extensively used to classify the data, in which prediction of disease using minimal computation time plays a vital role.

OBJECTIVES

The aim of this paper is to develop the classification model from reduced features and instances.

METHODS

In this paper we proposed four search algorithms for feature selection the first algorithm is Random Global Optimal (RGO) search algorithm for searching the continuous, global optimal subset of features from the random population. The second is Global and Local Optimal (GLO) search algorithm for searching the global and local optimal subset of features from population. The third one is Random Local Optimal (RLO) search algorithm for generating random, local optimal subset of features from the random population. Finally the Random Global and Optimal (RGLO) search algorithm for searching the continuous, global and local optimal subset of features from the random population. RGLO search algorithm combines the properties of first three stated algorithm. The subsets of features generated from the proposed four search algorithms are evaluated using the consistency based subset evaluation measure. Instance based learning algorithm is applied to the resulting feature dataset to reduce the instances that are redundant or irrelevant for classification. The model developed using naïve Bayesian classifier from the reduced features and instances is validated with the tenfold cross validation.

RESULTS

Classification accuracy based on RGLO search algorithm using naïve Bayesian classifier is 94.82% for Breast, 97.4% for DLBCL, 98.83% for SRBCT and 98.89% for Leukemia datasets.

CONCLUSION

The RGLO search based reduced features results in the high prediction rate with less computational time when compared with the complete dataset and other proposed subset generation algorithm.

摘要

背景

数据挖掘算法被广泛用于对数据进行分类,其中使用最小计算时间预测疾病起着至关重要的作用。

目的

本文的目的是从减少的特征和实例中开发分类模型。

方法

在本文中,我们提出了四种特征选择搜索算法,第一种算法是随机全局最优(RGO)搜索算法,用于从随机种群中搜索连续的、全局最优的特征子集。第二种是全局和局部最优(GLO)搜索算法,用于从种群中搜索全局和局部最优的特征子集。第三种是随机局部最优(RLO)搜索算法,用于从随机种群中生成随机的、局部最优的特征子集。最后是随机全局和最优(RGLO)搜索算法,用于从随机种群中搜索连续的、全局和局部最优的特征子集。RGLO 搜索算法结合了前三种算法的特性。使用基于一致性的子集评估度量对从提出的四种搜索算法生成的特征子集进行评估。实例基学习算法应用于生成的特征数据集,以减少对分类冗余或无关的实例。从减少的特征和实例中使用朴素贝叶斯分类器开发的模型通过十折交叉验证进行验证。

结果

基于 RGLO 搜索算法和朴素贝叶斯分类器的分类准确率分别为乳腺癌数据集 94.82%、DLBCL 数据集 97.4%、SRBCT 数据集 98.83%和白血病数据集 98.89%。

结论

与完整数据集和其他提出的子集生成算法相比,基于 RGLO 搜索的减少特征的结果具有较高的预测率和较少的计算时间。

相似文献

1
Random Global and Local Optimal Search Algorithm Based Subset Generation for Diagnosis of Cancer.基于随机全局和局部最优搜索算法的癌症诊断子集生成。
Curr Med Imaging. 2020;16(3):249-261. doi: 10.2174/1573405614666180720152838.
2
Classification of Medical Datasets Using SVMs with Hybrid Evolutionary Algorithms Based on Endocrine-Based Particle Swarm Optimization and Artificial Bee Colony Algorithms.基于基于内分泌粒子群优化和人工蜂群算法的混合进化算法的 SVM 对医疗数据集进行分类。
J Med Syst. 2015 Oct;39(10):306. doi: 10.1007/s10916-015-0306-3. Epub 2015 Aug 20.
3
An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset.一种用于多维数据集分类的高效特征子集选择算法。
ScientificWorldJournal. 2015;2015:821798. doi: 10.1155/2015/821798. Epub 2015 Sep 28.
4
A Tri-Stage Wrapper-Filter Feature Selection Framework for Disease Classification.三阶段包装器-过滤器特征选择框架用于疾病分类。
Sensors (Basel). 2021 Aug 18;21(16):5571. doi: 10.3390/s21165571.
5
Improved intelligent water drop-based hybrid feature selection method for microarray data processing.基于智能水滴的改进型混合特征选择方法在微阵列数据处理中的应用。
Comput Biol Chem. 2023 Apr;103:107809. doi: 10.1016/j.compbiolchem.2022.107809. Epub 2023 Jan 13.
6
An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.基于基因表达数据的多支持向量机技术的高效特征选择策略。
Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018.
7
An Innovative Excited-ACS-IDGWO Algorithm for Optimal Biomedical Data Feature Selection.一种创新的基于激发 ACS-IDGWO 算法的最优生物医学数据特征选择方法。
Biomed Res Int. 2020 Aug 17;2020:8506365. doi: 10.1155/2020/8506365. eCollection 2020.
8
Correlation-Based Ensemble Feature Selection Using Bioinspired Algorithms and Classification Using Backpropagation Neural Network.基于生物启发算法的相关性集成特征选择和反向传播神经网络分类。
Comput Math Methods Med. 2019 Sep 23;2019:7398307. doi: 10.1155/2019/7398307. eCollection 2019.
9
An improved binary particle swarm optimization algorithm for clinical cancer biomarker identification in microarray data.一种用于微阵列数据中临床癌症生物标志物识别的改进二元粒子群优化算法。
Comput Methods Programs Biomed. 2024 Feb;244:107987. doi: 10.1016/j.cmpb.2023.107987. Epub 2023 Dec 21.
10
Feature selection for elderly faller classification based on wearable sensors.基于可穿戴传感器的老年人跌倒者分类特征选择
J Neuroeng Rehabil. 2017 May 30;14(1):47. doi: 10.1186/s12984-017-0255-9.

引用本文的文献

1
Block Matching Pyramid Algorithm-Based Analysis on Efficacy of Shexiang Baoxin Pills Guided by Echocardiogram (ECG) on Patients with Angina Pectoris in Coronary Heart Disease.基于子波匹配金字塔算法的麝香保心丸治疗冠心病心绞痛患者的超声心动图(ECG)疗效分析。
J Healthc Eng. 2021 Aug 6;2021:3819900. doi: 10.1155/2021/3819900. eCollection 2021.