• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于基因选择问题的基于迭代贪婪算法的混合黑寡妇优化算法

Hybrid black widow optimization with iterated greedy algorithm for gene selection problems.

作者信息

Alweshah Mohammed, Aldabbas Yasmeen, Abu-Salih Bilal, Oqeil Saleh, Hasan Hazem S, Alkhalaileh Saleh, Kassaymeh Sofian

机构信息

Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan.

Department of Computer Science, King Abdullah II School of Information Technology, The University of Jordan, Amman, Jordan.

出版信息

Heliyon. 2023 Sep 14;9(9):e20133. doi: 10.1016/j.heliyon.2023.e20133. eCollection 2023 Sep.

DOI:10.1016/j.heliyon.2023.e20133
PMID:37809602
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10559925/
Abstract

Gene Selection (GS) is a strategy method targeted at reducing redundancy, limited expressiveness, and low informativeness in gene expression datasets obtained by DNA Microarray technology. These datasets contain a plethora of diverse and high-dimensional samples and genes, with a significant discrepancy in the number of samples and genes present. The complexities of GS are especially noticeable in the context of microarray expression data analysis, owing to the inherent data imbalance. The main goal of this study is to offer a simplified and computationally effective approach to dealing with the conundrum of attribute selection in microarray gene expression data. We use the Black Widow Optimization algorithm (BWO) in the context of GS to achieve this, using two unique methodologies: the unaltered BWO variation and the hybridized BWO variant combined with the Iterated Greedy algorithm (BWO-IG). By improving the local search capabilities of BWO, this hybridization attempts to promote more efficient gene selection. A series of tests was carried out using nine benchmark datasets that were obtained from the gene expression data repository in the pursuit of empirical validation. The results of these tests conclusively show that the BWO-IG technique performs better than the traditional BWO algorithm. Notably, the hybridized BWO-IG technique excels in the efficiency of local searches, making it easier to identify relevant genes and producing findings with higher levels of reliability in terms of accuracy and the degree of gene pruning. Additionally, a comparison analysis is done against five modern wrapper Feature Selection (FS) methodologies, namely BIMFOHHO, BMFO, BHHO, BCS, and BBA, in order to put the suggested BWO-IG method's effectiveness into context. The comparison that follows highlights BWO-IG's obvious superiority in reducing the number of selected genes while also obtaining remarkably high classification accuracy. The key findings were an average classification accuracy of 94.426, average fitness values of 0.061, and an average number of selected genes of 2933.767.

摘要

基因选择(GS)是一种策略方法,旨在减少通过DNA微阵列技术获得的基因表达数据集中的冗余、有限的表达能力和低信息性。这些数据集包含大量多样的高维样本和基因,样本和基因的数量存在显著差异。由于固有的数据不平衡,基因选择的复杂性在微阵列表达数据分析的背景下尤为明显。本研究的主要目标是提供一种简化且计算高效的方法来处理微阵列基因表达数据中的属性选择难题。我们在基因选择的背景下使用黑寡妇优化算法(BWO)来实现这一目标,采用两种独特的方法:未改变的BWO变体以及与迭代贪婪算法相结合的杂交BWO变体(BWO-IG)。通过提高BWO的局部搜索能力,这种杂交试图促进更有效的基因选择。为了进行实证验证,使用从基因表达数据库获得的九个基准数据集进行了一系列测试。这些测试结果最终表明,BWO-IG技术的性能优于传统的BWO算法。值得注意的是,杂交的BWO-IG技术在局部搜索效率方面表现出色,使其更容易识别相关基因,并在准确性和基因剪枝程度方面产生具有更高可靠性的结果。此外,针对五种现代包装特征选择(FS)方法,即BIMFOHHO、BMFO、BHHO、BCS和BBA,进行了比较分析,以便将所提出的BWO-IG方法的有效性置于背景中。随后的比较突出了BWO-IG在减少所选基因数量的同时还能获得极高分类准确率方面的明显优势。关键结果是平均分类准确率为94.426,平均适应度值为0.061,平均所选基因数量为2933.767。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/bcbdd1ba523f/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/ac6eaf884d6c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/5a46c49508aa/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/360d9252ea07/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/6f5b35e3f544/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/2c8d839f7767/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/bcbdd1ba523f/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/ac6eaf884d6c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/5a46c49508aa/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/360d9252ea07/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/6f5b35e3f544/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/2c8d839f7767/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5792/10559925/bcbdd1ba523f/gr6.jpg

相似文献

1
Hybrid black widow optimization with iterated greedy algorithm for gene selection problems.用于基因选择问题的基于迭代贪婪算法的混合黑寡妇优化算法
Heliyon. 2023 Sep 14;9(9):e20133. doi: 10.1016/j.heliyon.2023.e20133. eCollection 2023 Sep.
2
ST-AL: a hybridized search based metaheuristic computational algorithm towards optimization of high dimensional industrial datasets.ST-AL:一种基于混合搜索的元启发式计算算法,用于优化高维工业数据集。
Soft comput. 2022 May 9:1-29. doi: 10.1007/s00500-022-07115-7.
3
A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification.基于 ReliefF 和蚁群优化算法的混合基因选择方法在肿瘤分类中的应用。
Sci Rep. 2019 Jun 20;9(1):8978. doi: 10.1038/s41598-019-45223-x.
4
A new population initialization of metaheuristic algorithms based on hybrid fuzzy rough set for high-dimensional gene data feature selection.基于混合模糊粗糙集的启发式算法在高维基因数据特征选择中的一种新的种群初始化方法。
Comput Biol Med. 2023 Nov;166:107538. doi: 10.1016/j.compbiomed.2023.107538. Epub 2023 Oct 4.
5
MSBWO: A Multi-Strategies Improved Beluga Whale Optimization Algorithm for Feature Selection.MSBWO:一种用于特征选择的多策略改进白鲸优化算法
Biomimetics (Basel). 2024 Sep 22;9(9):572. doi: 10.3390/biomimetics9090572.
6
A novel bio-inspired hybrid multi-filter wrapper gene selection method with ensemble classifier for microarray data.一种用于微阵列数据的、基于集成分类器的新型生物启发式混合多滤波器包装基因选择方法。
Neural Comput Appl. 2023;35(16):11531-11561. doi: 10.1007/s00521-021-06459-9. Epub 2021 Sep 12.
7
Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis.用于医学诊断特征选择的具有贪婪交叉的冠状病毒群体免疫优化器
Knowl Based Syst. 2022 Jan 10;235:107629. doi: 10.1016/j.knosys.2021.107629. Epub 2021 Oct 29.
8
Binary Sand Cat Swarm Optimization Algorithm for Wrapper Feature Selection on Biological Data.用于生物数据包装器特征选择的二进制沙猫群优化算法
Biomimetics (Basel). 2023 Jul 14;8(3):310. doi: 10.3390/biomimetics8030310.
9
Parameter estimation of proton exchange membrane fuel cell using a novel meta-heuristic algorithm.使用新型启发式算法对质子交换膜燃料电池进行参数估计。
Environ Sci Pollut Res Int. 2021 Jul;28(26):34511-34526. doi: 10.1007/s11356-021-13097-0. Epub 2021 Mar 2.
10
Improved intelligent water drop-based hybrid feature selection method for microarray data processing.基于智能水滴的改进型混合特征选择方法在微阵列数据处理中的应用。
Comput Biol Chem. 2023 Apr;103:107809. doi: 10.1016/j.compbiolchem.2022.107809. Epub 2023 Jan 13.

引用本文的文献

1
Optimized models and deep learning methods for drug response prediction in cancer treatments: a review.癌症治疗中药物反应预测的优化模型和深度学习方法:综述
PeerJ Comput Sci. 2024 Mar 25;10:e1903. doi: 10.7717/peerj-cs.1903. eCollection 2024.

本文引用的文献

1
Machine Learning and Deep Learning Applications in Multiple Myeloma Diagnosis, Prognosis, and Treatment Selection.机器学习与深度学习在多发性骨髓瘤诊断、预后及治疗选择中的应用
Cancers (Basel). 2022 Jan 25;14(3):606. doi: 10.3390/cancers14030606.
2
Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis.用于医学诊断特征选择的具有贪婪交叉的冠状病毒群体免疫优化器
Knowl Based Syst. 2022 Jan 10;235:107629. doi: 10.1016/j.knosys.2021.107629. Epub 2021 Oct 29.
3
An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.
基于高效混合过滤-包装元启发式算法的高维数据集基因选择方法。
Sci Rep. 2019 Dec 9;9(1):18580. doi: 10.1038/s41598-019-54987-1.
4
Co-ABC: Correlation artificial bee colony algorithm for biomarker gene discovery using gene expression profile.协同人工蜂群算法(Co-ABC):利用基因表达谱发现生物标志物基因的相关人工蜂群算法
Saudi J Biol Sci. 2018 Jul;25(5):895-903. doi: 10.1016/j.sjbs.2017.12.012. Epub 2018 Jan 3.
5
Multiobjective binary biogeography based optimization for feature selection using gene expression data.基于多目标二进制生物地理学优化的基因表达数据特征选择。
IEEE Trans Nanobioscience. 2013 Dec;12(4):343-53. doi: 10.1109/tnb.2013.2294716.
6
Sibling cannibalism in a web-building spider: effects of density and shared environment.结网蜘蛛中的同胞相食现象:密度和共享环境的影响
Behav Processes. 2014 Jul;106:12-6. doi: 10.1016/j.beproc.2014.03.011. Epub 2014 Apr 12.
7
Gene selection from microarray data for cancer classification--a machine learning approach.基于机器学习方法从微阵列数据中进行癌症分类的基因选择
Comput Biol Chem. 2005 Feb;29(1):37-46. doi: 10.1016/j.compbiolchem.2004.11.001.
8
Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.癌症的分子分类:通过基因表达监测进行类别发现和类别预测。
Science. 1999 Oct 15;286(5439):531-7. doi: 10.1126/science.286.5439.531.