• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图的基因选择方法,用于使用多目标 PSO 算法解决医学诊断问题。

A graph-based gene selection method for medical diagnosis problems using a many-objective PSO algorithm.

机构信息

Faculty of Computer Engineering, K. N. Toosi University of Technology, Tehran, Iran.

出版信息

BMC Med Inform Decis Mak. 2021 Nov 27;21(1):333. doi: 10.1186/s12911-021-01696-3.

DOI:10.1186/s12911-021-01696-3
PMID:34838034
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8627636/
Abstract

BACKGROUND

Gene expression data play an important role in bioinformatics applications. Although there may be a large number of features in such data, they mainly tend to contain only a few samples. This can negatively impact the performance of data mining and machine learning algorithms. One of the most effective approaches to alleviate this problem is to use gene selection methods. The aim of gene selection is to reduce the dimensions (features) of gene expression data leading to eliminating irrelevant and redundant genes.

METHODS

This paper presents a hybrid gene selection method based on graph theory and a many-objective particle swarm optimization (PSO) algorithm. To this end, a filter method is first utilized to reduce the initial space of the genes. Then, the gene space is represented as a graph to apply a graph clustering method to group the genes into several clusters. Moreover, the many-objective PSO algorithm is utilized to search an optimal subset of genes according to several criteria, which include classification error, node centrality, specificity, edge centrality, and the number of selected genes. A repair operator is proposed to cover the whole space of the genes and ensure that at least one gene is selected from each cluster. This leads to an increasement in the diversity of the selected genes.

RESULTS

To evaluate the performance of the proposed method, extensive experiments are conducted based on seven datasets and two evaluation measures. In addition, three classifiers-Decision Tree (DT), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN)-are utilized to compare the effectiveness of the proposed gene selection method with other state-of-the-art methods. The results of these experiments demonstrate that our proposed method not only achieves more accurate classification, but also selects fewer genes than other methods.

CONCLUSION

This study shows that the proposed multi-objective PSO algorithm simultaneously removes irrelevant and redundant features using several different criteria. Also, the use of the clustering algorithm and the repair operator has improved the performance of the proposed method by covering the whole space of the problem.

摘要

背景

基因表达数据在生物信息学应用中起着重要作用。尽管这些数据中可能有大量的特征,但它们主要倾向于只包含少数样本。这会对数据挖掘和机器学习算法的性能产生负面影响。缓解这个问题的最有效方法之一是使用基因选择方法。基因选择的目的是减少基因表达数据的维度(特征),从而消除不相关和冗余的基因。

方法

本文提出了一种基于图论和多目标粒子群优化(PSO)算法的混合基因选择方法。为此,首先利用过滤方法来减少基因的初始空间。然后,将基因空间表示为一个图,应用图聚类方法将基因分为几个簇。此外,利用多目标 PSO 算法根据分类错误、节点中心度、特异性、边中心度和选择的基因数量等多个标准搜索最优的基因子集。提出了一个修复算子来覆盖基因的整个空间,并确保从每个簇中至少选择一个基因。这导致所选基因的多样性增加。

结果

为了评估所提出方法的性能,基于七个数据集和两个评估指标进行了广泛的实验。此外,利用三个分类器——决策树(DT)、支持向量机(SVM)和 K-最近邻(KNN)——将所提出的基因选择方法与其他最先进的方法进行了有效性比较。这些实验的结果表明,所提出的方法不仅实现了更准确的分类,而且选择的基因比其他方法更少。

结论

本研究表明,所提出的多目标 PSO 算法使用多个不同的标准同时去除不相关和冗余的特征。此外,聚类算法和修复算子的使用通过覆盖问题的整个空间,提高了所提出方法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/8b6ab29f47f5/12911_2021_1696_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/8239a694fc5e/12911_2021_1696_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/2dfccabb506b/12911_2021_1696_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/638a8c87f68a/12911_2021_1696_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/8b6ab29f47f5/12911_2021_1696_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/8239a694fc5e/12911_2021_1696_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/2dfccabb506b/12911_2021_1696_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/638a8c87f68a/12911_2021_1696_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/779b/8627636/8b6ab29f47f5/12911_2021_1696_Fig4_HTML.jpg

相似文献

1
A graph-based gene selection method for medical diagnosis problems using a many-objective PSO algorithm.基于图的基因选择方法,用于使用多目标 PSO 算法解决医学诊断问题。
BMC Med Inform Decis Mak. 2021 Nov 27;21(1):333. doi: 10.1186/s12911-021-01696-3.
2
A comparative analysis of feature selection models for spatial analysis of floods using hybrid metaheuristic and machine learning models.使用混合元启发式算法和机器学习模型进行洪水空间分析的特征选择模型的比较分析
Environ Sci Pollut Res Int. 2024 May;31(23):33495-33514. doi: 10.1007/s11356-024-33389-5. Epub 2024 Apr 29.
3
A Tri-Stage Wrapper-Filter Feature Selection Framework for Disease Classification.三阶段包装器-过滤器特征选择框架用于疾病分类。
Sensors (Basel). 2021 Aug 18;21(16):5571. doi: 10.3390/s21165571.
4
Hybrid Feature-Learning-Based PSO-PCA Feature Engineering Approach for Blood Cancer Classification.基于混合特征学习的粒子群优化-主成分分析特征工程方法用于血癌分类
Diagnostics (Basel). 2023 Aug 14;13(16):2672. doi: 10.3390/diagnostics13162672.
5
An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.基于基因表达数据的多支持向量机技术的高效特征选择策略。
Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018.
6
Seminal quality prediction using data mining methods.使用数据挖掘方法进行精液质量预测。
Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.
7
Classification of Medical Datasets Using SVMs with Hybrid Evolutionary Algorithms Based on Endocrine-Based Particle Swarm Optimization and Artificial Bee Colony Algorithms.基于基于内分泌粒子群优化和人工蜂群算法的混合进化算法的 SVM 对医疗数据集进行分类。
J Med Syst. 2015 Oct;39(10):306. doi: 10.1007/s10916-015-0306-3. Epub 2015 Aug 20.
8
Support vector machine based diagnostic system for breast cancer using swarm intelligence.基于群智能的支持向量机乳腺癌诊断系统。
J Med Syst. 2012 Aug;36(4):2505-19. doi: 10.1007/s10916-011-9723-0. Epub 2011 May 3.
9
Integration of multi-objective PSO based feature selection and node centrality for medical datasets.基于多目标 PSO 的特征选择和节点中心性在医学数据集上的集成。
Genomics. 2020 Nov;112(6):4370-4384. doi: 10.1016/j.ygeno.2020.07.027. Epub 2020 Jul 25.
10
A gene selection algorithm for microarray cancer classification using an improved particle swarm optimization.基于改进型粒子群算法的基因选择算法在微阵列癌症分类中的应用
Sci Rep. 2024 Aug 23;14(1):19613. doi: 10.1038/s41598-024-68744-6.

引用本文的文献

1
Gene selection based on adaptive neighborhood-preserving multi-objective particle swarm optimization.基于自适应邻域保持多目标粒子群优化的基因选择
PeerJ Comput Sci. 2025 May 28;11:e2872. doi: 10.7717/peerj-cs.2872. eCollection 2025.

本文引用的文献

1
Integration of multi-objective PSO based feature selection and node centrality for medical datasets.基于多目标 PSO 的特征选择和节点中心性在医学数据集上的集成。
Genomics. 2020 Nov;112(6):4370-4384. doi: 10.1016/j.ygeno.2020.07.027. Epub 2020 Jul 25.
2
A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification.基于 ReliefF 和蚁群优化算法的混合基因选择方法在肿瘤分类中的应用。
Sci Rep. 2019 Jun 20;9(1):8978. doi: 10.1038/s41598-019-45223-x.
3
A hybrid gene selection method based on gene scoring strategy and improved particle swarm optimization.
基于基因评分策略和改进粒子群优化的混合基因选择方法。
BMC Bioinformatics. 2019 Jun 10;20(Suppl 8):289. doi: 10.1186/s12859-019-2773-x.
4
Gene selection using hybrid binary black hole algorithm and modified binary particle swarm optimization.使用混合二进制黑洞算法和改进二进制粒子群优化的基因选择。
Genomics. 2019 Jul;111(4):669-686. doi: 10.1016/j.ygeno.2018.04.004. Epub 2018 Apr 14.
5
A New Representation in PSO for Discretization-Based Feature Selection.PSO 中基于离散化的特征选择的新表示。
IEEE Trans Cybern. 2018 Jun;48(6):1733-1746. doi: 10.1109/TCYB.2017.2714145. Epub 2017 Jun 23.
6
A Gene Selection Method for Microarray Data Based on Binary PSO Encoding Gene-to-Class Sensitivity Information.一种基于二进制粒子群优化编码基因到类敏感性信息的微阵列数据基因选择方法。
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):85-96. doi: 10.1109/TCBB.2015.2465906.
7
Cancer Feature Selection and Classification Using a Binary Quantum-Behaved Particle Swarm Optimization and Support Vector Machine.基于二进制量子行为粒子群优化算法和支持向量机的癌症特征选择与分类
Comput Math Methods Med. 2016;2016:3572705. doi: 10.1155/2016/3572705. Epub 2016 Aug 24.
8
Gene selection for cancer classification with the help of bees.借助蜜蜂进行癌症分类的基因选择
BMC Med Genomics. 2016 Aug 10;9 Suppl 2(Suppl 2):47. doi: 10.1186/s12920-016-0204-7.
9
A hybrid gene selection approach for microarray data classification using cellular learning automata and ant colony optimization.一种使用细胞学习自动机和蚁群优化的用于微阵列数据分类的混合基因选择方法。
Genomics. 2016 Jun;107(6):231-8. doi: 10.1016/j.ygeno.2016.05.001. Epub 2016 May 3.
10
Biomarker Discovery Based on Hybrid Optimization Algorithm and Artificial Neural Networks on Microarray Data for Cancer Classification.基于混合优化算法和人工神经网络的微阵列数据生物标志物发现用于癌症分类
J Med Signals Sens. 2015 Apr-Jun;5(2):88-96.