基于互信息和遗传算法的两阶段混合基因选择在癌症数据分类中的应用。

Two-Stage Hybrid Gene Selection Using Mutual Information and Genetic Algorithm for Cancer Data Classification.

机构信息

School of Computing, Kalasalingam Academy of Research and Education, Krishnankoil, Virudhunagar, India.

School of Electronics & Electrical Technology, Kalasalingam Academy of Research and Education, Krishnankoil, Virudhunagar, India.

出版信息

J Med Syst. 2019 Jun 17;43(8):235. doi: 10.1007/s10916-019-1372-8.

DOI:10.1007/s10916-019-1372-8

PMID:31209677

Abstract

Cancer is a deadly disease which requires a very complex and costly treatment. Microarray data classification plays an important role in cancer treatment. An efficient gene selection technique to select the more promising genes is necessary for cancer classification. Here, we propose a Two-stage MI-GA Gene Selection algorithm for selecting informative genes in cancer data classification. In the first stage, Mutual Information based gene selection is applied which selects only the genes that have high information related to the cancer. The genes which have high mutual information value are given as input to the second stage. The Genetic Algorithm based gene selection is applied in the second stage to identify and select the optimal set of genes required for accurate classification. For classification, Support Vector Machine (SVM) is used. The proposed MI-GA gene selection approach is applied to Colon, Lung and Ovarian cancer datasets and the results show that the proposed gene selection approach results in higher classification accuracy compared to the existing methods.

摘要

癌症是一种致命的疾病，需要非常复杂和昂贵的治疗。微阵列数据分析分类在癌症治疗中起着重要作用。为了进行癌症分类，有必要选择一种有效的基因选择技术来选择更有前途的基因。在这里，我们提出了一种两阶段 MI-GA 基因选择算法，用于选择癌症数据分类中的信息基因。在第一阶段，应用基于互信息的基因选择，仅选择与癌症高度相关的基因。将具有高互信息值的基因作为输入提供给第二阶段。在第二阶段应用基于遗传算法的基因选择，以识别和选择准确分类所需的最佳基因集。对于分类，使用支持向量机（SVM）。将提出的 MI-GA 基因选择方法应用于结肠、肺和卵巢癌数据集，结果表明，与现有方法相比，所提出的基因选择方法可实现更高的分类准确性。

相似文献

Two-Stage Hybrid Gene Selection Using Mutual Information and Genetic Algorithm for Cancer Data Classification.基于互信息和遗传算法的两阶段混合基因选择在癌症数据分类中的应用。

J Med Syst. 2019 Jun 17;43(8):235. doi: 10.1007/s10916-019-1372-8.

mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.mRMR-ABC：一种利用微阵列基因表达谱进行癌症分类的混合基因选择算法。

Biomed Res Int. 2015;2015:604910. doi: 10.1155/2015/604910. Epub 2015 Apr 15.

Hybrid Feature Selection Algorithm mRMR-ICA for Cancer Classification from Microarray Gene Expression Data.用于从微阵列基因表达数据进行癌症分类的混合特征选择算法mRMR-ICA

Comb Chem High Throughput Screen. 2018;21(6):420-430. doi: 10.2174/1386207321666180601074349.

A novel gene selection algorithm for cancer classification using microarray datasets.一种使用微阵列数据集进行癌症分类的新基因选择算法。

BMC Med Genomics. 2019 Jan 15;12(1):10. doi: 10.1186/s12920-018-0447-6.

Development of a two-stage gene selection method that incorporates a novel hybrid approach using the cuckoo optimization algorithm and harmony search for cancer classification.一种两阶段基因选择方法的开发，该方法结合了一种使用布谷鸟优化算法和和声搜索的新型混合方法用于癌症分类。

J Biomed Inform. 2017 Mar;67:11-20. doi: 10.1016/j.jbi.2017.01.016. Epub 2017 Feb 3.

Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification.基于信息增益和支持向量机的混合方法在癌症分类基因选择中的应用

Genomics Proteomics Bioinformatics. 2017 Dec;15(6):389-395. doi: 10.1016/j.gpb.2017.08.002. Epub 2017 Dec 12.

An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.基于基因表达数据的多支持向量机技术的高效特征选择策略。

Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018.

Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification.遗传蜂群（GBC）算法：一种用于微阵列癌症分类的新基因选择方法。

Comput Biol Chem. 2015 Jun;56:49-60. doi: 10.1016/j.compbiolchem.2015.03.001. Epub 2015 Mar 18.

An efficient statistical feature selection approach for classification of gene expression data.一种用于基因表达数据分类的高效统计特征选择方法。

J Biomed Inform. 2011 Aug;44(4):529-35. doi: 10.1016/j.jbi.2011.01.001. Epub 2011 Jan 15.

C-HMOSHSSA: Gene selection for cancer classification using multi-objective meta-heuristic and machine learning methods.C-HMOSHSSA：使用多目标元启发式和机器学习方法进行癌症分类的基因选择。

Comput Methods Programs Biomed. 2019 Sep;178:219-235. doi: 10.1016/j.cmpb.2019.06.029. Epub 2019 Jun 29.

引用本文的文献

Identifying candidate biomarkers for detecting bronchogenic carcinoma stages using metaheuristic algorithms based on information fusion theory.基于信息融合理论，利用元启发式算法识别用于检测支气管源性癌分期的候选生物标志物。

Discov Oncol. 2025 Apr 29;16(1):632. doi: 10.1007/s12672-025-02395-5.

Transforming Cancer Classification: The Role of Advanced Gene Selection.转变癌症分类：先进基因选择的作用。

Diagnostics (Basel). 2024 Nov 22;14(23):2632. doi: 10.3390/diagnostics14232632.

The relationship between low-carbohydrate diet score, dietary macronutrient intake, and rheumatoid arthritis: results from NHANES 2011-2016.低碳水化合物饮食评分、膳食常量营养素摄入量与类风湿关节炎之间的关系：2011 - 2016年美国国家健康与营养检查调查结果

Clin Rheumatol. 2025 Jan;44(1):171-182. doi: 10.1007/s10067-024-07269-9. Epub 2024 Dec 16.

A novel and innovative cancer classification framework through a consecutive utilization of hybrid feature selection.一种新颖且具有创新性的癌症分类框架，通过连续利用混合特征选择实现。

BMC Bioinformatics. 2023 Dec 15;24(1):479. doi: 10.1186/s12859-023-05605-5.

Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis.狼疮性肾炎还是没有？一个简单且临床友好的机器学习管道，帮助诊断狼疮性肾炎。

Inflamm Res. 2023 Jun;72(6):1315-1324. doi: 10.1007/s00011-023-01755-7. Epub 2023 Jun 10.

A Highly Discriminative Hybrid Feature Selection Algorithm for Cancer Diagnosis.一种用于癌症诊断的高判别混合特征选择算法。

ScientificWorldJournal. 2022 Aug 9;2022:1056490. doi: 10.1155/2022/1056490. eCollection 2022.

Lung Cancer Stage Prediction Using Multi-Omics Data.基于多组学数据的肺癌分期预测。

Comput Math Methods Med. 2022 Jul 16;2022:2279044. doi: 10.1155/2022/2279044. eCollection 2022.

Cancer Detection and Prediction Using Genetic Algorithms.使用遗传算法进行癌症检测和预测。

Comput Intell Neurosci. 2022 May 16;2022:1871841. doi: 10.1155/2022/1871841. eCollection 2022.

Optimal Deep Learning Enabled Prostate Cancer Detection Using Microarray Gene Expression.基于基因表达微阵列的最优深度学习前列腺癌检测。

J Healthc Eng. 2022 Mar 10;2022:7364704. doi: 10.1155/2022/7364704. eCollection 2022.

Identification of Diagnostic Biomarkers Associated with Stromal and Immune Cell Infiltration in Fatty Infiltration After Rotator Cuff Tear by Integrating Bioinformatic Analysis and Machine-Learning.通过整合生物信息学分析和机器学习鉴定与肩袖撕裂后脂肪浸润中基质和免疫细胞浸润相关的诊断生物标志物

Int J Gen Med. 2022 Feb 19;15:1805-1819. doi: 10.2147/IJGM.S354741. eCollection 2022.

本文引用的文献

Prediction of lung cancer patient survival via supervised machine learning classification techniques.通过监督机器学习分类技术预测肺癌患者的生存情况。

Int J Med Inform. 2017 Dec;108:1-8. doi: 10.1016/j.ijmedinf.2017.09.013. Epub 2017 Sep 25.

Incorporating gene ontology into fuzzy relational clustering of microarray gene expression data.将基因本体论纳入微阵列基因表达数据的模糊关系聚类中。

Biosystems. 2018 Jan;163:1-10. doi: 10.1016/j.biosystems.2017.09.017. Epub 2017 Nov 4.

A Sequential Learning Approach for Scaling Up Filter-Based Feature Subset Selection.基于序贯学习的过滤式特征子集选择方法的扩展。

IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2530-2544. doi: 10.1109/TNNLS.2017.2697407. Epub 2017 May 11.

A feature selection method based on multiple kernel learning with expression profiles of different types.一种基于多内核学习和不同类型表达谱的特征选择方法。

BioData Min. 2017 Feb 2;10:4. doi: 10.1186/s13040-017-0124-x. eCollection 2017.

A Gene Selection Method for Microarray Data Based on Binary PSO Encoding Gene-to-Class Sensitivity Information.一种基于二进制粒子群优化编码基因到类敏感性信息的微阵列数据基因选择方法。

IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):85-96. doi: 10.1109/TCBB.2015.2465906.

Gene selection for microarray cancer classification using a new evolutionary method employing artificial intelligence concepts.使用一种采用人工智能概念的新进化方法进行微阵列癌症分类的基因选择。

Genomics. 2017 Mar;109(2):91-107. doi: 10.1016/j.ygeno.2017.01.004. Epub 2017 Feb 1.

Principal component analysis based unsupervised feature extraction applied to budding yeast temporally periodic gene expression.基于主成分分析的无监督特征提取应用于出芽酵母的时间周期性基因表达。

BioData Min. 2016 Jun 29;9:22. doi: 10.1186/s13040-016-0101-9. eCollection 2016.

Gene selection approach based on improved swarm intelligent optimisation algorithm for tumour classification.基于改进群体智能优化算法的肿瘤分类基因选择方法

IET Syst Biol. 2016 Jun;10(3):107-15. doi: 10.1049/iet-syb.2015.0064.

Detecting gene-gene interactions using a permutation-based random forest method.使用基于排列的随机森林方法检测基因-基因相互作用。

BioData Min. 2016 Apr 6;9:14. doi: 10.1186/s13040-016-0093-5. eCollection 2016.

Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification.用于高维癌症分类中基因选择的带调整自适应弹性网络的正则化逻辑回归

Comput Biol Med. 2015 Dec 1;67:136-45. doi: 10.1016/j.compbiomed.2015.10.008. Epub 2015 Oct 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于互信息和遗传算法的两阶段混合基因选择在癌症数据分类中的应用。

Two-Stage Hybrid Gene Selection Using Mutual Information and Genetic Algorithm for Cancer Data Classification.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献