An Integrated Feature Selection Algorithm for Cancer Classification using Gene Expression Data.

Authors

Ahmed Saeed, Kabir Muhammad, Ali Zakir, Arif Muhammad, Ali Farman, Yu Dong-Jun

Affiliation

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Publication information

Comb Chem High Throughput Screen. 2018;21(9):631-645. doi: 10.2174/1386207322666181220124756.

DOI:10.2174/1386207322666181220124756
PMID:30569852
Abstract

AIM AND OBJECTIVE

Cancer is a dangerous disease worldwide, caused by somatic mutations in the genome. Diagnosing this deadly disease at an early stage is a new clinical application of microarray data. In DNA microarray technology, gene expression data are high-dimensional with a small sample size. It is therefore indispensable to develop efficient and robust feature selection methods that identify a small set of genes and achieve better classification performance.

MATERIALS AND METHODS

In this study, we developed a hybrid feature selection method that integrates correlation-based feature selection (CFS) with a Multi-Objective Evolutionary Algorithm (MOEA) to select highly informative genes. The hybrid model, paired with a radial basis function neural network (RBFNN) classifier, was evaluated on 11 benchmark gene expression datasets using 10-fold cross-validation.
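To make the CFS step concrete, here is a minimal sketch of the standard CFS merit score (Hall's formulation) combined with a greedy forward search. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say which correlation measure or search strategy was used, so Pearson correlation, the greedy search, and the helper names `cfs_merit` and `greedy_cfs` are all hypothetical choices. The MOEA and RBFNN stages are not shown.

```python
import numpy as np


def cfs_merit(X, y, subset):
    """CFS merit of the feature indices in `subset`.

    merit = k * r_cf / sqrt(k + k * (k - 1) * r_ff), where r_cf is the mean
    absolute feature-class correlation and r_ff is the mean absolute
    feature-feature correlation inside the subset.
    Assumption: Pearson correlation is used as the correlation measure.
    """
    k = len(subset)
    if k == 0:
        return 0.0
    Xs = X[:, subset]
    # Mean absolute correlation between each selected feature and the class.
    r_cf = np.mean([abs(np.corrcoef(Xs[:, j], y)[0, 1]) for j in range(k)])
    if k == 1:
        return float(r_cf)
    # Mean absolute pairwise correlation among selected features
    # (the k diagonal entries, all equal to 1, are excluded).
    corr = np.abs(np.corrcoef(Xs, rowvar=False))
    r_ff = (corr.sum() - k) / (k * (k - 1))
    return float(k * r_cf / np.sqrt(k + k * (k - 1) * r_ff))


def greedy_cfs(X, y, max_features=10):
    """Greedy forward search: add the feature that most improves the merit."""
    selected, best = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining and len(selected) < max_features:
        scores = [cfs_merit(X, y, selected + [j]) for j in remaining]
        i = int(np.argmax(scores))
        if scores[i] <= best:
            break  # no candidate improves the merit any further
        best = scores[i]
        selected.append(remaining.pop(i))
    return selected, best


# Synthetic toy data (not from the paper): 40 samples, 200 "genes",
# of which genes 3 and 7 are made informative about the class label.
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 20)
X = rng.normal(size=(40, 200))
X[:, 3] += 3.0 * y
X[:, 7] += 3.0 * y

genes, merit = greedy_cfs(X, y)
print("selected genes:", genes, "merit:", round(merit, 3))
```

The merit rewards subsets whose features correlate with the class but not with each other, which is why CFS suits data with many redundant genes and few samples.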

RESULTS

The experimental results were compared with seven conventional feature selection methods and with other methods from the literature. The comparison shows that our approach has clear advantages in both classification accuracy and the number of genes selected.

CONCLUSION

Our proposed CFS-MOEA algorithm attained up to 100% classification accuracy on six of the eleven datasets with a minimal predictive gene subset.
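The trade-off between accuracy and gene-subset size can be illustrated with a small Pareto-front filter. This is only a sketch: the abstract does not state which objectives the MOEA optimizes, so minimizing classification error and minimizing the number of genes are assumed here, and `pareto_front` is a hypothetical helper. The candidate values are invented for illustration and are not results from the paper.

```python
def pareto_front(candidates):
    """Keep candidates not dominated on (error, n_genes); both are minimized.

    A candidate is dominated if another one is at least as good on both
    objectives and strictly better on at least one.
    """
    front = []
    for i, (err_i, n_i) in enumerate(candidates):
        dominated = any(
            (err_j <= err_i and n_j <= n_i) and (err_j < err_i or n_j < n_i)
            for j, (err_j, n_j) in enumerate(candidates)
            if j != i
        )
        if not dominated:
            front.append((err_i, n_i))
    return front


# Toy (error rate, number of genes) pairs, invented for illustration only.
candidates = [(0.10, 5), (0.05, 12), (0.05, 20), (0.00, 30), (0.12, 6)]
front = pareto_front(candidates)
print(front)
```

A multi-objective search returns a set of such non-dominated subsets rather than a single answer, leaving the final choice between fewer genes and lower error to the user.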

Cited by

1. A gene selection algorithm for microarray cancer classification using an improved particle swarm optimization. Sci Rep. 2024 Aug 23;14(1):19613. doi: 10.1038/s41598-024-68744-6.
2. A Computational Predictor for Accurate Identification of Tumor Homing Peptides by Integrating Sequential and Deep BiLSTM Features. Interdiscip Sci. 2024 Jun;16(2):503-518. doi: 10.1007/s12539-024-00628-9. Epub 2024 May 11.
3. Refining breast cancer biomarker discovery and drug targeting through an advanced data-driven approach. BMC Bioinformatics. 2024 Jan 22;25(1):33. doi: 10.1186/s12859-024-05657-1.
4. iMRSAPred: Improved Prediction of Anti-MRSA Peptides Using Physicochemical and Pairwise Contact-Energy Properties of Amino Acids. ACS Omega. 2024 Jan 3;9(2):2874-2883. doi: 10.1021/acsomega.3c08303. eCollection 2024 Jan 16.
5. Prediction of antifreeze proteins using machine learning. Sci Rep. 2022 Nov 30;12(1):20672. doi: 10.1038/s41598-022-24501-1.
6. DBP-iDWT: Improving DNA-Binding Proteins Prediction Using Multi-Perspective Evolutionary Profile and Discrete Wavelet Transform. Comput Intell Neurosci. 2022 Sep 28;2022:2987407. doi: 10.1155/2022/2987407. eCollection 2022.
7. DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information. J Comput Aided Mol Des. 2019 Jul;33(7):645-658. doi: 10.1007/s10822-019-00207-x. Epub 2019 May 23.