
Improved binary PSO for feature selection using gene expression data.

Author information

Chuang Li-Yeh, Chang Hsueh-Wei, Tu Chung-Jui, Yang Cheng-Hong

Affiliations

Department of Chemical Engineering, I-Shou University, Kaohsiung 840, Taiwan.

Publication information

Comput Biol Chem. 2008 Feb;32(1):29-37. doi: 10.1016/j.compbiolchem.2007.09.005. Epub 2007 Sep 25.

DOI: 10.1016/j.compbiolchem.2007.09.005
PMID: 18023261
Abstract

Gene expression profiles, which represent the state of a cell at a molecular level, have great potential as a medical diagnosis tool. In cancer type classification, available training data sets generally have a fairly small sample size compared to the number of genes involved. These training data limitations constitute a challenge to certain classification methodologies. A reliable method for selecting genes relevant to sample classification is needed in order to speed up processing, decrease the predictive error rate, and avoid the incomprehensibility that comes with the large number of genes investigated. In this study, improved binary particle swarm optimization (IBPSO) is used to implement feature selection, and the K-nearest neighbor (K-NN) method serves as the evaluator of the IBPSO for gene expression data classification problems. Experimental results show that this method effectively simplifies feature selection and reduces the total number of features needed. Compared with the best results previously published, the proposed method achieves the highest classification accuracy in nine of the 11 gene expression data test problems and comparable accuracy in the other two.
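
The wrapper idea in the abstract above (a binary swarm proposes feature subsets, a nearest-neighbor classifier scores them) can be illustrated with a minimal sketch. The abstract does not say what the "improved" step of IBPSO is, so the code below is a plain binary PSO with the standard sigmoid transfer function, not the authors' method. The 1-NN leave-one-out fitness, the toy data generator, and all parameter values (`n_particles`, `w`, `c1`, `c2`, `v_max`) are assumptions chosen for illustration.

```python
import math
import random


def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out accuracy of a 1-nearest-neighbor classifier
    restricted to the features whose mask bit is 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:  # an empty feature subset cannot classify anything
        return 0.0
    correct = 0
    for i in range(len(X)):
        best_d, best_label = float("inf"), None
        for k in range(len(X)):
            if k == i:
                continue
            d = sum((X[i][j] - X[k][j]) ** 2 for j in cols)
            if d < best_d:
                best_d, best_label = d, y[k]
        correct += best_label == y[i]
    return correct / len(X)


def binary_pso(X, y, n_particles=10, n_iter=30, w=0.9, c1=2.0, c2=2.0,
               v_max=6.0, seed=0):
    """Plain binary PSO (assumed baseline, not the paper's IBPSO).
    Each particle is a 0/1 mask over features; fitness is LOO 1-NN accuracy."""
    rng = random.Random(seed)
    n_feat = len(X[0])
    pos = [[rng.randint(0, 1) for _ in range(n_feat)] for _ in range(n_particles)]
    vel = [[0.0] * n_feat for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [loo_1nn_accuracy(X, y, p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iter):
        for i in range(n_particles):
            for j in range(n_feat):
                v = (w * vel[i][j]
                     + c1 * rng.random() * (pbest[i][j] - pos[i][j])
                     + c2 * rng.random() * (gbest[j] - pos[i][j]))
                v = max(-v_max, min(v_max, v))  # clamp the velocity
                vel[i][j] = v
                # The sigmoid of the velocity is the probability of bit = 1.
                pos[i][j] = 1 if rng.random() < 1.0 / (1.0 + math.exp(-v)) else 0
            fit = loo_1nn_accuracy(X, y, pos[i])
            if fit > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], fit
                if fit > gbest_fit:
                    gbest, gbest_fit = pos[i][:], fit
    return gbest, gbest_fit


def make_toy_data(n_samples=20, n_feat=8, seed=1):
    """Hypothetical stand-in for expression data: only feature 0 carries signal."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_feat)]
        row[0] = label * 10.0 + rng.gauss(0.0, 0.1)
        X.append(row)
        y.append(label)
    return X, y


X, y = make_toy_data()
mask, acc = binary_pso(X, y)
print("selected features:", [j for j, bit in enumerate(mask) if bit])
print("LOO 1-NN accuracy:", acc)
```

Real microarray data has thousands of genes and only tens of samples, so a practical run would need a faster distance computation and a held-out evaluation. Scoring subsets on the same samples used to select them tends to overstate accuracy.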


Similar articles

1. Improved binary PSO for feature selection using gene expression data.
Comput Biol Chem. 2008 Feb;32(1):29-37. doi: 10.1016/j.compbiolchem.2007.09.005. Epub 2007 Sep 25.
2. A hybrid feature selection method for DNA microarray data.
Comput Biol Med. 2011 Apr;41(4):228-37. doi: 10.1016/j.compbiomed.2011.02.004. Epub 2011 Mar 3.
3. Tabu search and binary particle swarm optimization for feature selection using microarray data.
J Comput Biol. 2009 Dec;16(12):1689-703. doi: 10.1089/cmb.2007.0211.
4. What should be expected from feature selection in small-sample settings.
Bioinformatics. 2006 Oct 1;22(19):2430-6. doi: 10.1093/bioinformatics/btl407. Epub 2006 Jul 26.
5. Tumor classification ranking from microarray data.
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.
6. Optimal number of features as a function of sample size for various classification rules.
Bioinformatics. 2005 Apr 15;21(8):1509-15. doi: 10.1093/bioinformatics/bti171. Epub 2004 Nov 30.
7. A novel feature selection approach for biomedical data classification.
J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30.
8. Reporting bias when using real data sets to analyze classification performance.
Bioinformatics. 2010 Jan 1;26(1):68-76. doi: 10.1093/bioinformatics/btp605. Epub 2009 Oct 21.
9. Gene selection from microarray data for cancer classification--a machine learning approach.
Comput Biol Chem. 2005 Feb;29(1):37-46. doi: 10.1016/j.compbiolchem.2004.11.001.
10. Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm.
Comput Biol Chem. 2010 Aug;34(4):244-50. doi: 10.1016/j.compbiolchem.2010.08.003. Epub 2010 Sep 9.

Cited by

1. A new feature selection approach with binary exponential henry gas solubility optimization and hybrid data transformation methods.
MethodsX. 2024 May 20;12:102770. doi: 10.1016/j.mex.2024.102770. eCollection 2024 Jun.
2. A novel framework of MOPSO-GDM in recognition of Alzheimer's EEG-based functional network.
Front Aging Neurosci. 2023 Jun 29;15:1160534. doi: 10.3389/fnagi.2023.1160534. eCollection 2023.
3. Hierarchical Harris hawks optimizer for feature selection.
J Adv Res. 2023 Nov;53:261-278. doi: 10.1016/j.jare.2023.01.014. Epub 2023 Jan 20.
4. Identification of Novel Biomarkers for Response to Preoperative Chemoradiation in Locally Advanced Rectal Cancer with Genetic Algorithm-Based Gene Selection.
J Gastrointest Cancer. 2023 Sep;54(3):937-950. doi: 10.1007/s12029-022-00873-5. Epub 2022 Dec 19.
5. Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems.
PLoS One. 2022 Oct 6;17(10):e0274850. doi: 10.1371/journal.pone.0274850. eCollection 2022.
6. Improved Binary Grasshopper Optimization Algorithm for Feature Selection Problem.
Entropy (Basel). 2022 May 31;24(6):777. doi: 10.3390/e24060777.
7. Feature Selection in High Dimensional Biomedical Data Based on BF-SFLA.
Front Neurosci. 2022 Apr 18;16:854685. doi: 10.3389/fnins.2022.854685. eCollection 2022.
8. A hybrid feature selection algorithm and its application in bioinformatics.
PeerJ Comput Sci. 2022 Mar 22;8:e933. doi: 10.7717/peerj-cs.933. eCollection 2022.
9. Application of PSO-based LSTM Neural Network for Outpatient Volume Prediction.
J Healthc Eng. 2021 Nov 26;2021:7246561. doi: 10.1155/2021/7246561. eCollection 2021.
10. A graph-based gene selection method for medical diagnosis problems using a many-objective PSO algorithm.
BMC Med Inform Decis Mak. 2021 Nov 27;21(1):333. doi: 10.1186/s12911-021-01696-3.