A novel gene selection algorithm for cancer classification using microarray datasets.

Affiliation

School of Information Technology, Deakin University, Burwood, 3125, VIC, Australia.

Publication information

BMC Med Genomics. 2019 Jan 15;12(1):10. doi: 10.1186/s12920-018-0447-6.

DOI:10.1186/s12920-018-0447-6
PMID:30646919
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6334429/
Abstract

BACKGROUND

Microarray datasets are an important medical diagnostic tool because they represent the state of a cell at the molecular level. Available microarray datasets for classifying cancer types generally have a small sample size compared with the large number of genes involved. This imbalance is known as the curse of dimensionality and makes classification challenging. Gene selection is a promising way to address the problem and plays an important role in efficient cancer classification, because only a small number of genes are related to the classification task. It removes irrelevant and noisy genes and keeps the most relevant ones, which improves classification results.

METHODS

An innovative Gene Selection Programming (GSP) method is proposed to select relevant genes for effective and efficient cancer classification. GSP is based on the Gene Expression Programming (GEP) method, with a newly defined population initialization algorithm, a new fitness function definition, and improved mutation and recombination operators. A Support Vector Machine (SVM) with a linear kernel serves as the classifier for GSP.
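The general idea of wrapper-style evolutionary gene selection can be illustrated with a minimal sketch. This is not GSP or GEP: the abstract does not give the fitness formula, the initialization algorithm, or the operators, so the code below swaps in a plain bitmask genetic search, a nearest-centroid classifier in place of the linear-kernel SVM, and a made-up fitness weighting. The names `nearest_centroid_accuracy`, `fitness`, `evolve`, the `size_weight` parameter, and the toy data are all hypothetical.

```python
import random


def nearest_centroid_accuracy(X, y, genes):
    """Leave-one-out accuracy of a nearest-centroid classifier on the chosen genes.

    Assumption: this is only a stand-in for the linear-kernel SVM in the abstract.
    """
    if not genes:
        return 0.0
    correct = 0
    for i in range(len(X)):
        # Build per-class sums from every sample except the held-out sample i.
        sums, counts = {}, {}
        for j, (row, label) in enumerate(zip(X, y)):
            if j == i:
                continue
            acc = sums.setdefault(label, [0.0] * len(genes))
            for k, g in enumerate(genes):
                acc[k] += row[g]
            counts[label] = counts.get(label, 0) + 1
        # Predict the class whose centroid is closest in squared distance.
        best_label, best_dist = None, float("inf")
        for label, acc in sums.items():
            dist = sum((X[i][g] - acc[k] / counts[label]) ** 2
                       for k, g in enumerate(genes))
            if dist < best_dist:
                best_label, best_dist = label, dist
        correct += best_label == y[i]
    return correct / len(X)


def fitness(mask, X, y, size_weight=0.1):
    """Accuracy plus a small bonus for selecting fewer genes (hypothetical weighting)."""
    genes = [g for g, keep in enumerate(mask) if keep]
    sparsity = 1.0 - len(genes) / len(mask)
    return nearest_centroid_accuracy(X, y, genes) + size_weight * sparsity


def evolve(X, y, pop_size=20, generations=30, mutation_rate=0.05, seed=0):
    """Search over gene subsets, each encoded as a list of booleans."""
    rng = random.Random(seed)
    n_genes = len(X[0])
    population = [[rng.random() < 0.5 for _ in range(n_genes)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=lambda m: fitness(m, X, y), reverse=True)
        parents = ranked[: pop_size // 2]
        children = [ranked[0][:]]  # elitism: keep the best mask unchanged
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_genes)  # one-point recombination
            child = a[:cut] + b[cut:]
            # Bit-flip mutation: each position flips with probability mutation_rate.
            child = [bit != (rng.random() < mutation_rate) for bit in child]
            children.append(child)
        population = children
    best = max(population, key=lambda m: fitness(m, X, y))
    return [g for g, keep in enumerate(best) if keep]


# Toy data (invented): gene 0 separates the two classes, genes 1-4 carry no class signal.
X = [
    [0.0, 1, 4, 0, 7], [0.1, 2, 3, 9, 7], [0.2, 3, 2, 0, 7], [0.1, 4, 1, 9, 7],
    [5.0, 1, 4, 0, 7], [5.1, 2, 3, 9, 7], [4.9, 3, 2, 0, 7], [5.2, 4, 1, 9, 7],
]
y = [0, 0, 0, 0, 1, 1, 1, 1]

selected = evolve(X, y)
print(selected)
```

The fitness here combines two of the three criteria named in the abstract, classification accuracy and number of selected genes; computational cost is not modeled. On real microarray data the classifier, the cross-validation scheme, and the weighting would all need to be chosen with care.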

RESULTS

Experimental results on ten microarray cancer datasets demonstrate that GSP is effective and efficient in eliminating irrelevant and redundant genes/features from microarray datasets. Comprehensive evaluations and comparisons with other methods show that GSP gives a better compromise across all three evaluation criteria: classification accuracy, number of selected genes, and computational cost. In cancer classification, the gene set selected by GSP outperforms those selected by up-to-date representative gene selection methods.

CONCLUSION

The gene subset selected by GSP can achieve higher classification accuracy with less processing time.

Similar articles

1
A novel gene selection algorithm for cancer classification using microarray datasets.
BMC Med Genomics. 2019 Jan 15;12(1):10. doi: 10.1186/s12920-018-0447-6.
2
Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification.
Genomics Proteomics Bioinformatics. 2017 Dec;15(6):389-395. doi: 10.1016/j.gpb.2017.08.002. Epub 2017 Dec 12.
3
Hybrid Feature Selection Algorithm mRMR-ICA for Cancer Classification from Microarray Gene Expression Data.
Comb Chem High Throughput Screen. 2018;21(6):420-430. doi: 10.2174/1386207321666180601074349.
4
Improving accuracy for cancer classification with a new algorithm for genes selection.
BMC Bioinformatics. 2012 Nov 13;13:298. doi: 10.1186/1471-2105-13-298.
5
A centroid-based gene selection method for microarray data classification.
J Theor Biol. 2016 Jul 7;400:32-41. doi: 10.1016/j.jtbi.2016.03.034. Epub 2016 Apr 4.
6
Tuning parameter estimation in SCAD-support vector machine using firefly algorithm with application in gene selection and cancer classification.
Comput Biol Med. 2018 Dec 1;103:262-268. doi: 10.1016/j.compbiomed.2018.10.034. Epub 2018 Oct 31.
7
Deep gene selection method to select genes from microarray datasets for cancer classification.
BMC Bioinformatics. 2019 Nov 27;20(1):608. doi: 10.1186/s12859-019-3161-2.
8
Gene Correlation Guided Gene Selection for Microarray Data Classification.
Biomed Res Int. 2021 Aug 14;2021:6490118. doi: 10.1155/2021/6490118. eCollection 2021.
9
Feature selection and tumor classification for microarray data using relaxed Lasso and generalized multi-class support vector machine.
J Theor Biol. 2019 Feb 21;463:77-91. doi: 10.1016/j.jtbi.2018.12.010. Epub 2018 Dec 8.
10
Multiclass cancer classification by support vector machines with class-wise optimized genes and probability estimates.
J Theor Biol. 2009 Aug 7;259(3):533-40. doi: 10.1016/j.jtbi.2009.04.013. Epub 2009 May 3.

Cited by

1
Navigating the microarray landscape: a comprehensive review of feature selection techniques and their applications.
Front Big Data. 2025 Jul 10;8:1624507. doi: 10.3389/fdata.2025.1624507. eCollection 2025.
2
Evaluating the Nuclear Reaction Optimization (NRO) Algorithm for Gene Selection in Cancer Classification.
Diagnostics (Basel). 2025 Apr 3;15(7):927. doi: 10.3390/diagnostics15070927.
3
Prediction of in-hospital mortality risk for patients with acute ST-elevation myocardial infarction after primary PCI based on predictors selected by GRACE score and two feature selection methods.
Front Cardiovasc Med. 2024 Oct 22;11:1419551. doi: 10.3389/fcvm.2024.1419551. eCollection 2024.
4
Machine learning for pan-cancer classification based on RNA sequencing data.
Front Mol Biosci. 2023 Nov 10;10:1285795. doi: 10.3389/fmolb.2023.1285795. eCollection 2023.
5
FSF-GA: A Feature Selection Framework for Phenotype Prediction Using Genetic Algorithms.
Genes (Basel). 2023 May 9;14(5):1059. doi: 10.3390/genes14051059.
6
Identification of Potential Biomarkers for Group I Pulmonary Hypertension Based on Machine Learning and Bioinformatics Analysis.
Int J Mol Sci. 2023 Apr 28;24(9):8050. doi: 10.3390/ijms24098050.
7
A Novel Hybrid Runge Kutta Optimizer with Support Vector Machine on Gene Expression Data for Cancer Classification.
Diagnostics (Basel). 2023 May 3;13(9):1621. doi: 10.3390/diagnostics13091621.
8
An Overview: Genetic Tumor Markers for Early Detection and Current Gene Therapy Strategies.
Cancer Inform. 2023 Feb 1;22:11769351221150772. doi: 10.1177/11769351221150772. eCollection 2023.
9
An Enhanced Hyper-Parameter Optimization of a Convolutional Neural Network Model for Leukemia Cancer Diagnosis in a Smart Healthcare System.
Sensors (Basel). 2022 Dec 10;22(24):9689. doi: 10.3390/s22249689.
10
Quantitative Detection of Gastrointestinal Tumor Markers Using a Machine Learning Algorithm and Multicolor Quantum Dot Biosensor.
Comput Intell Neurosci. 2022 Sep 1;2022:9022821. doi: 10.1155/2022/9022821. eCollection 2022.

References

1
Prediction of NSCLC recurrence from microarray data with GEP.
IET Syst Biol. 2017 Jun;11(3):77-85. doi: 10.1049/iet-syb.2016.0033.
2
Lung cancer prediction from microarray data by gene expression programming.
IET Syst Biol. 2016 Oct;10(5):168-178. doi: 10.1049/iet-syb.2015.0082.
3
Comparison among dimensionality reduction techniques based on Random Projection for cancer classification.
Comput Biol Chem. 2016 Dec;65:165-172. doi: 10.1016/j.compbiolchem.2016.09.010. Epub 2016 Sep 21.
4
Gene selection for cancer classification with the help of bees.
BMC Med Genomics. 2016 Aug 10;9 Suppl 2(Suppl 2):47. doi: 10.1186/s12920-016-0204-7.
5
A Highly Efficient Gene Expression Programming (GEP) Model for Auxiliary Diagnosis of Small Cell Lung Cancer.
PLoS One. 2015 May 21;10(5):e0125517. doi: 10.1371/journal.pone.0125517. eCollection 2015.
6
mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.
Biomed Res Int. 2015;2015:604910. doi: 10.1155/2015/604910. Epub 2015 Apr 15.
7
Identifying transcriptional cis-regulatory modules in animal genomes.
Wiley Interdiscip Rev Dev Biol. 2015 Mar-Apr;4(2):59-84. doi: 10.1002/wdev.168. Epub 2014 Dec 29.
8
Prediction of lung cancer based on serum biomarkers by gene expression programming methods.
Asian Pac J Cancer Prev. 2014;15(21):9367-73. doi: 10.7314/apjcp.2014.15.21.9367.
9
Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm.
BMC Bioinformatics. 2014 Feb 20;15:49. doi: 10.1186/1471-2105-15-49.
10
Application of gene expression programming and neural networks to predict adverse events of radical hysterectomy in cervical cancer patients.
Med Biol Eng Comput. 2013 Dec;51(12):1357-65. doi: 10.1007/s11517-013-1108-8. Epub 2013 Oct 18.