Multi-class BCGA-ELM based classifier that identifies biomarkers associated with hallmarks of cancer.

Author information

Sachnev Vasily, Saraswathi Saras, Niaz Rashid, Kloczkowski Andrzej, Suresh Sundaram

Affiliations

Department of Information, Communication and Electronics Engineering, Catholic University of Korea, Bucheon, Republic of Korea.

Battelle Center for Mathematical Medicine at The Research Institute at Nationwide Children's Hospital; currently at Sidra, Medical and Research Center, Doha, Qatar.

Publication information

BMC Bioinformatics. 2015 May 20;16:166. doi: 10.1186/s12859-015-0565-5.

DOI:10.1186/s12859-015-0565-5

PMID:25986937

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4448565/

Abstract

BACKGROUND

Traditional cancer treatments have centered on cytotoxic drugs and general-purpose chemotherapy that may not be tailored to specific cancers. Identifying molecular markers that are related to different types of cancer might lead to the discovery of drugs that are patient- and disease-specific. This study uses microarray gene expression cancer data to identify biomarkers that are indicative of different types of cancer. Our aim is to provide a multi-class cancer classifier that can simultaneously differentiate between cancers and identify type-specific biomarkers, through the application of the Binary Coded Genetic Algorithm (BCGA) and a neural-network-based Extreme Learning Machine (ELM) algorithm.
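
For readers unfamiliar with ELM, the following is a minimal sketch of the general technique, not the authors' implementation. An ELM is a single-hidden-layer network whose input weights are drawn at random and left untrained; only the output weights are computed, in closed form, by least squares. The class name `ELM`, the tanh activation, the hidden-layer size, and the seed are all assumptions chosen for illustration.

```python
import numpy as np


class ELM:
    """Illustrative single-hidden-layer Extreme Learning Machine."""

    def __init__(self, n_hidden=20, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Hidden-layer activations for the fixed random projection.
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        # y is expected to hold integer class labels 0..K-1.
        n_classes = int(y.max()) + 1
        # Input weights and biases are random and never trained.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        T = np.eye(n_classes)[y]  # one-hot targets
        # Output weights: least-squares solution via the pseudoinverse.
        self.beta = np.linalg.pinv(H) @ T
        return self

    def predict(self, X):
        # The class with the largest output score wins.
        return np.argmax(self._hidden(X) @ self.beta, axis=1)
```

Because training reduces to one pseudoinverse, fitting is cheap enough to be repeated many times inside a search over gene subsets.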

RESULTS

BCGA and ELM are combined and used to select a subset of genes from the Global Cancer Mapping (GCM) data set. According to the literature, this set of candidate genes contains over 52 biomarkers that are related to multiple cancers. They include APOA1, VEGFC, YWHAZ, B2M, EIF2S1, CCR9 and many other genes that have been associated with the hallmarks of cancer. BCGA-ELM is tested on several cancer data sets and the results are compared to other classification methods. BCGA-ELM matches or exceeds the other algorithms in accuracy. We were also able to show that over 50% of the genes selected by BCGA-ELM on the GCM data are cancer-related biomarkers.
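
One way to picture the combination is as a wrapper search: each chromosome is a binary mask over genes, and its fitness is the accuracy of an ELM trained on the selected genes only. The sketch below is a generic binary-coded GA written under that assumption. The abstract does not give the operators or the fitness definition, so the function names `elm_accuracy` and `bcga_select`, the tournament selection, single-point crossover, bit-flip mutation, elitism, hold-out validation accuracy as fitness, and all parameter values are hypothetical.

```python
import numpy as np


def elm_accuracy(X_tr, y_tr, X_va, y_va, n_hidden=15, seed=0):
    """Train a small ELM on (X_tr, y_tr); return accuracy on (X_va, y_va)."""
    rng = np.random.default_rng(seed)  # fixed seed: fitness is deterministic
    W = rng.normal(size=(X_tr.shape[1], n_hidden))
    b = rng.normal(size=n_hidden)
    T = np.eye(int(y_tr.max()) + 1)[y_tr]  # one-hot targets
    beta = np.linalg.pinv(np.tanh(X_tr @ W + b)) @ T
    pred = np.argmax(np.tanh(X_va @ W + b) @ beta, axis=1)
    return float(np.mean(pred == y_va))


def bcga_select(X_tr, y_tr, X_va, y_va, pop_size=20, n_gen=15,
                p_mut=0.05, seed=0):
    """Search for a boolean gene mask that maximises ELM validation accuracy."""
    rng = np.random.default_rng(seed)
    n_genes = X_tr.shape[1]
    pop = rng.random((pop_size, n_genes)) < 0.5  # random binary chromosomes

    def fitness(mask):
        if not mask.any():  # an empty gene subset cannot classify
            return 0.0
        return elm_accuracy(X_tr[:, mask], y_tr, X_va[:, mask], y_va)

    best_mask, best_fit = None, -1.0
    for _ in range(n_gen):
        fits = np.array([fitness(m) for m in pop])
        i = int(fits.argmax())
        if fits[i] > best_fit:
            best_fit, best_mask = fits[i], pop[i].copy()
        # Binary tournament selection.
        a, b = rng.integers(pop_size, size=(2, pop_size))
        parents = np.where((fits[a] >= fits[b])[:, None], pop[a], pop[b])
        # Single-point crossover between consecutive parents.
        children = parents.copy()
        for j in range(0, pop_size - 1, 2):
            cut = rng.integers(1, n_genes)
            children[j, cut:] = parents[j + 1, cut:]
            children[j + 1, cut:] = parents[j, cut:]
        # Bit-flip mutation, then elitism: keep the best mask seen so far.
        pop = children ^ (rng.random(children.shape) < p_mut)
        pop[0] = best_mask
    return best_mask, float(best_fit)
```

A call such as `mask, acc = bcga_select(X_train, y_train, X_val, y_val)` returns the best mask found and its validation accuracy; `mask.sum()` is then the number of selected genes. In practice a cross-validated fitness would be less prone to overfitting the validation split than this single hold-out.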

CONCLUSIONS

We were able to simultaneously differentiate between 14 different types of cancers, using only 92 genes, to achieve a multi-class classification accuracy of 95.4% which is between 21.6% and 38% higher than other results in the literature for multi-class cancer classification. Our findings suggest that computational algorithms such as BCGA-ELM can facilitate biomarker-driven integrated cancer research that can lead to a detailed understanding of the complexities of cancer.
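
The abstract does not say whether "higher" means percentage points or a relative improvement. As a rough reading aid only, the earlier accuracies implied by each interpretation work out as follows; these are derived from the stated figures, not reported values.

```latex
% Interpretation 1 (assumed): differences in percentage points
95.4 - 38.0 = 57.4, \qquad 95.4 - 21.6 = 73.8
% Interpretation 2 (assumed): relative improvement
\frac{95.4}{1.38} \approx 69.1, \qquad \frac{95.4}{1.216} \approx 78.5
```

Under the first reading the earlier results would lie between about 57.4% and 73.8%; under the second, between about 69.1% and 78.5%.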

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ea/4448565/a6d810f357a7/12859_2015_565_Fig1_HTML.jpg

Similar articles

Multi-class BCGA-ELM based classifier that identifies biomarkers associated with hallmarks of cancer.

BMC Bioinformatics. 2015 May 20;16:166. doi: 10.1186/s12859-015-0565-5.

Multi-category classification using an Extreme Learning Machine for microarray gene expression cancer diagnosis.

IEEE/ACM Trans Comput Biol Bioinform. 2007 Jul-Sep;4(3):485-495. doi: 10.1109/tcbb.2007.1012.

ICGA-PSO-ELM approach for accurate multiclass cancer classification resulting in reduced gene sets in which genes encoding secreted proteins are highly represented.

IEEE/ACM Trans Comput Biol Bioinform. 2011 Mar-Apr;8(2):452-63. doi: 10.1109/TCBB.2010.13.

BMC Bioinformatics. 2007 Jun 16;8:206. doi: 10.1186/1471-2105-8-206.

TSG: a new algorithm for binary and multi-class cancer classification and informative genes selection.

BMC Med Genomics. 2013;6 Suppl 1(Suppl 1):S3. doi: 10.1186/1755-8794-6-S1-S3. Epub 2013 Jan 23.

Multiclass cancer classification and biomarker discovery using GA-based algorithms.

Bioinformatics. 2005 Jun 1;21(11):2691-7. doi: 10.1093/bioinformatics/bti419. Epub 2005 Apr 6.

Prediction of cancer class with majority voting genetic programming classifier using gene expression data.

IEEE/ACM Trans Comput Biol Bioinform. 2009 Apr-Jun;6(2):353-67. doi: 10.1109/TCBB.2007.70245.

Tumor classification ranking from microarray data.

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.

Accurate cancer classification using expressions of very few genes.

IEEE/ACM Trans Comput Biol Bioinform. 2007 Jan-Mar;4(1):40-53. doi: 10.1109/TCBB.2007.1006.

An Integrated Feature Selection Algorithm for Cancer Classification using Gene Expression Data.

Comb Chem High Throughput Screen. 2018;21(9):631-645. doi: 10.2174/1386207322666181220124756.

Cited by

Development and Validation of the Predictive Model for Esophageal Squamous Cell Carcinoma Differentiation Degree.

Front Genet. 2020 Oct 23;11:595638. doi: 10.3389/fgene.2020.595638. eCollection 2020.

Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles.

Biomed Res Int. 2016;2016:4596326. doi: 10.1155/2016/4596326. Epub 2016 Nov 24.

