使用概率模型构建遗传算法进行癌症分类的基因选择

Gene selection for classification of cancers using probabilistic model building genetic algorithm.

作者信息

Paul Topon Kumar, Iba Hitoshi

机构信息

Department of Frontier Informatics, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba 277-8561, Japan.

出版信息

Biosystems. 2005 Dec;82(3):208-25. doi: 10.1016/j.biosystems.2005.07.003. Epub 2005 Aug 22.

DOI:10.1016/j.biosystems.2005.07.003

PMID:16112804

Abstract

Recently, DNA microarray-based gene expression profiles have been used to correlate the clinical behavior of cancers with the differential gene expression levels in cancerous and normal tissues. To this end, after selection of some predictive genes based on signal-to-noise (S2N) ratio, unsupervised learning like clustering and supervised learning like k-nearest neighbor (k NN) classifier are widely used. Instead of S2N ratio, adaptive searches like Probabilistic Model Building Genetic Algorithm (PMBGA) can be applied for selection of a smaller size gene subset that would classify patient samples more accurately. In this paper, we propose a new PMBGA-based method for identification of informative genes from microarray data. By applying our proposed method to classification of three microarray data sets of binary and multi-type tumors, we demonstrate that the gene subsets selected with our technique yield better classification accuracy.

摘要

最近，基于DNA微阵列的基因表达谱已被用于将癌症的临床行为与癌组织和正常组织中的差异基因表达水平相关联。为此，在基于信噪比（S2N）选择一些预测基因后，无监督学习（如聚类）和监督学习（如k近邻（k NN）分类器）被广泛使用。代替信噪比，像概率模型构建遗传算法（PMBGA）这样的自适应搜索可用于选择较小规模的基因子集，该子集能更准确地对患者样本进行分类。在本文中，我们提出了一种基于PMBGA的新方法，用于从微阵列数据中识别信息基因。通过将我们提出的方法应用于二元和多类型肿瘤的三个微阵列数据集的分类，我们证明了用我们的技术选择的基因子集产生了更好的分类准确性。

相似文献

Gene selection for classification of cancers using probabilistic model building genetic algorithm.使用概率模型构建遗传算法进行癌症分类的基因选择

Biosystems. 2005 Dec;82(3):208-25. doi: 10.1016/j.biosystems.2005.07.003. Epub 2005 Aug 22.

Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.使用非负主成分分析改进基因表达癌症分子模式发现

Genome Inform. 2008;21:200-11.

Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.

Gene selection using support vector machines with non-convex penalty.使用具有非凸惩罚项的支持向量机进行基因选择。

Bioinformatics. 2006 Jan 1;22(1):88-95. doi: 10.1093/bioinformatics/bti736. Epub 2005 Oct 25.

Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.基于图形聚类方法的多亚类癌症DNA微阵列分析

J Biosci Bioeng. 2008 Nov;106(5):442-8. doi: 10.1263/jbb.106.442.

Spectral analysis of two-signed microarray expression data.双符号微阵列表达数据的光谱分析。

Math Med Biol. 2007 Jun;24(2):131-48. doi: 10.1093/imammb/dql030. Epub 2006 Nov 28.

Recursive fuzzy granulation for gene subsets extraction and cancer classification.用于基因子集提取和癌症分类的递归模糊粒度化

IEEE Trans Inf Technol Biomed. 2008 Nov;12(6):723-30. doi: 10.1109/TITB.2008.920787.

New gene selection method for multiclass tumor classification by class centroid.基于类质心的多类肿瘤分类新基因选择方法。

J Biomed Inform. 2009 Feb;42(1):59-65. doi: 10.1016/j.jbi.2008.05.011. Epub 2008 Jun 17.

Fuzzy-rough supervised attribute clustering algorithm and classification of microarray data.模糊粗糙监督属性聚类算法与微阵列数据分类

IEEE Trans Syst Man Cybern B Cybern. 2011 Feb;41(1):222-33. doi: 10.1109/TSMCB.2010.2050684. Epub 2010 Jun 10.

Unsupervised clustering in mRNA expression profiles.mRNA表达谱中的无监督聚类

Comput Biol Med. 2006 Oct;36(10):1126-42. doi: 10.1016/j.compbiomed.2005.09.003. Epub 2005 Oct 24.

引用本文的文献

An efficient gene selection method for microarray data based on LASSO and BPSO.基于 LASSO 和 BPSO 的微阵列数据高效基因选择方法。

BMC Bioinformatics. 2019 Dec 30;20(Suppl 22):715. doi: 10.1186/s12859-019-3228-0.

A robust hybrid approach based on estimation of distribution algorithm and support vector machine for hunting candidate disease genes.一种基于分布估计算法和支持向量机的强大混合方法用于寻找候选疾病基因。

ScientificWorldJournal. 2013;2013:393570. doi: 10.1155/2013/393570. Epub 2013 Feb 7.

A comparison of machine learning techniques for survival prediction in breast cancer.机器学习技术在乳腺癌生存预测中的比较。

BioData Min. 2011 May 11;4:12. doi: 10.1186/1756-0381-4-12.

A review of estimation of distribution algorithms in bioinformatics.一种在生物信息学中评估分布算法的综述。

BioData Min. 2008 Sep 11;1(1):6. doi: 10.1186/1756-0381-1-6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用概率模型构建遗传算法进行癌症分类的基因选择

Gene selection for classification of cancers using probabilistic model building genetic algorithm.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献