一种具有简单决策规则的新分类模型，用于发现最优特征基因对。

A new classification model with simple decision rule for discovering optimal feature gene pairs.

作者信息

Li Jie, Tang Xianglong

机构信息

Department of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China.

出版信息

Comput Biol Med. 2007 Nov;37(11):1637-46. doi: 10.1016/j.compbiomed.2007.03.004. Epub 2007 May 7.

DOI:10.1016/j.compbiomed.2007.03.004

PMID:17482157

Abstract

Classifiers have been widely used to select an optimal subset of feature genes from microarray data for accurate classification of cancer samples and cancer-related studies. However, the classification rules derived from most classifiers are complex and difficult to understand in biological significance. How to solve this problem is a new challenge. In this paper, a new classification model based on gene pair is proposed to address the problem. The experimental results on several microarray data demonstrate that the proposed classification model performs well in finding a large number of excellent feature gene pairs. A 100% LOOCV classification accuracy can be achieved using a single classification model based on optimal feature gene pair or combining multiple top-ranked classification models. Using the proposed method, we successfully identified important cancer-related genes that had been validated in previous biological studies while they were not discovered by the other methods.

摘要

分类器已被广泛用于从微阵列数据中选择特征基因的最优子集，以对癌症样本进行准确分类及开展癌症相关研究。然而，大多数分类器得出的分类规则复杂，且在生物学意义上难以理解。如何解决这一问题是一项新挑战。本文提出了一种基于基因对的新分类模型来解决该问题。在多个微阵列数据上的实验结果表明，所提出的分类模型在找到大量优秀特征基因对方面表现良好。使用基于最优特征基因对的单个分类模型或组合多个排名靠前的分类模型可实现100%的留一法交叉验证分类准确率。使用所提出的方法，我们成功识别出了在先前生物学研究中已得到验证但未被其他方法发现的重要癌症相关基因。

相似文献

A new classification model with simple decision rule for discovering optimal feature gene pairs.一种具有简单决策规则的新分类模型，用于发现最优特征基因对。

Comput Biol Med. 2007 Nov;37(11):1637-46. doi: 10.1016/j.compbiomed.2007.03.004. Epub 2007 May 7.

Pattern identification and classification in gene expression data using an autoassociative neural network model.使用自联想神经网络模型对基因表达数据进行模式识别和分类。

Biotechnol Bioeng. 2003 Mar 5;81(5):594-606. doi: 10.1002/bit.10505.

Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.

Cancer classification and prediction using logistic regression with Bayesian gene selection.使用贝叶斯基因选择的逻辑回归进行癌症分类和预测。

J Biomed Inform. 2004 Aug;37(4):249-59. doi: 10.1016/j.jbi.2004.07.009.

CARSVM: a class association rule-based classification framework and its application to gene expression data.CARSVM：一种基于类关联规则的分类框架及其在基因表达数据中的应用。

Artif Intell Med. 2008 Sep;44(1):7-25. doi: 10.1016/j.artmed.2008.05.002. Epub 2008 Jun 30.

Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers.从微阵列数据中选择最少数量的相关基因以设计精确的组织分类器。

Biosystems. 2007 Jul-Aug;90(1):78-86. doi: 10.1016/j.biosystems.2006.07.002. Epub 2006 Jul 10.

Optimal number of features as a function of sample size for various classification rules.针对各种分类规则，作为样本大小函数的最优特征数量。

Bioinformatics. 2005 Apr 15;21(8):1509-15. doi: 10.1093/bioinformatics/bti171. Epub 2004 Nov 30.

Consensus analysis of multiple classifiers using non-repetitive variables: diagnostic application to microarray gene expression data.使用非重复变量的多个分类器的一致性分析：在微阵列基因表达数据中的诊断应用

Comput Biol Chem. 2007 Feb;31(1):48-56. doi: 10.1016/j.compbiolchem.2007.01.001. Epub 2007 Jan 4.

A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.一种用于从癌组织基因表达数据中进行特征选择和规则提取的多核支持向量机方案。

Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.

Classification of microarray data with factor mixture models.基于因子混合模型的微阵列数据分类

Bioinformatics. 2006 Jan 15;22(2):202-8. doi: 10.1093/bioinformatics/bti779. Epub 2005 Nov 15.

引用本文的文献

Therapy-, gender- and race-specific microRNA markers, target genes and networks related to glioblastoma recurrence and survival.与胶质母细胞瘤复发和生存相关的治疗、性别和种族特异性 microRNA 标志物、靶基因和网络。

Cancer Genomics Proteomics. 2011 Jul-Aug;8(4):173-83.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种具有简单决策规则的新分类模型，用于发现最优特征基因对。

A new classification model with simple decision rule for discovering optimal feature gene pairs.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献