CARSVM：一种基于类关联规则的分类框架及其在基因表达数据中的应用。

Kianmehr Keivan, Alhajj Reda

BIDEALS Group, Department of Computer Science, University of Calgary, 2500 University Drive NW, Calgary, Alberta, Canada T2N 1N4.

Artif Intell Med. 2008 Sep;44(1):7-25. doi: 10.1016/j.artmed.2008.05.002. Epub 2008 Jun 30.

OBJECTIVE

In this study, we aim at building a classification framework, namely the CARSVM model, which integrates association rule mining and support vector machine (SVM). The goal is to benefit from advantages of both, the discriminative knowledge represented by class association rules and the classification power of the SVM algorithm, to construct an efficient and accurate classifier model that improves the interpretability problem of SVM as a traditional machine learning technique and overcomes the efficiency issues of associative classification algorithms.

METHOD

In our proposed framework: instead of using the original training set, a set of rule-based feature vectors, which are generated based on the discriminative ability of class association rules over the training samples, are presented to the learning component of the SVM algorithm. We show that rule-based feature vectors present a high-qualified source of discrimination knowledge that can impact substantially the prediction power of SVM and associative classification techniques. They provide users with more conveniences in terms of understandability and interpretability as well.

RESULTS

We have used four datasets from UCI ML repository to evaluate the performance of the developed system in comparison with five well-known existing classification methods. Because of the importance and popularity of gene expression analysis as real world application of the classification model, we present an extension of CARSVM combined with feature selection to be applied to gene expression data. Then, we describe how this combination will provide biologists with an efficient and understandable classifier model. The reported test results and their biological interpretation demonstrate the applicability, efficiency and effectiveness of the proposed model.

CONCLUSION

From the results, it can be concluded that a considerable increase in classification accuracy can be obtained when the rule-based feature vectors are integrated in the learning process of the SVM algorithm. In the context of applicability, according to the results obtained from gene expression analysis, we can conclude that the CARSVM system can be utilized in a variety of real world applications with some adjustments.

目的

在本研究中，我们旨在构建一个分类框架，即CARSVM模型，该模型集成了关联规则挖掘和支持向量机（SVM）。目标是利用两者的优势，即类关联规则所代表的判别知识和SVM算法的分类能力，构建一个高效且准确的分类器模型，以改善SVM作为传统机器学习技术时的可解释性问题，并克服关联分类算法的效率问题。

方法

在我们提出的框架中：不是使用原始训练集，而是将一组基于类关联规则对训练样本的判别能力生成的基于规则的特征向量呈现给SVM算法的学习组件。我们表明，基于规则的特征向量呈现出高质量的判别知识源，这可以极大地影响SVM和关联分类技术的预测能力。它们在可理解性和可解释性方面也为用户提供了更多便利。

结果

我们使用了来自UCI机器学习库的四个数据集，与五种著名的现有分类方法相比，评估所开发系统的性能。由于基因表达分析作为分类模型的实际应用的重要性和普遍性，我们提出了结合特征选择的CARSVM扩展，以应用于基因表达数据。然后，我们描述了这种结合将如何为生物学家提供一个高效且可理解的分类器模型。报告的测试结果及其生物学解释证明了所提出模型的适用性、效率和有效性。

结论

从结果可以得出结论，当将基于规则特征向量集成到SVM算法的学习过程中时，可以显著提高分类准确率。在适用性方面，根据从基因表达分析中获得的结果，我们可以得出结论，CARSVM系统经过一些调整后可用于各种实际应用。

相似文献

CARSVM: a class association rule-based classification framework and its application to gene expression data.

Artif Intell Med. 2008 Sep;44(1):7-25. doi: 10.1016/j.artmed.2008.05.002. Epub 2008 Jun 30.

A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.

Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.

Mixture classification model based on clinical markers for breast cancer prognosis.

Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.

Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.

Genome Inform. 2008;21:200-11.

Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.

BMC Bioinformatics. 2006 Dec 25;7:543. doi: 10.1186/1471-2105-7-543.

The application of mutual information-based feature selection and fuzzy LS-SVM-based classifier in motion classification.

Comput Methods Programs Biomed. 2008 Jun;90(3):275-84. doi: 10.1016/j.cmpb.2008.01.003. Epub 2008 Mar 4.

Tumor classification ranking from microarray data.

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.

Bias in error estimation when using cross-validation for model selection.

BMC Bioinformatics. 2006 Feb 23;7:91. doi: 10.1186/1471-2105-7-91.

A new classification model with simple decision rule for discovering optimal feature gene pairs.

Comput Biol Med. 2007 Nov;37(11):1637-46. doi: 10.1016/j.compbiomed.2007.03.004. Epub 2007 May 7.

[Rule induction algorithm for brain glioma using support vector machine].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2006 Apr;23(2):410-2.

引用本文的文献

Rule-Based Models for Risk Estimation and Analysis of In-hospital Mortality in Emergency and Critical Care.

Front Med (Lausanne). 2021 Nov 8;8:785711. doi: 10.3389/fmed.2021.785711. eCollection 2021.

DQB: A novel dynamic quantitive classification model using artificial bee colony algorithm with application on gene expression profiles.

Saudi J Biol Sci. 2018 Jul;25(5):932-946. doi: 10.1016/j.sjbs.2018.01.017. Epub 2018 Feb 9.

Toxicity prediction from toxicogenomic data based on class association rule mining.

Toxicol Rep. 2014 Nov 7;1:1133-1142. doi: 10.1016/j.toxrep.2014.10.014. eCollection 2014.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

CARSVM: a class association rule-based classification framework and its application to gene expression data.

Artif Intell Med. 2008 Sep;44(1):7-25. doi: 10.1016/j.artmed.2008.05.002. Epub 2008 Jun 30.

A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.

Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.

Mixture classification model based on clinical markers for breast cancer prognosis.

Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.

Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.

Genome Inform. 2008;21:200-11.

Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.

BMC Bioinformatics. 2006 Dec 25;7:543. doi: 10.1186/1471-2105-7-543.

The application of mutual information-based feature selection and fuzzy LS-SVM-based classifier in motion classification.

Comput Methods Programs Biomed. 2008 Jun;90(3):275-84. doi: 10.1016/j.cmpb.2008.01.003. Epub 2008 Mar 4.

Tumor classification ranking from microarray data.

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.

Bias in error estimation when using cross-validation for model selection.

BMC Bioinformatics. 2006 Feb 23;7:91. doi: 10.1186/1471-2105-7-91.

A new classification model with simple decision rule for discovering optimal feature gene pairs.

Comput Biol Med. 2007 Nov;37(11):1637-46. doi: 10.1016/j.compbiomed.2007.03.004. Epub 2007 May 7.

[Rule induction algorithm for brain glioma using support vector machine].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2006 Apr;23(2):410-2.

引用本文的文献

Rule-Based Models for Risk Estimation and Analysis of In-hospital Mortality in Emergency and Critical Care.

Front Med (Lausanne). 2021 Nov 8;8:785711. doi: 10.3389/fmed.2021.785711. eCollection 2021.

DQB: A novel dynamic quantitive classification model using artificial bee colony algorithm with application on gene expression profiles.

Saudi J Biol Sci. 2018 Jul;25(5):932-946. doi: 10.1016/j.sjbs.2018.01.017. Epub 2018 Feb 9.

Toxicity prediction from toxicogenomic data based on class association rule mining.

Toxicol Rep. 2014 Nov 7;1:1133-1142. doi: 10.1016/j.toxrep.2014.10.014. eCollection 2014.

Suppr
超能文献

CARSVM: a class association rule-based classification framework and its application to gene expression data.

作者信息

机构信息

出版信息

OBJECTIVE

METHOD

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

Suppr超能文献

CARSVM：一种基于类关联规则的分类框架及其在基因表达数据中的应用。

CARSVM: a class association rule-based classification framework and its application to gene expression data.

作者信息

机构信息

出版信息

OBJECTIVE

METHOD

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

Suppr
超能文献