使用在不同特征子集上训练的溯因网络委员会改进医学数据分类。

Improved classification of medical data using abductive network committees trained on different feature subsets.

作者信息

Abdel-Aal R E

机构信息

Department of Computer Engineering, King Fahd University of Petroleum and Minerals, P.O. Box 1759, KFUPM, Dhahran 31261, Saudi Arabia.

出版信息

Comput Methods Programs Biomed. 2005 Nov;80(2):141-53. doi: 10.1016/j.cmpb.2005.08.001. Epub 2005 Sep 19.

DOI:10.1016/j.cmpb.2005.08.001

PMID:16169631

Abstract

This paper demonstrates the use of abductive network classifier committees trained on different features for improving classification accuracy in medical diagnosis. In an earlier publication, committee members were trained on different subsets of the training set to ensure enough diversity for improved committee performance. In situations characterized by high data dimensionality, i.e. a large number of features and a relatively few training examples, it may be more advantageous to split the feature set rather than the training set. We describe a novel approach for tentatively ranking the features and forming subsets of uniform predictive quality for training individual members. The abductive network training algorithm is used to select optimum predictors from the feature set at various levels of model complexity specified by the user. Using the resulting tentative ranking, the features are grouped into mutually exclusive subsets of approximately equal predictive power for training the members. The approach is demonstrated on three standard medical diagnosis datasets (breast cancer, heart disease, and diabetes). Three-member committees trained on different feature subsets and using simple output combination methods reduce classification errors by up to 20% compared to the best single model developed with the full feature set. Results are compared with those reported previously with members trained through splitting the training set. Training abductive committee members on feature subsets of approximately equal predictive power achieves both diversity and quality for improved committee performance. Ensemble feature subset selection can be performed using GMDH-based learning algorithms. The approach should be advantageous in situations characterized by high data dimensionality.

摘要

本文展示了在不同特征上训练的溯因网络分类器委员会在提高医学诊断分类准确性方面的应用。在早期的一篇论文中，委员会成员是在训练集的不同子集上进行训练的，以确保足够的多样性来提高委员会的性能。在以高数据维度为特征的情况下，即大量特征和相对较少的训练示例，划分特征集而非训练集可能更具优势。我们描述了一种新颖的方法，用于初步对特征进行排序，并形成具有统一预测质量的子集来训练各个成员。溯因网络训练算法用于在用户指定的不同模型复杂度水平下，从特征集中选择最优预测器。利用得到的初步排序，将特征分组为预测能力大致相等的相互排斥的子集，用于训练成员。该方法在三个标准医学诊断数据集（乳腺癌、心脏病和糖尿病）上进行了演示。与使用完整特征集开发的最佳单一模型相比，在不同特征子集上训练并使用简单输出组合方法的三人委员会可将分类错误降低多达20%。将结果与之前通过划分训练集训练成员所报告的结果进行了比较。在预测能力大致相等的特征子集上训练溯因委员会成员，可实现多样性和质量，从而提高委员会的性能。可以使用基于GMDH的学习算法进行集成特征子集选择。该方法在以高数据维度为特征的情况下应该具有优势。

相似文献

Improved classification of medical data using abductive network committees trained on different feature subsets.

Comput Methods Programs Biomed. 2005 Nov;80(2):141-53. doi: 10.1016/j.cmpb.2005.08.001. Epub 2005 Sep 19.

GMDH-based feature ranking and selection for improved classification of medical data.

J Biomed Inform. 2005 Dec;38(6):456-68. doi: 10.1016/j.jbi.2005.03.003. Epub 2005 Apr 16.

Abductive network committees for improved classification of medical data.

Methods Inf Med. 2004;43(2):192-201.

A novel feature selection approach for biomedical data classification.

J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30.

Medical data mining by fuzzy modeling with selected features.

Artif Intell Med. 2008 Jul;43(3):195-206. doi: 10.1016/j.artmed.2008.04.004. Epub 2008 Jun 5.

Ensemble adaptive network-based fuzzy inference system with weighted arithmetical mean and application to diagnosis of optic nerve disease from visual-evoked potential signals.

Artif Intell Med. 2008 Jun;43(2):141-9. doi: 10.1016/j.artmed.2008.03.007. Epub 2008 May 12.

Identification of patients with congestive heart failure using different neural networks approaches.

Technol Health Care. 2009;17(4):305-21. doi: 10.3233/THC-2009-0542.

Tumor classification by combining PNN classifier ensemble with neighborhood rough set based gene reduction.

Comput Biol Med. 2010 Feb;40(2):179-89. doi: 10.1016/j.compbiomed.2009.11.014. Epub 2009 Dec 30.

Rough set feature selection and rule induction for prediction of malignancy degree in brain glioma.

Comput Methods Programs Biomed. 2006 Aug;83(2):147-56. doi: 10.1016/j.cmpb.2006.06.007. Epub 2006 Aug 8.

Differential diagnosis of CT focal liver lesions using texture features, feature selection and ensemble driven classifiers.

Artif Intell Med. 2007 Sep;41(1):25-37. doi: 10.1016/j.artmed.2007.05.002. Epub 2007 Jul 12.

引用本文的文献

Health informatics publication trends in Saudi Arabia: a bibliometric analysis over the last twenty-four years.

J Med Libr Assoc. 2021 Apr 1;109(2):219-239. doi: 10.5195/jmla.2021.1072.

A Systematic Mapping Study of Data Preparation in Heart Disease Knowledge Discovery.

J Med Syst. 2018 Dec 13;43(1):17. doi: 10.1007/s10916-018-1134-z.

A robust multi-class feature selection strategy based on Rotation Forest Ensemble algorithm for diagnosis of Erythemato-Squamous diseases.

J Med Syst. 2012 Apr;36(2):941-9. doi: 10.1007/s10916-010-9558-0. Epub 2010 Jul 13.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用在不同特征子集上训练的溯因网络委员会改进医学数据分类。

Improved classification of medical data using abductive network committees trained on different feature subsets.

作者信息

Abdel-Aal R E

机构信息

Department of Computer Engineering, King Fahd University of Petroleum and Minerals, P.O. Box 1759, KFUPM, Dhahran 31261, Saudi Arabia.

出版信息

Comput Methods Programs Biomed. 2005 Nov;80(2):141-53. doi: 10.1016/j.cmpb.2005.08.001. Epub 2005 Sep 19.

DOI:10.1016/j.cmpb.2005.08.001

PMID:16169631

Abstract

摘要

使用在不同特征子集上训练的溯因网络委员会改进医学数据分类。

Improved classification of medical data using abductive network committees trained on different feature subsets.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用在不同特征子集上训练的溯因网络委员会改进医学数据分类。

Improved classification of medical data using abductive network committees trained on different feature subsets.

作者信息

机构信息

出版信息

相似文献

引用本文的文献