一种基于粗糙集的聚类方法及其在医学数据库知识发现中的应用。

A clustering method based on rough sets and its application to knowledge discovery in the medical database.

作者信息

Hirano S, Tsumoto S, Okuzaki T, Hata Y

机构信息

Department of Medical Informatics, Shimane Medical University, School of Medicine, Izumo, Shimane 691-8501, Japan.

出版信息

Stud Health Technol Inform. 2001;84(Pt 1):206-10.

PMID:11604734

Abstract

This paper proposes a clustering method for nominal and numerical data based on Rough Sets and its application to knowledge discovery in the medical database. Classification is performed according to the indiscernibility relations defined on the basis of relative similarity between objects. The similarity is defined as a combination of two types of similarity measures: the Hamming distance for nominal attributes and the Mahalanobis distance for numerical attributes. Excessive generation of small category is suppressed by modifying similar equivalence relations into the same equivalence relation. An analysis of the meningoencephalitis diagnosis database was performed to validate this method. The result showed that this method could deal well with both types of attributes and discover the primary factors for diagnosis.

摘要

本文提出了一种基于粗糙集的名义数据和数值数据聚类方法及其在医学数据库知识发现中的应用。根据基于对象间相对相似性定义的不可分辨关系进行分类。相似性被定义为两种相似性度量的组合：名义属性的汉明距离和数值属性的马氏距离。通过将相似等价关系修改为相同等价关系来抑制小类的过度生成。对脑膜脑炎诊断数据库进行了分析以验证该方法。结果表明，该方法能够很好地处理这两种类型的属性并发现诊断的主要因素。

相似文献

A clustering method based on rough sets and its application to knowledge discovery in the medical database.

Stud Health Technol Inform. 2001;84(Pt 1):206-10.

[Knowledge discovery in database and its application in clinical diagnosis].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2004 Aug;21(4):677-80.

IEEE Trans Pattern Anal Mach Intell. 2004 Apr;26(4):434-48. doi: 10.1109/TPAMI.2004.1265860.

Randomized maps for assessing the reliability of patients clusters in DNA microarray data analyses.

Artif Intell Med. 2006 Jun;37(2):85-109. doi: 10.1016/j.artmed.2006.03.005. Epub 2006 May 23.

Knowledge discovery in clinical databases based on variable precision rough set model.

Proc Annu Symp Comput Appl Med Care. 1995:270-4.

Greedy rule generation from discrete data and its use in neural network rule extraction.

Neural Netw. 2008 Sep;21(7):1020-8. doi: 10.1016/j.neunet.2008.01.003. Epub 2008 Mar 23.

A multi-stage approach to clustering and imputation of gene expression profiles.

Bioinformatics. 2007 Apr 15;23(8):998-1005. doi: 10.1093/bioinformatics/btm053. Epub 2007 Feb 18.

Machine learning method for knowledge discovery experimented with otoneurological data.

Comput Methods Programs Biomed. 2008 Aug;91(2):154-64. doi: 10.1016/j.cmpb.2008.03.003. Epub 2008 Jun 3.

Randomized clustering forests for image classification.

IEEE Trans Pattern Anal Mach Intell. 2008 Sep;30(9):1632-46. doi: 10.1109/TPAMI.2007.70822.

Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data.

Bioinformatics. 2007 Sep 1;23(17):2247-55. doi: 10.1093/bioinformatics/btm320. Epub 2007 Jun 27.

引用本文的文献

J Med Syst. 2003 Jun;27(3):271-82. doi: 10.1023/a:1022527528856.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种基于粗糙集的聚类方法及其在医学数据库知识发现中的应用。

A clustering method based on rough sets and its application to knowledge discovery in the medical database.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献