Diagnosis of coronary artery disease using an efficient hash table based closed frequent itemsets mining.

Affiliation

Department of Computer Applications, St. Xavier's Catholic College of Engineering, Chunkankadai, K.K. Dist., Nagercoil, 629003, Tamil Nadu, India.

Publication information

Med Biol Eng Comput. 2018 May;56(5):749-759. doi: 10.1007/s11517-017-1719-6. Epub 2017 Sep 14.

DOI:10.1007/s11517-017-1719-6

PMID:28905236

Abstract

This paper proposes an efficient hash table based closed frequent itemsets (HCFI) mining algorithm for the early prediction of coronary artery disease. The HCFI algorithm generates closed frequent itemsets efficiently by intersecting the transaction IDs of itemsets, without considering the names of the items or itemsets. The hash table it employs reduces search time to O(1), i.e., constant time. The algorithm is applied to the UCI (University of California, Irvine) Cleveland dataset, a cardiovascular disease database, to generate closed frequent itemsets. The findings of the HCFI algorithm are as follows. (1) It determines a set of distinguishing features that separate the 'healthy' class from the 'sick' class. Features such as heart status being normal, oldpeak being less than or equal to 1.2, slope being up, number of vessels colored being zero, absence of exercise-induced angina, and maximum heart rate achieved between 151 and 180 characterize the 'healthy' class. Features such as chest pain being asymptomatic, heart status being a reversible defect, slope being flat, presence of exercise-induced angina, and serum cholesterol being greater than 240 indicate a presumption of heart disease in both genders. (2) It predicts that females have a lower chance of coronary heart disease than males. The algorithm is also compared with two other state-of-the-art algorithms, 'NAFCP' (N-list based algorithm for mining frequent closed patterns) and 'PredictiveApriori', to show the effectiveness of the proposed algorithm.
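The general idea described in the abstract, intersecting transaction ID sets and looking them up in a hash table, can be illustrated with a minimal Python sketch. This is not the authors' HCFI implementation, which the abstract does not spell out. It assumes a vertical layout (item to set of transaction IDs) and uses a `dict` keyed by the transaction ID set, so that itemsets sharing the same transactions collapse into one closed itemset. The function name `mine_closed_itemsets` and the toy rows are hypothetical and are not taken from the Cleveland dataset.

```python
def mine_closed_itemsets(transactions, min_support):
    """Return {closed itemset: support} using TID-set intersections.

    Illustrative sketch only: a dict keyed by the TID set plays the role
    of the hash table, giving average constant-time lookup per TID set.
    """
    # Vertical layout: item -> set of transaction IDs containing it.
    tidsets = {}
    for tid, items in enumerate(transactions):
        for item in items:
            tidsets.setdefault(item, set()).add(tid)

    # Items below the support threshold cannot occur in a frequent itemset.
    frequent = {item: frozenset(tids) for item, tids in tidsets.items()
                if len(tids) >= min_support}

    closed = {}  # hash table: TID set -> all items shared by those transactions
    for item, tids in frequent.items():
        found = {}
        # Intersect the new item's TID set with every TID set seen so far.
        for known_tids, known_items in closed.items():
            common = known_tids & tids
            if len(common) >= min_support:
                found[common] = found.get(common, frozenset()) | known_items | {item}
        # The item on its own is also a candidate.
        found[tids] = found.get(tids, frozenset()) | {item}
        # Merge: itemsets with an identical TID set become one closed itemset.
        for key, items in found.items():
            closed[key] = closed.get(key, frozenset()) | items

    return {items: len(tids) for tids, items in closed.items()}


# Toy data with made-up rows (hypothetical, for illustration only).
toy_rows = [
    {"slope=up", "angina=no"},
    {"slope=up", "angina=no", "vessels=0"},
    {"slope=up", "angina=no", "vessels=0"},
    {"vessels=0"},
]
result = mine_closed_itemsets(toy_rows, min_support=2)
for itemset, support in result.items():
    print(sorted(itemset), support)
```

In this toy example, `slope=up` and `angina=no` occur in exactly the same rows, so neither is reported alone; only their union is closed. The sketch enumerates all frequent intersections and can grow exponentially on dense data, so it is meant to show the principle rather than to be efficient.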

Similar articles

The Mining Algorithm of Maximum Frequent Itemsets Based on Frequent Pattern Tree.

Comput Intell Neurosci. 2022 May 18;2022:7022168. doi: 10.1155/2022/7022168. eCollection 2022.

An efficient pattern growth approach for mining fault tolerant frequent itemsets.

Expert Syst Appl. 2020 Apr 1;143:113046. doi: 10.1016/j.eswa.2019.113046. Epub 2019 Oct 21.

An efficient algorithm for mining closed itemsets.

J Zhejiang Univ Sci. 2004 Jan;5(1):8-15. doi: 10.1007/BF02839306.

Mining Association rules for Low-Frequency itemsets.

PLoS One. 2018 Jul 23;13(7):e0198066. doi: 10.1371/journal.pone.0198066. eCollection 2018.

A novel association rule mining approach using TID intermediate itemset.

PLoS One. 2018 Jan 19;13(1):e0179703. doi: 10.1371/journal.pone.0179703. eCollection 2018.

Bit-table based biclustering and frequent closed itemset mining in high-dimensional binary data.

ScientificWorldJournal. 2014 Jan 30;2014:870406. doi: 10.1155/2014/870406. eCollection 2014.

Quantifying the informativeness for biomedical literature summarization: An itemset mining method.

Comput Methods Programs Biomed. 2017 Jul;146:77-89. doi: 10.1016/j.cmpb.2017.05.011. Epub 2017 May 27.

Negative and positive association rules mining from text using frequent and infrequent itemsets.

ScientificWorldJournal. 2014;2014:973750. doi: 10.1155/2014/973750. Epub 2014 May 18.

TKFIM: Top-K frequent itemset mining technique based on equivalence classes.

PeerJ Comput Sci. 2021 Mar 8;7:e385. doi: 10.7717/peerj-cs.385. eCollection 2021.

Cited by

NetNCSP: Nonoverlapping closed sequential pattern mining.

Knowl Based Syst. 2020 May 21;196:105812. doi: 10.1016/j.knosys.2020.105812. Epub 2020 Mar 31.

An automatic multi-class coronary atherosclerosis plaque detection and classification framework.

Med Biol Eng Comput. 2019 Jan;57(1):245-257. doi: 10.1007/s11517-018-1880-6. Epub 2018 Aug 7.

References

Modeling a healthy and a person with heart failure conditions using the object-oriented modeling environment Dymola.

Med Biol Eng Comput. 2015 Oct;53(10):1049-68. doi: 10.1007/s11517-015-1384-6. Epub 2015 Sep 18.

Integration of Different Risk Assessment Tools to Improve Stratification of Patients with Coronary Artery Disease.

Med Biol Eng Comput. 2015 Oct;53(10):1069-83. doi: 10.1007/s11517-015-1342-3. Epub 2015 Jul 28.

Automated diagnosis of coronary artery disease based on data mining and fuzzy modeling.

IEEE Trans Inf Technol Biomed. 2008 Jul;12(4):447-58. doi: 10.1109/TITB.2007.907985.

Association rule discovery with the train and test approach for heart disease prediction.

IEEE Trans Inf Technol Biomed. 2006 Apr;10(2):334-43. doi: 10.1109/titb.2006.864475.
