Efficient discovery of risk patterns in medical data.

Author information

Li Jiuyong, Fu Ada Wai-chee, Fahey Paul

Affiliation

School of Computer and Information Science, University of South Australia, Mawson Lakes, Adelaide 5095, South Australia, Australia.

Publication information

Artif Intell Med. 2009 Jan;45(1):77-89. doi: 10.1016/j.artmed.2008.07.008. Epub 2008 Sep 9.

DOI:10.1016/j.artmed.2008.07.008

PMID:18783927

Abstract

OBJECTIVE

This paper studies the problem of efficiently discovering risk patterns in medical data. Risk patterns are defined by a statistical metric, relative risk, which is widely used in epidemiological research.
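As a point of reference, relative risk in its standard epidemiological form is the ratio of the outcome rate among records that match a pattern to the outcome rate among records that do not. The sketch below is a minimal illustration of that ratio only; the function name `relative_risk` and its count-based parameters are assumptions made for this example and are not taken from the paper.

```python
def relative_risk(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Relative risk from the four counts of a 2x2 table.

    "Exposed" means records matching the pattern; "unexposed" means the rest.
    All parameter names are hypothetical and chosen for this illustration.
    """
    if exposed_total <= 0 or unexposed_total <= 0:
        raise ValueError("both groups must contain at least one record")

    risk_exposed = exposed_cases / exposed_total
    risk_unexposed = unexposed_cases / unexposed_total

    if risk_unexposed == 0:
        # The ratio is undefined when the comparison group has no cases.
        return float("inf") if risk_exposed > 0 else float("nan")
    return risk_exposed / risk_unexposed
```

For example, if 30 of 100 matching patients and 10 of 100 non-matching patients show the outcome, the relative risk is about 3: matching patients are roughly three times as likely to have the outcome.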

METHODS

To avoid fruitless search in the complete exploration of risk patterns, we define the optimal risk pattern set, which excludes superfluous patterns, i.e. complicated patterns with lower relative risk than their corresponding simpler-form patterns. We prove that mining optimal risk pattern sets conforms to an anti-monotone property that supports an efficient mining algorithm, and we propose such an algorithm based on this property. We also propose a hierarchical structure for presenting discovered patterns so that domain experts can peruse them easily.
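The idea of excluding superfluous patterns can be illustrated with a small brute-force sketch. It is not the authors' algorithm: it enumerates every candidate pattern and filters afterwards, so it does not use the anti-monotone pruning the paper relies on for efficiency. The names `pattern_relative_risk` and `optimal_risk_patterns`, the `min_rr` threshold, the toy records, and the choice to treat ties with a simpler pattern as superfluous are all assumptions made for this example.

```python
from itertools import combinations


def pattern_relative_risk(records, pattern, outcome_key="outcome"):
    """Relative risk of the outcome for records matching `pattern` vs. the rest."""
    def matches(record):
        return all(record.get(key) == value for key, value in pattern)

    match = [r for r in records if matches(r)]
    rest = [r for r in records if not matches(r)]
    if not match or not rest:
        return None  # undefined when either group is empty
    p_match = sum(r[outcome_key] for r in match) / len(match)
    p_rest = sum(r[outcome_key] for r in rest) / len(rest)
    if p_rest == 0:
        return float("inf") if p_match > 0 else None
    return p_match / p_rest


def optimal_risk_patterns(records, items, max_len=2, min_rr=1.5):
    """Keep patterns whose relative risk beats that of every simpler sub-pattern."""
    scored = {}
    for size in range(1, max_len + 1):
        for combo in combinations(sorted(items), size):
            if len({key for key, _ in combo}) < size:
                continue  # skip patterns that use one attribute twice
            rr = pattern_relative_risk(records, combo)
            if rr is not None:
                scored[frozenset(combo)] = rr

    optimal = {}
    for pattern, rr in scored.items():
        if rr < min_rr:
            continue
        # Assumption: a tie with a simpler pattern also counts as superfluous.
        if all(rr > scored[sub] for sub in scored if sub < pattern):
            optimal[pattern] = rr
    return optimal


def make_group(smoker, age, total, cases):
    """Build `total` toy records, the first `cases` of which have the outcome."""
    return [{"smoker": smoker, "age": age, "outcome": 1 if i < cases else 0}
            for i in range(total)]


# Hypothetical toy data, not drawn from the paper.
records = (make_group("yes", "old", 4, 3) + make_group("yes", "young", 4, 3)
           + make_group("no", "old", 4, 1) + make_group("no", "young", 4, 0))
items = [("smoker", "yes"), ("age", "old")]
result = optimal_risk_patterns(records, items)
```

On this toy data the pattern smoker=yes has a relative risk of 6.0, while the longer pattern smoker=yes and age=old has a relative risk of about 2.25. The longer pattern clears the threshold but is dropped as superfluous because its simpler form already carries a higher relative risk.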

RESULTS

The proposed approach is compared with two well-known rule discovery methods, decision tree and association rule mining, on benchmark data sets, and is applied to a real-world application. The proposed method discovers more and better-quality risk patterns than the decision tree approach, which is not designed for such applications and is inadequate for pattern exploration. Unlike association rule mining, the proposed method does not produce a large number of uninteresting superfluous patterns, and it is also more efficient. A real-world case study shows that the method reveals some interesting risk patterns to medical practitioners.

CONCLUSION

The proposed method is an efficient approach to exploring risk patterns. It quickly identifies cohorts of patients who are vulnerable to a risk outcome in a large data set. The method is useful for exploratory studies of large medical data sets to generate and refine hypotheses, and for designing medical surveillance systems.

Similar articles

Efficient discovery of risk patterns in medical data.

Artif Intell Med. 2009 Jan;45(1):77-89. doi: 10.1016/j.artmed.2008.07.008. Epub 2008 Sep 9.

PARM--an efficient algorithm to mine association rules from spatial data.

IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1513-24. doi: 10.1109/TSMCB.2008.927730.

The limitations of decision trees and automatic learning in real world medical decision making.

Stud Health Technol Inform. 1998;52 Pt 1:529-33.

Rough set feature selection and rule induction for prediction of malignancy degree in brain glioma.

Comput Methods Programs Biomed. 2006 Aug;83(2):147-56. doi: 10.1016/j.cmpb.2006.06.007. Epub 2006 Aug 8.

Data mining and genetic algorithm based gene/SNP selection.

Artif Intell Med. 2004 Jul;31(3):183-96. doi: 10.1016/j.artmed.2004.04.002.

A Condition-Enumeration Tree method for mining biclusters from DNA microarray data sets.

Biosystems. 2009 Jul;97(1):44-59. doi: 10.1016/j.biosystems.2009.04.003. Epub 2009 Apr 23.

Combined mining: discovering informative knowledge in complex data.

IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):699-712. doi: 10.1109/TSMCB.2010.2086060.

Mining association rules from clinical databases: an intelligent diagnostic process in healthcare.

Stud Health Technol Inform. 2001;84(Pt 2):1399-403.

Mining unexpected temporal associations: applications in detecting adverse drug reactions.

IEEE Trans Inf Technol Biomed. 2008 Jul;12(4):488-500. doi: 10.1109/TITB.2007.900808.

Mining significant tree patterns in carbohydrate sugar chains.

Bioinformatics. 2008 Aug 15;24(16):i167-73. doi: 10.1093/bioinformatics/btn293.

Cited by

Identifying risk factors for Alzheimer's disease from multivariate longitudinal clinical data using temporal pattern mining.

BMC Bioinformatics. 2025 Feb 17;26(1):56. doi: 10.1186/s12859-024-06018-8.

Surprising and novel multivariate sequential patterns using odds ratio for temporal evolution in healthcare.

BMC Med Inform Decis Mak. 2024 Jun 13;24(1):165. doi: 10.1186/s12911-024-02566-4.

Using the Diagnostic Odds Ratio to Select Patterns to Build an Interpretable Pattern-Based Classifier in a Clinical Domain: Multivariate Sequential Pattern Mining Study.

JMIR Med Inform. 2022 Aug 10;10(8):e32319. doi: 10.2196/32319.

Predicting Anxiety in Routine Palliative Care Using Bayesian-Inspired Association Rule Mining.

Front Digit Health. 2021 Aug 25;3:724049. doi: 10.3389/fdgth.2021.724049. eCollection 2021.

Prevalence and comorbidities of known diabetes in northeastern Italy.

J Diabetes Investig. 2013 Jul 8;4(4):355-60. doi: 10.1111/jdi.12043. Epub 2013 Feb 21.

Knowledge Discovery in a Community Data Set: Malnutrition among the Elderly.

Healthc Inform Res. 2014 Jan;20(1):30-8. doi: 10.4258/hir.2014.20.1.30. Epub 2014 Jan 31.

Association rule mining based study for identification of clinical parameters akin to occurrence of brain tumor.

Bioinformation. 2013 Jun 29;9(11):555-9. doi: 10.6026/97320630009555. Print 2013.

Predicting in-hospital maternal mortality in Senegal and Mali.

PLoS One. 2013 May 30;8(5):e64157. doi: 10.1371/journal.pone.0064157. Print 2013.
