Brossette S E, Sprague A P, Hardin J M, Waites K B, Jones W T, Moser S A
University of Alabama at Birmingham, USA.
J Am Med Inform Assoc. 1998 Jul-Aug;5(4):373-81. doi: 10.1136/jamia.1998.0050373.
The authors consider the problem of identifying new, unexpected, and interesting patterns in hospital infection control and public health surveillance data and present a new data analysis process and system based on association rules to address this problem.
The authors first illustrate the need for automated pattern discovery and data mining in hospital infection control and public health surveillance. Next, they define association rules, explain how those rules can be used in surveillance, and present a novel process and system--the Data Mining Surveillance System (DMSS)--that utilize association rules to identify new and interesting patterns in surveillance data.
Experimental results were obtained using DMSS to analyze Pseudomonas aeruginosa infection control data collected over one year (1996) at University of Alabama at Birmingham Hospital. Experiments using one-, three-, and six-month time partitions yielded 34, 57, and 28 statistically significant events, respectively. Although not all statistically significant events are clinically significant, a subset of events generated in each analysis indicated potentially significant shifts in the occurrence of infection or antimicrobial resistance patterns of P. aeruginosa.
The new process and system are efficient and effective in identifying new, unexpected, and interesting patterns in surveillance data. The clinical relevance and utility of this process await the results of prospective studies currently in progress.
作者探讨了在医院感染控制和公共卫生监测数据中识别新的、意外的和有趣模式的问题,并提出了一种基于关联规则的新数据分析流程和系统来解决这一问题。
作者首先阐述了在医院感染控制和公共卫生监测中进行自动模式发现和数据挖掘的必要性。接下来,他们定义了关联规则,解释了这些规则如何用于监测,并提出了一个新颖的流程和系统——数据挖掘监测系统(DMSS),该系统利用关联规则来识别监测数据中的新的和有趣的模式。
使用DMSS对阿拉巴马大学伯明翰医院在一年(1996年)内收集的铜绿假单胞菌感染控制数据进行分析,获得了实验结果。使用1个月、3个月和6个月的时间分区进行的实验分别产生了34、57和28个具有统计学意义的事件。虽然并非所有具有统计学意义的事件都具有临床意义,但每次分析中产生的一部分事件表明铜绿假单胞菌感染或抗菌药物耐药模式的发生可能有显著变化。
新的流程和系统在识别监测数据中的新的、意外的和有趣的模式方面是高效且有效的。这一流程的临床相关性和实用性有待目前正在进行的前瞻性研究的结果。