Cohen Gilles, Sax Hugo, Geissbuhler Antoine
Medical Informatics Service, University Hospital of Geneva, 1211 Geneva, Switzerland.
Stud Health Technol Inform. 2008;136:21-6.
Nosocomial infections (NIs) - those acquired in health care settings - represent one of the major causes of increased mortality in hospitalized patients. As they are a real problem for both patients and health authorities, the development of an effective surveillance system to monitor and detect them is of paramount importance. This paper presents a retrospective analysis of a prevalence survey of NIs done in the Geneva University Hospital. The objective is to identify patients with one or more NIs based on clinical and other data collected during the survey. In this classification task, the main difficulty lies in the significant imbalance between positive and negative cases. To overcome this problem, we investigate one-class Parzen density estimator which can be trained to differentiate two classes taking examples from a single class. The results obtained are encouraging: whereas standard 2-class SVMs scored a baseline sensitivity of 50.6% on this problem, the one-class approach increased sensitivity to as much as 88.6%. These results suggest that one-class Parzen density estimator can provide an effective and efficient way of overcoming data imbalance in classification problems.
医院感染(NI)——即在医疗保健机构中获得的感染——是住院患者死亡率增加的主要原因之一。由于它们对患者和卫生当局来说都是一个实际问题,因此开发一个有效的监测系统来监测和检测它们至关重要。本文对日内瓦大学医院进行的医院感染患病率调查进行了回顾性分析。目的是根据调查期间收集的临床和其他数据识别患有一种或多种医院感染的患者。在这个分类任务中,主要困难在于阳性和阴性病例之间的显著不平衡。为了克服这个问题,我们研究了单类Parzen密度估计器,它可以通过从单个类中获取示例进行训练来区分两类。获得的结果令人鼓舞:在此问题上,标准的二类支持向量机的基线灵敏度为50.6%,而单类方法将灵敏度提高到了88.6%。这些结果表明,单类Parzen密度估计器可以提供一种有效且高效的方法来克服分类问题中的数据不平衡。