Schrom John R, Caraballo Pedro J, Castro M Regina, Simon György J
University of Minnesota, Minneapolis, MN.
Mayo Clinic, Rochester, MN.
AMIA Annu Symp Proc. 2013 Nov 16;2013:1249-57. eCollection 2013.
Prediabetes is the most important risk factor for developing type-2 diabetes mellitus, an important and growing epidemic. Prediabetes is often associated with comorbidities including hypercholesterolemia. While statin drugs are indicated to treat hypercholesterolemia, recent reports suggest a possible increased risk of developing overt diabetes associated with the use of statins. Association rule mining is a data mining technique capable of identifying interesting relationships between risks and treatments. However, it is limited in its ability to accurately calculate the effect of a treatment, as it does not appropriately account for bias and confounding. We propose a novel combination of propensity score matching and association rule mining to account for this bias, and find meaningful associations between a treatment and outcome for various subpopulations. We demonstrate this technique on a real diabetes data set examining the relationship between statin use and diabetes, and identify risk and protective factors previously not clearly defined.
糖尿病前期是发展为2型糖尿病的最重要风险因素,2型糖尿病是一种日益严重的重要流行病。糖尿病前期常与包括高胆固醇血症在内的合并症相关。虽然他汀类药物被用于治疗高胆固醇血症,但最近的报告表明,使用他汀类药物可能会增加患显性糖尿病的风险。关联规则挖掘是一种数据挖掘技术,能够识别风险与治疗之间的有趣关系。然而,它在准确计算治疗效果方面存在局限性,因为它没有恰当地考虑偏差和混杂因素。我们提出了一种倾向得分匹配与关联规则挖掘的新颖组合方法来解决这种偏差,并在各个亚人群中找到治疗与结果之间有意义的关联。我们在一个真实的糖尿病数据集上展示了这种技术,该数据集用于研究他汀类药物的使用与糖尿病之间的关系,并识别出先前未明确界定的风险和保护因素。