Searles Nielsen Susan, Warden Mark N, Camacho-Soto Alejandra, Willis Allison W, Wright Brenton A, Racette Brad A
From the Department of Neurology (S.S.N., M.N.W., A.C.-S., B.A.W., B.A.R.), Washington University School of Medicine, St. Louis, MO; Departments of Neurology and Biostatistics and Epidemiology (A.W.W.), University of Pennsylvania School of Medicine, Philadelphia; and School of Public Health, Faculty of Health Sciences (B.A.R.), University of the Witwatersrand, Parktown, South Africa.
Neurology. 2017 Oct 3;89(14):1448-1456. doi: 10.1212/WNL.0000000000004536. Epub 2017 Sep 1.
To use administrative medical claims data to identify patients with incident Parkinson disease (PD) prior to diagnosis.
Using a population-based case-control study of incident PD in 2009 among Medicare beneficiaries aged 66-90 years (89,790 cases, 118,095 controls) and the elastic net algorithm, we developed a cross-validated model for predicting PD using only demographic data and 2004-2009 Medicare claims data. We then compared this model to more basic models containing only demographic data and diagnosis codes for constipation, taste/smell disturbance, and REM sleep behavior disorder, using each model's receiver operator characteristic area under the curve (AUC).
We observed all established associations between PD and age, sex, race/ethnicity, tobacco smoking, and the above medical conditions. A model with those predictors had an AUC of only 0.670 (95% confidence interval [CI] 0.668-0.673). In contrast, the AUC for a predictive model with 536 diagnosis and procedure codes was 0.857 (95% CI 0.855-0.859). At the optimal cut point, sensitivity was 73.5% and specificity was 83.2%.
Using only demographic data and selected diagnosis and procedure codes readily available in administrative claims data, it is possible to identify individuals with a high probability of eventually being diagnosed with PD.
利用行政医疗索赔数据识别帕金森病(PD)确诊前的新发患者。
采用基于人群的病例对照研究,对2009年年龄在66 - 90岁的医疗保险受益人中的新发PD患者(89790例病例,118095例对照)进行研究,并使用弹性网络算法,我们仅利用人口统计学数据和2004 - 2009年医疗保险索赔数据开发了一个用于预测PD的交叉验证模型。然后,我们使用每个模型的曲线下接受者操作特征面积(AUC),将该模型与仅包含人口统计学数据以及便秘、味觉/嗅觉障碍和快速眼动睡眠行为障碍诊断代码的更基本模型进行比较。
我们观察到了PD与年龄、性别、种族/民族、吸烟以及上述医疗状况之间所有已确立的关联。包含这些预测因素的模型的AUC仅为0.670(95%置信区间[CI] 0.668 - 0.673)。相比之下,一个包含536个诊断和程序代码的预测模型的AUC为0.857(95% CI 0.855 - 0.859)。在最佳切点处,敏感性为73.5%,特异性为83.2%。
仅使用行政索赔数据中现成的人口统计学数据以及选定的诊断和程序代码,就有可能识别出最终被诊断为PD可能性很高的个体。