Humana, 545 Wilson Valley Dr., Marion, NC 28752.
J Manag Care Spec Pharm. 2015 Dec;21(12):1149-59. doi: 10.18553/jmcp.2015.21.12.1149.
Despite the importance of early detection, delayed diagnosis of chronic obstructive pulmonary disease (COPD) is relatively common. Approximately 12 million people in the United States have undiagnosed COPD. Diagnosis of COPD is essential for the timely implementation of interventions, such as smoking cessation programs, drug therapies, and pulmonary rehabilitation, which are aimed at improving outcomes and slowing disease progression.
To develop and validate a predictive model to identify patients likely to have undiagnosed COPD using administrative claims data.
A predictive model was developed and validated utilizing a retro-spective cohort of patients with and without a COPD diagnosis (cases and controls), aged 40-89, with a minimum of 24 months of continuous health plan enrollment (Medicare Advantage Prescription Drug [MAPD] and commercial plans), and identified between January 1, 2009, and December 31, 2012, using Humana's claims database. Stratified random sampling based on plan type (commercial or MAPD) and index year was performed to ensure that cases and controls had a similar distribution of these variables. Cases and controls were compared to identify demographic, clinical, and health care resource utilization (HCRU) characteristics associated with a COPD diagnosis. Stepwise logistic regression (SLR), neural networking, and decision trees were used to develop a series of models. The models were trained, validated, and tested on randomly partitioned subsets of the sample (Training, Validation, and Test data subsets). Measures used to evaluate and compare the models included area under the curve (AUC); index of the receiver operating characteristics (ROC) curve; sensitivity, specificity, positive predictive value (PPV); and negative predictive value (NPV). The optimal model was selected based on AUC index on the Test data subset.
A total of 50,880 cases and 50,880 controls were included, with MAPD patients comprising 92% of the study population. Compared with controls, cases had a statistically significantly higher comorbidity burden and HCRU (including hospitalizations, emergency room visits, and medical procedures). The optimal predictive model was generated using SLR, which included 34 variables that were statistically significantly associated with a COPD diagnosis. After adjusting for covariates, anticholinergic bronchodilators (OR = 3.336) and tobacco cessation counseling (OR = 2.871) were found to have a large influence on the model. The final predictive model had an AUC of 0.754, sensitivity of 60%, specificity of 78%, PPV of 73%, and an NPV of 66%.
This claims-based predictive model provides an acceptable level of accuracy in identifying patients likely to have undiagnosed COPD in a large national health plan. Identification of patients with undiagnosed COPD may enable timely management and lead to improved health outcomes and reduced COPD-related health care expenditures.
尽管早期发现很重要,但慢性阻塞性肺疾病(COPD)的诊断延迟仍较为常见。在美国,约有 1200 万人患有未确诊的 COPD。COPD 的诊断对于及时实施干预措施至关重要,如戒烟计划、药物治疗和肺康复,这些措施旨在改善预后并减缓疾病进展。
利用管理索赔数据开发并验证一种预测模型,以识别可能患有未确诊 COPD 的患者。
利用 Humana 的索赔数据库,针对年龄在 40-89 岁之间、至少有 24 个月连续健康计划(医疗保险优势处方药[MAPD]和商业计划)的患有和不患有 COPD 诊断(病例和对照组)的患者,建立并验证了一个预测模型。根据计划类型(商业或 MAPD)和索引年进行分层随机抽样,以确保病例和对照组在这些变量的分布上具有相似性。对病例和对照组进行比较,以确定与 COPD 诊断相关的人口统计学、临床和医疗资源利用(HCRU)特征。使用逐步逻辑回归(SLR)、神经网络和决策树开发了一系列模型。使用随机划分的样本子集(训练、验证和测试数据子集)对模型进行训练、验证和测试。用于评估和比较模型的度量包括曲线下面积(AUC);接收器工作特性(ROC)曲线的指数;灵敏度、特异性、阳性预测值(PPV);和阴性预测值(NPV)。基于测试数据子集的 AUC 指数选择最佳模型。
共纳入 50880 例病例和 50880 例对照组,其中 MAPD 患者占研究人群的 92%。与对照组相比,病例组的合并症负担和 HCRU(包括住院、急诊就诊和医疗程序)明显更高。使用 SLR 生成了最佳预测模型,该模型包含 34 个与 COPD 诊断显著相关的变量。在调整协变量后,发现抗胆碱能支气管扩张剂(OR=3.336)和戒烟咨询(OR=2.871)对模型有较大影响。最终预测模型的 AUC 为 0.754,灵敏度为 60%,特异性为 78%,PPV 为 73%,NPV 为 66%。
基于索赔的预测模型在识别大型国家健康计划中可能患有未确诊 COPD 的患者方面具有可接受的准确性。识别患有未确诊 COPD 的患者可以实现及时管理,并改善健康结局和降低 COPD 相关医疗保健支出。