Sauer Brian C, Teng Chia-Chen, Accortt Neil A, Burningham Zachary, Collier David, Trivedi Mona, Cannon Grant W
Salt Lake City Veterans Affairs Medical Center, Health Services Research and Development (IDEAS) Center and University of Utah Division of Epidemiology, Salt Lake City, UT, USA.
Salt Lake IDEAS Center, VA; Salt Lake City Health Care System, 500 Foothill Drive Bldg. 182, Salt Lake City, UT, 84148-0001, USA.
Arthritis Res Ther. 2017 May 8;19(1):86. doi: 10.1186/s13075-017-1294-0.
This study developed and validated a claims-based statistical model to predict rheumatoid arthritis (RA) disease activity, measured by the 28-joint count Disease Activity Score (DAS28).
Veterans enrolled in the Veterans Affairs Rheumatoid Arthritis (VARA) registry with one year of data available for review before being assessed by the DAS28, were studied. Three models were developed based on initial selection of variables for analyses. The first model was based on clinically defined variables, the second leveraged grouping systems for high dimensional data and the third approach prescreened all possible predictors based on a significant bivariate association with the DAS28. The least absolute shrinkage and selection operator (LASSO) with fivefold cross-validation was used for variable selection and model development. Models were also compared for patients with <5 years to those ≥5 years of RA disease. Classification accuracy was examined for remission (DAS28 < 2.6) and for low (2.6-3.1), moderate (3.2-5.1) and high (>5.1) activity.
There were 1582 Veterans who fulfilled inclusion criteria. The adjusted r-square for the three models tested ranged from 0.221 to 0.223. The models performed slightly better for patients with <5 years of RA disease than for patients with ≥5 years of RA disease. Correct classification of DAS28 categories ranged from 39.9% to 40.5% for the three models.
The multiple models tested showed weak overall predictive accuracy in measuring DAS28. The models performed poorly at predicting patients with remission and high disease activity. Future research should investigate components of disease activity measures directly from medical records and incorporate additional laboratory and other clinical data.
本研究开发并验证了一种基于索赔数据的统计模型,用于预测类风湿关节炎(RA)的疾病活动度,该活动度通过28关节计数疾病活动评分(DAS28)来衡量。
对纳入退伍军人事务部类风湿关节炎(VARA)登记系统的退伍军人进行研究,这些退伍军人在接受DAS28评估前有一年的数据可供审查。基于分析变量的初始选择,开发了三种模型。第一种模型基于临床定义的变量,第二种利用高维数据的分组系统,第三种方法基于与DAS28的显著双变量关联对所有可能的预测因子进行预筛选。采用具有五重交叉验证的最小绝对收缩和选择算子(LASSO)进行变量选择和模型开发。还比较了RA病程<5年和≥5年患者的模型。检查了缓解期(DAS28<2.6)以及低(2.6 - 3.1)、中(3.2 - 5.1)和高(>5.1)活动度的分类准确性。
有1582名退伍军人符合纳入标准。所测试的三种模型的调整后决定系数范围为0.221至0.223。RA病程<5年的患者模型表现略优于病程≥5年的患者。三种模型对DAS28类别的正确分类率在39.9%至40.5%之间。
所测试的多种模型在测量DAS28时总体预测准确性较弱。这些模型在预测缓解期和高疾病活动度患者方面表现不佳。未来的研究应直接从医疗记录中调查疾病活动度测量的组成部分,并纳入更多实验室和其他临床数据。