Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA.
Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, USA.
J Alzheimers Dis. 2023;95(3):931-940. doi: 10.3233/JAD-230344.
Multiple algorithms with variable performance have been developed to identify dementia using combinations of billing codes and medication data that are widely available from electronic health records (EHR). If the characteristics of misclassified patients are clearly identified, modifying existing algorithms to improve performance may be possible.
To examine the performance of a code-based algorithm to identify dementia cases in the population-based Mayo Clinic Study of Aging (MCSA) where dementia diagnosis (i.e., reference standard) is actively assessed through routine follow-up and describe the characteristics of persons incorrectly categorized.
There were 5,316 participants (age at baseline (mean (SD)): 73.3 (9.68) years; 50.7% male) without dementia at baseline and available EHR data. ICD-9/10 codes and prescription medications for dementia were extracted between baseline and one year after an MCSA dementia diagnosis or last follow-up. Fisher's exact or Kruskal-Wallis tests were used to compare characteristics between groups.
Algorithm sensitivity and specificity were 0.70 (95% CI: 0.67, 0.74) and 0.95 (95% CI: 0.95, 0.96). False positives (i.e., participants falsely diagnosed with dementia by the algorithm) were older, with higher Charlson comorbidity index, more likely to have mild cognitive impairment (MCI), and longer follow-up (versus true negatives). False negatives (versus true positives) were older, more likely to have MCI, or have more functional limitations.
We observed a moderate-high performance of the code-based diagnosis method against the population-based MCSA reference standard dementia diagnosis. Older participants and those with MCI at baseline were more likely to be misclassified.
已经开发出多种算法,这些算法通过结合电子健康记录(EHR)中广泛可用的计费代码和药物数据来识别痴呆症,其性能各不相同。如果能够明确识别出错误分类患者的特征,那么可能可以修改现有的算法来提高性能。
检查基于代码的算法在人群为基础的 Mayo 诊所衰老研究(MCSA)中识别痴呆症病例的性能,该研究通过常规随访积极评估痴呆症诊断(即参考标准),并描述错误分类患者的特征。
有 5316 名参与者(基线时的年龄(平均值(SD)):73.3(9.68)岁;50.7%为男性)在基线时没有痴呆症且具有 EHR 数据。在 MCSA 痴呆症诊断或最后一次随访之前和之后一年之间,提取 ICD-9/10 代码和用于痴呆症的处方药物。使用 Fisher 精确检验或 Kruskal-Wallis 检验比较组间的特征。
算法的敏感性和特异性分别为 0.70(95%CI:0.67,0.74)和 0.95(95%CI:0.95,0.96)。假阳性(即,算法错误诊断为痴呆症的参与者)年龄较大,Charlson 合并症指数较高,更有可能患有轻度认知障碍(MCI),且随访时间更长(与真阴性相比)。假阴性(与真阳性相比)年龄较大,更有可能患有 MCI 或有更多的功能障碍。
我们观察到基于代码的诊断方法对基于人群的 MCSA 参考标准痴呆症诊断具有较高的性能。年龄较大的参与者和基线时患有 MCI 的参与者更有可能被错误分类。