Hendrix Nathaniel, Parikh Rishi V, Taskier Madeline, Walter Grace, Rochlin Ilia, Saydah Sharon, Koumans Emilia H, Rincón-Guevara Oscar, Rehkopf David H, Phillips Robert L
Center for Professionalism and Value in Health Care, American Board of Family Medicine, Washington, District of Columbia, United States of America.
Department of Epidemiology and Population Health, Stanford School of Medicine, Palo Alto, California, United States of America.
PLoS One. 2025 May 16;20(5):e0324017. doi: 10.1371/journal.pone.0324017. eCollection 2025.
Post-COVID conditions (PCC) have proven difficult to diagnose. In this retrospective observational study, we aimed to characterize the level of variation in PCC diagnoses observed across clinicians from a number of methodological angles and to determine whether natural language classifiers trained on clinical notes can reconcile differences in diagnostic definitions.
We used data from 519 primary care clinics around the United States who were in the American Family Cohort registry between October 1, 2021 (when the ICD-10 code for PCC was activated) and November 1, 2023. There were 6,116 patients with a diagnostic code for PCC (U09.9), and 5,020 with diagnostic codes for both PCC and COVID-19. We explored these data using 4 different outcomes: 1) Time between COVID-19 and PCC diagnostic codes; 2) Count of patients with PCC diagnostic codes per clinician; 3) Patient-specific probability of PCC diagnostic code based on patient and clinician characteristics; and 4) Performance of a natural language classifier trained on notes from 5,000 patients annotated by two physicians to indicate probable PCC.
Of patients with diagnostic codes for PCC and COVID-19, 61.3% were diagnosed with PCC less than 12 weeks after initial recorded COVID-19. Clinicians in the top 1% of diagnostic propensity accounted for more than a third of all PCC diagnoses (35.8%). Comparing LASSO logistic regressions predicting documentation of PCC diagnosis, a log-likelihood test showed significantly better fit when clinician and practice site indicators were included (p < 0.0001). Inter-rater agreement between physician annotators on PCC diagnosis was moderate (Cohen's kappa: 0.60), and performance of the natural language classifiers was marginal (best AUC: 0.724, 95% credible interval: 0.555-0.878).
We found evidence of substantial disagreement between clinicians on diagnostic criteria for PCC. The variation in diagnostic rates across clinicians points to the possibilities of under- and over-diagnosis for patients.
新冠后状况(PCC)已被证明难以诊断。在这项回顾性观察研究中,我们旨在从多个方法学角度描述不同临床医生对PCC诊断的差异程度,并确定基于临床记录训练的自然语言分类器是否能够协调诊断定义上的差异。
我们使用了来自美国519家初级保健诊所的数据,这些诊所均参与了美国家庭队列登记,时间跨度为2021年10月1日(PCC的ICD - 10编码启用之时)至2023年11月1日。有6116例患者有PCC诊断代码(U09.9),5020例患者同时有PCC和新冠诊断代码。我们使用4种不同的结果对这些数据进行了探究:1)新冠诊断代码与PCC诊断代码之间的时间间隔;2)每位临床医生诊断为PCC的患者数量;3)基于患者和临床医生特征的患者特定PCC诊断代码概率;4)一个自然语言分类器的性能,该分类器基于5000例由两名医生标注为可能患有PCC的患者的记录进行训练。
在有PCC和新冠诊断代码的患者中,61.3%在首次记录新冠后不到12周被诊断为PCC。诊断倾向处于前1%的临床医生做出的PCC诊断占所有PCC诊断的三分之一以上(35.8%)。比较预测PCC诊断记录的LASSO逻辑回归,当纳入临床医生和执业地点指标时,对数似然检验显示拟合度显著更好(p < 0.0001)。医生标注者之间关于PCC诊断的评分者间一致性为中等(科恩kappa系数:0.60),自然语言分类器的性能一般(最佳AUC:0.724,95%可信区间:0.555 - 0.878)。
我们发现临床医生在PCC诊断标准上存在重大分歧的证据。不同临床医生诊断率的差异表明患者存在诊断不足和诊断过度的可能性。