Center for Research Informatics, University of Chicago, Chicago, Illinois, USA.
Department of Medicine, University of Chicago, Chicago, Illinois, USA.
J Am Med Inform Assoc. 2021 Jan 15;28(1):104-112. doi: 10.1093/jamia/ocaa220.
OBJECTIVE: Adherence to a treatment plan from HIV-positive patients is necessary to decrease their mortality and improve their quality of life, however some patients display poor appointment adherence and become lost to follow-up (LTFU). We applied natural language processing (NLP) to analyze indications towards or against LTFU in HIV-positive patients' notes. MATERIALS AND METHODS: Unstructured lemmatized notes were labeled with an LTFU or Retained status using a 183-day threshold. An NLP and supervised machine learning system with a linear model and elastic net regularization was trained to predict this status. Prevalence of characteristics domains in the learned model weights were evaluated. RESULTS: We analyzed 838 LTFU vs 2964 Retained notes and obtained a weighted F1 mean of 0.912 via nested cross-validation; another experiment with notes from the same patients in both classes showed substantially lower metrics. "Comorbidities" were associated with LTFU through, for instance, "HCV" (hepatitis C virus) and likewise "Good adherence" with Retained, represented with "Well on ART" (antiretroviral therapy). DISCUSSION: Mentions of mental health disorders and substance use were associated with disparate retention outcomes, however history vs active use was not investigated. There remains further need to model transitions between LTFU and being retained in care over time. CONCLUSION: We provided an important step for the future development of a model that could eventually help to identify patients who are at risk for falling out of care and to analyze which characteristics could be factors for this. Further research is needed to enhance this method with structured electronic medical record fields.
目的:HIV 阳性患者坚持治疗方案对于降低死亡率和提高生活质量至关重要,但有些患者的预约依从性较差,导致失访(LTFU)。我们应用自然语言处理(NLP)分析 HIV 阳性患者病历中提示 LTFU 的特征。
材料和方法:使用 183 天的时间阈值,对未结构化的词干化病历进行 LTFU 或保留状态的标记。采用线性模型和弹性网络正则化的 NLP 和监督机器学习系统来训练预测该状态。评估所学到的模型权重中特征域的普遍性。
结果:我们分析了 838 例 LTFU 与 2964 例保留病历,通过嵌套交叉验证获得加权 F1 均值为 0.912;对同一患者两类病例的另一项实验显示,指标值明显较低。“合并症”与 LTFU 相关,例如“HCV”(丙型肝炎病毒),而“良好的依从性”与保留相关,表现为“ART 治疗效果良好”。
讨论:心理健康障碍和药物使用的提及与不同的保留结果相关,但未研究病史与当前使用情况的关系。仍然需要进一步建立模型,以模拟随时间推移的 LTFU 和持续治疗之间的转换。
结论:我们为未来开发模型迈出了重要一步,该模型最终可能有助于识别有脱离治疗风险的患者,并分析哪些特征可能是导致这种情况的因素。需要进一步研究,以利用结构化电子病历字段增强这种方法。
J Am Med Inform Assoc. 2021-1-15
J Acquir Immune Defic Syndr. 2025-4-3
Curr HIV/AIDS Rep. 2021-6
Infect Dis Obstet Gynecol. 2019-2-14
Health Psychol. 2018-6
Lancet HIV. 2017-11
J Acquir Immune Defic Syndr. 2018-2-1
AIDS Behav. 2018-3