Suppr超能文献

通过自然语言处理弥合更年期状态分类中的信息差距。

Bridging information gaps in menopause status classification through natural language processing.

作者信息

Eyre Hannah, Alba Patrick R, Gibson Carolyn J, Gatsby Elise, Lynch Kristine E, Patterson Olga V, DuVall Scott L

机构信息

VA Informatics and Computing Infrastructure, VA Salt Lake City Health Care System, Salt Lake City, UT 84113, United States.

Department of Internal Medicine, School of Medicine, University of Utah, Salt Lake City, UT 84112, United States.

出版信息

JAMIA Open. 2024 Feb 9;7(1):ooae013. doi: 10.1093/jamiaopen/ooae013. eCollection 2024 Apr.

Abstract

OBJECTIVE

To use natural language processing (NLP) of clinical notes to augment existing structured electronic health record (EHR) data for classification of a patient's menopausal status.

MATERIALS AND METHODS

A rule-based NLP system was designed to capture evidence of a patient's menopause status including dates of a patient's last menstrual period, reproductive surgeries, and postmenopause diagnosis as well as their use of birth control and menstrual interruptions. NLP-derived output was used in combination with structured EHR data to classify a patient's menopausal status. NLP processing and patient classification were performed on a cohort of 307 512 female Veterans receiving healthcare at the US Department of Veterans Affairs (VA).

RESULTS

NLP was validated at 99.6% precision. Including the NLP-derived data into a menopause phenotype increased the number of patients with data relevant to their menopausal status by 118%. Using structured codes alone, 81 173 (27.0%) are able to be classified as postmenopausal or premenopausal. However, with the inclusion of NLP, this number increased 167 804 (54.6%) patients. The premenopausal category grew by 532.7% with the inclusion of NLP data.

DISCUSSION

By employing NLP, it became possible to identify documented data elements that predate VA care, originate outside VA networks, or have no corresponding structured field in the VA EHR that would be otherwise inaccessible for further analysis.

CONCLUSION

NLP can be used to identify concepts relevant to a patient's menopausal status in clinical notes. Adding NLP-derived data to an algorithm classifying a patient's menopausal status significantly increases the number of patients classified using EHR data, ultimately enabling more detailed assessments of the impact of menopause on health outcomes.

摘要

目的

利用临床记录的自然语言处理(NLP)来扩充现有的结构化电子健康记录(EHR)数据,以对患者的绝经状态进行分类。

材料与方法

设计了一个基于规则的NLP系统,以获取患者绝经状态的证据,包括患者末次月经日期、生殖手术、绝经后诊断以及她们使用节育措施和月经中断情况。NLP得出的输出结果与结构化EHR数据结合使用,以对患者的绝经状态进行分类。对在美国退伍军人事务部(VA)接受医疗保健的307512名女性退伍军人队列进行了NLP处理和患者分类。

结果

NLP的验证精度为99.6%。将NLP得出的数据纳入绝经表型,使具有与其绝经状态相关数据的患者数量增加了118%。仅使用结构化编码时,81173名(27.0%)患者能够被分类为绝经后或绝经前。然而,纳入NLP后,这一数字增加到了167804名(54.6%)患者。纳入NLP数据后,绝经前类别增加了532.7%。

讨论

通过采用NLP,能够识别早于VA医疗护理记录的数据元素、源自VA网络之外的数据元素,或者在VA EHR中没有相应结构化字段的数据元素,否则这些数据将无法进行进一步分析。

结论

NLP可用于识别临床记录中与患者绝经状态相关的概念。将NLP得出的数据添加到对患者绝经状态进行分类的算法中,可显著增加使用EHR数据分类的患者数量,最终能够更详细地评估绝经对健康结果的影响。

相似文献

本文引用的文献

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验