• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从临床记录中自动提取旅行史以用于传染病事件的检测:算法的开发和验证。

Automated Travel History Extraction From Clinical Notes for Informing the Detection of Emergent Infectious Disease Events: Algorithm Development and Validation.

机构信息

VA Salt Lake City Health Care System, US Department of Veterans Affairs, Salt Lake City, UT, United States.

Division of Epidemiology, Department of Internal Medicine, University of Utah, Salt Lake City, UT, United States.

出版信息

JMIR Public Health Surveill. 2021 Mar 24;7(3):e26719. doi: 10.2196/26719.

DOI:10.2196/26719
PMID:33759790
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7993087/
Abstract

BACKGROUND

Patient travel history can be crucial in evaluating evolving infectious disease events. Such information can be challenging to acquire in electronic health records, as it is often available only in unstructured text.

OBJECTIVE

This study aims to assess the feasibility of annotating and automatically extracting travel history mentions from unstructured clinical documents in the Department of Veterans Affairs across disparate health care facilities and among millions of patients. Information about travel exposure augments existing surveillance applications for increased preparedness in responding quickly to public health threats.

METHODS

Clinical documents related to arboviral disease were annotated following selection using a semiautomated bootstrapping process. Using annotated instances as training data, models were developed to extract from unstructured clinical text any mention of affirmed travel locations outside of the continental United States. Automated text processing models were evaluated, involving machine learning and neural language models for extraction accuracy.

RESULTS

Among 4584 annotated instances, 2659 (58%) contained an affirmed mention of travel history, while 347 (7.6%) were negated. Interannotator agreement resulted in a document-level Cohen kappa of 0.776. Automated text processing accuracy (F1 85.6, 95% CI 82.5-87.9) and computational burden were acceptable such that the system can provide a rapid screen for public health events.

CONCLUSIONS

Automated extraction of patient travel history from clinical documents is feasible for enhanced passive surveillance public health systems. Without such a system, it would usually be necessary to manually review charts to identify recent travel or lack of travel, use an electronic health record that enforces travel history documentation, or ignore this potential source of information altogether. The development of this tool was initially motivated by emergent arboviral diseases. More recently, this system was used in the early phases of response to COVID-19 in the United States, although its utility was limited to a relatively brief window due to the rapid domestic spread of the virus. Such systems may aid future efforts to prevent and contain the spread of infectious diseases.

摘要

背景

在评估不断演变的传染病事件时,患者的旅行史可能至关重要。由于此类信息通常仅以非结构化文本形式存在,因此在电子健康记录中获取这些信息具有一定挑战性。

目的

本研究旨在评估在不同医疗保健机构和数百万患者的退伍军人事务部非结构化临床文档中,对旅行史记录进行注释和自动提取的可行性。有关旅行暴露的信息可增强现有的监测应用程序,以快速应对公共卫生威胁,提高应对准备水平。

方法

使用半自动引导过程选择临床文档后,对与虫媒病毒病相关的文档进行注释。使用标注实例作为训练数据,开发模型以从非结构化临床文本中提取任何确认的美国大陆以外旅行地点的提及。评估了自动化文本处理模型,包括用于提取准确性的机器学习和神经语言模型。

结果

在 4584 个注释实例中,有 2659 个(58%)包含肯定的旅行史提及,而有 347 个(7.6%)被否定。文档级别的两位评估者间一致性 Cohen kappa 值为 0.776。自动化文本处理的准确性(F1 85.6,95%CI 82.5-87.9)和计算负担是可以接受的,因此该系统可以快速筛查公共卫生事件。

结论

从临床文档中自动提取患者旅行史对于增强被动监测公共卫生系统是可行的。如果没有这样的系统,通常需要手动查看图表以确定最近的旅行或缺乏旅行,使用强制记录旅行史的电子健康记录,或者完全忽略此潜在信息来源。该工具的开发最初是由新兴的虫媒病毒病驱动的。最近,在美国 COVID-19 疫情早期阶段,该系统被用于响应,但由于病毒在国内迅速传播,其用途仅限于相对较短的时间窗口。此类系统可能有助于未来预防和控制传染病的传播。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4115/7993087/3ea1f23766a4/publichealth_v7i3e26719_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4115/7993087/4a5a49be39cf/publichealth_v7i3e26719_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4115/7993087/3c777d110d92/publichealth_v7i3e26719_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4115/7993087/3ea1f23766a4/publichealth_v7i3e26719_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4115/7993087/4a5a49be39cf/publichealth_v7i3e26719_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4115/7993087/3c777d110d92/publichealth_v7i3e26719_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4115/7993087/3ea1f23766a4/publichealth_v7i3e26719_fig3.jpg

相似文献

1
Automated Travel History Extraction From Clinical Notes for Informing the Detection of Emergent Infectious Disease Events: Algorithm Development and Validation.从临床记录中自动提取旅行史以用于传染病事件的检测:算法的开发和验证。
JMIR Public Health Surveill. 2021 Mar 24;7(3):e26719. doi: 10.2196/26719.
2
[A customized method for information extraction from unstructured text data in the electronic medical records].[一种从电子病历非结构化文本数据中提取信息的定制方法]
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.
3
Task definition, annotated dataset, and supervised natural language processing models for symptom extraction from unstructured clinical notes.从非结构化临床记录中提取症状的任务定义、标注数据集和监督自然语言处理模型。
J Biomed Inform. 2020 Feb;102:103354. doi: 10.1016/j.jbi.2019.103354. Epub 2019 Dec 12.
4
Information extraction from multi-institutional radiology reports.从多机构放射学报告中提取信息。
Artif Intell Med. 2016 Jan;66:29-39. doi: 10.1016/j.artmed.2015.09.007. Epub 2015 Oct 3.
5
Optimizing A syndromic surveillance text classifier for influenza-like illness: Does document source matter?优化用于流感样疾病的症状监测文本分类器:文档来源重要吗?
AMIA Annu Symp Proc. 2008 Nov 6;2008:692-6.
6
Data for registry and quality review can be retrospectively collected using natural language processing from unstructured charts of arthroplasty patients.可以使用自然语言处理从关节置换患者的非结构化图表中回顾性地收集注册和质量审查数据。
Bone Joint J. 2020 Jul;102-B(7_Supple_B):99-104. doi: 10.1302/0301-620X.102B7.BJJ-2019-1574.R1.
7
Validation of a Natural Language Processing Algorithm for Detecting Infectious Disease Symptoms in Primary Care Electronic Medical Records in Singapore.用于检测新加坡基层医疗电子病历中传染病症状的自然语言处理算法的验证
JMIR Med Inform. 2018 Jun 11;6(2):e36. doi: 10.2196/medinform.8204.
8
A method for cohort selection of cardiovascular disease records from an electronic health record system.一种从电子健康记录系统中选择心血管疾病记录队列的方法。
Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30.
9
Congestive heart failure information extraction framework for automated treatment performance measures assessment.用于自动治疗性能评估的充血性心力衰竭信息提取框架
J Am Med Inform Assoc. 2017 Apr 1;24(e1):e40-e46. doi: 10.1093/jamia/ocw097.
10
Identifying signs and symptoms of urinary tract infection from emergency department clinical notes using large language models.利用大语言模型从急诊科临床记录中识别尿路感染的体征和症状。
Acad Emerg Med. 2024 Jun;31(6):599-610. doi: 10.1111/acem.14883. Epub 2024 Apr 3.

引用本文的文献

1
Health Care Language Models and Their Fine-Tuning for Information Extraction: Scoping Review.医疗保健语言模型及其在信息提取方面的微调:范围综述。
JMIR Med Inform. 2024 Oct 21;12:e60164. doi: 10.2196/60164.
2
Developing Evidence to Support Policy: Protocol for the StrAtegic PoLicy EvIdence-Based Evaluation CeNTer (SALIENT).制定证据以支持政策:战略政策循证评估中心(SALIENT)方案。
JMIR Res Protoc. 2024 Sep 19;13:e59830. doi: 10.2196/59830.
3
Innovations in public health surveillance: An overview of novel use of data and analytic methods.

本文引用的文献

1
A pragmatic guide to geoparsing evaluation: Toponyms, Named Entity Recognition and pragmatics.地理解析评估实用指南:地名、命名实体识别与语用学
Lang Resour Eval. 2020;54(3):683-712. doi: 10.1007/s10579-019-09475-3. Epub 2019 Sep 19.
2
Observations of the global epidemiology of COVID-19 from the prepandemic period using web-based surveillance: a cross-sectional analysis.利用基于网络的监测对大流行前时期 COVID-19 的全球流行病学进行观察:一项横断面分析。
Lancet Infect Dis. 2020 Nov;20(11):1255-1262. doi: 10.1016/S1473-3099(20)30581-8. Epub 2020 Jul 29.
3
Machine Learning to Detect Self-Reporting of Symptoms, Testing Access, and Recovery Associated With COVID-19 on Twitter: Retrospective Big Data Infoveillance Study.
公共卫生监测的创新:数据与分析方法新用途概述
Can Commun Dis Rep. 2024 Apr 30;50(3-4):93-101. doi: 10.14745/ccdr.v50i34a02.
4
Changes to Public Health Surveillance Methods Due to the COVID-19 Pandemic: Scoping Review.因 COVID-19 大流行而改变的公共卫生监测方法:范围综述。
JMIR Public Health Surveill. 2024 Jan 19;10:e49185. doi: 10.2196/49185.
5
Travelers and travel vaccines at six health care systems in the Vaccine Safety Datalink.旅行者和旅行疫苗在六个医疗保健系统中的疫苗安全数据链接。
Vaccine. 2022 Sep 29;40(41):5904-5911. doi: 10.1016/j.vaccine.2022.08.023. Epub 2022 Sep 3.
基于机器学习的方法在推特上检测与 COVID-19 相关的自我报告症状、检测途径和康复情况:回顾性大数据信息监测研究。
JMIR Public Health Surveill. 2020 Jun 8;6(2):e19509. doi: 10.2196/19509.
4
Emergency Response to COVID-19 in Canada: Platform Development and Implementation for eHealth in Crisis Management.加拿大对 COVID-19 的紧急应对:危机管理中的电子健康平台开发与实施。
JMIR Public Health Surveill. 2020 May 15;6(2):e18995. doi: 10.2196/18995.
5
Importance of a Travel History in Evaluation of Respiratory Infections.旅行史在呼吸道感染评估中的重要性。
Curr Emerg Hosp Med Rep. 2016;4(3):141-152. doi: 10.1007/s40138-016-0109-y. Epub 2016 Jul 26.
6
Managing Emerging Infectious Diseases: Should Travel Be the Fifth Vital Sign?管理新发传染病:旅行应该成为第五大生命体征吗?
Ann Intern Med. 2020 Apr 21;172(8):560-561. doi: 10.7326/M20-0643. Epub 2020 Mar 3.
7
Open access epidemiological data from the COVID-19 outbreak.来自新冠疫情的开放获取流行病学数据。
Lancet Infect Dis. 2020 May;20(5):534. doi: 10.1016/S1473-3099(20)30119-5. Epub 2020 Feb 19.
8
Adverse drug events and medication relation extraction in electronic health records with ensemble deep learning methods.基于集成深度学习方法的电子健康记录中的药物不良反应和药物关系提取。
J Am Med Inform Assoc. 2020 Jan 1;27(1):39-46. doi: 10.1093/jamia/ocz101.
9
Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning.利用机器学习从放射学报告中提取完整的结构化信息。
J Digit Imaging. 2019 Aug;32(4):554-564. doi: 10.1007/s10278-019-00234-y.
10
Detecting Adverse Drug Events with Rapidly Trained Classification Models.快速训练的分类模型检测药物不良事件。
Drug Saf. 2019 Jan;42(1):147-156. doi: 10.1007/s40264-018-0763-y.