• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

英国国民健康服务研究医院的自由文本分析平台部署:伦敦大学学院医院的CogStack

Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals.

作者信息

Noor Kawsar, Roguski Lukasz, Bai Xi, Handy Alex, Klapaukh Roman, Folarin Amos, Romao Luis, Matteson Joshua, Lea Nathan, Zhu Leilei, Asselbergs Folkert W, Wong Wai Keong, Shah Anoop, Dobson Richard Jb

机构信息

University College London, London, United Kingdom.

Institute of Health Informatics, University College London, London, United Kingdom.

出版信息

JMIR Med Inform. 2022 Aug 24;10(8):e38122. doi: 10.2196/38122.

DOI:10.2196/38122
PMID:36001371
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9453582/
Abstract

BACKGROUND

As more health care organizations transition to using electronic health record (EHR) systems, it is important for these organizations to maximize the secondary use of their data to support service improvement and clinical research. These organizations will find it challenging to have systems capable of harnessing the unstructured data fields in the record (clinical notes, letters, etc) and more practically have such systems interact with all of the hospital data systems (legacy and current).

OBJECTIVE

We describe the deployment of the EHR interfacing information extraction and retrieval platform CogStack at University College London Hospitals (UCLH).

METHODS

At UCLH, we have deployed the CogStack platform, an information retrieval platform with natural language processing capabilities. The platform addresses the problem of data ingestion and harmonization from multiple data sources using the Apache NiFi module for managing complex data flows. The platform also facilitates the extraction of structured data from free-text records through use of the MedCAT natural language processing library. Finally, data science tools are made available to support data scientists and the development of downstream applications dependent upon data ingested and analyzed by CogStack.

RESULTS

The platform has been deployed at the hospital, and in particular, it has facilitated a number of research and service evaluation projects. To date, we have processed over 30 million records, and the insights produced from CogStack have informed a number of clinical research use cases at the hospital.

CONCLUSIONS

The CogStack platform can be configured to handle the data ingestion and harmonization challenges faced by a hospital. More importantly, the platform enables the hospital to unlock important clinical information from the unstructured portion of the record using natural language processing technology.

摘要

背景

随着越来越多的医疗保健机构转向使用电子健康记录(EHR)系统,这些机构将其数据的二次利用最大化以支持服务改进和临床研究变得至关重要。这些机构会发现,要拥有能够利用记录中的非结构化数据字段(临床笔记、信件等)的系统,并且更实际地让此类系统与所有医院数据系统(传统系统和当前系统)进行交互具有挑战性。

目的

我们描述了电子健康记录接口信息提取和检索平台CogStack在伦敦大学学院医院(UCLH)的部署情况。

方法

在UCLH,我们部署了CogStack平台,这是一个具有自然语言处理能力的信息检索平台。该平台使用Apache NiFi模块来管理复杂数据流,解决了从多个数据源摄取和协调数据的问题。该平台还通过使用MedCAT自然语言处理库,促进了从自由文本记录中提取结构化数据。最后,提供了数据科学工具来支持数据科学家以及依赖CogStack摄取和分析的数据开发下游应用程序。

结果

该平台已在医院部署,特别是,它推动了许多研究和服务评估项目。迄今为止,我们已经处理了超过3000万条记录,并且CogStack产生的见解为医院的许多临床研究用例提供了依据。

结论

CogStack平台可以配置为处理医院面临的数据摄取和协调挑战。更重要的是,该平台使医院能够使用自然语言处理技术从记录的非结构化部分解锁重要的临床信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8735/9453582/37527977f331/medinform_v10i8e38122_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8735/9453582/8098a1bff29e/medinform_v10i8e38122_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8735/9453582/37527977f331/medinform_v10i8e38122_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8735/9453582/8098a1bff29e/medinform_v10i8e38122_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8735/9453582/37527977f331/medinform_v10i8e38122_fig2.jpg

相似文献

1
Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals.英国国民健康服务研究医院的自由文本分析平台部署:伦敦大学学院医院的CogStack
JMIR Med Inform. 2022 Aug 24;10(8):e38122. doi: 10.2196/38122.
2
SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research.SemEHR:一个通用的语义搜索系统,用于从临床记录中提取语义数据,以提供个性化护理、临床试验招募和临床研究。
J Am Med Inform Assoc. 2018 May 1;25(5):530-537. doi: 10.1093/jamia/ocx160.
3
CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital.CogStack-在大型国民保健制度基金会信托医院中部署集成信息检索和提取服务的经验。
BMC Med Inform Decis Mak. 2018 Jun 25;18(1):47. doi: 10.1186/s12911-018-0623-9.
4
Multi-domain clinical natural language processing with MedCAT: The Medical Concept Annotation Toolkit.多领域临床自然语言处理与 MedCAT:医学概念标注工具包。
Artif Intell Med. 2021 Jul;117:102083. doi: 10.1016/j.artmed.2021.102083. Epub 2021 May 1.
5
Text mining occupations from the mental health electronic health record: a natural language processing approach using records from the Clinical Record Interactive Search (CRIS) platform in south London, UK.从精神健康电子健康记录中挖掘文本职业信息:使用英国伦敦南部临床记录交互检索(CRIS)平台记录的自然语言处理方法。
BMJ Open. 2021 Mar 25;11(3):e042274. doi: 10.1136/bmjopen-2020-042274.
6
Hospital-wide natural language processing summarising the health data of 1 million patients.全院范围的自然语言处理对100万名患者的健康数据进行汇总。
PLOS Digit Health. 2023 May 9;2(5):e0000218. doi: 10.1371/journal.pdig.0000218. eCollection 2023 May.
7
Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack.基于电子健康记录使用CogStack实现实时精神病风险检测与警报系统。
J Vis Exp. 2020 May 15(159). doi: 10.3791/60794.
8
Foresight-a generative pretrained transformer for modelling of patient timelines using electronic health records: a retrospective modelling study.利用电子健康记录对患者时间线进行建模的生成式预训练转换器 Foresight:一项回顾性建模研究。
Lancet Digit Health. 2024 Apr;6(4):e281-e290. doi: 10.1016/S2589-7500(24)00025-6.
9
Natural Language Processing for Mimicking Clinical Trial Recruitment in Critical Care: A Semi-Automated Simulation Based on the LeoPARDS Trial.自然语言处理在模拟重症监护临床试验招募中的应用:基于 LeoPARDS 试验的半自动化模拟。
IEEE J Biomed Health Inform. 2020 Oct;24(10):2950-2959. doi: 10.1109/JBHI.2020.2977925. Epub 2020 Mar 9.
10
Natural language processing for disease phenotyping in UK primary care records for research: a pilot study in myocardial infarction and death.用于研究的英国初级医疗记录中疾病表型的自然语言处理:心肌梗死和死亡的一项试点研究
J Biomed Semantics. 2019 Nov 12;10(Suppl 1):20. doi: 10.1186/s13326-019-0214-4.

引用本文的文献

1
AI assisted prediction of unplanned intensive care admissions using natural language processing in elective neurosurgery.利用自然语言处理技术在择期神经外科手术中进行人工智能辅助预测非计划重症监护病房入院情况
NPJ Digit Med. 2025 Aug 27;8(1):549. doi: 10.1038/s41746-025-01952-0.
2
Concept Recognition and Characterization of Patients Undergoing Resection of Vestibular Schwannoma Using Natural Language Processing.使用自然语言处理对前庭神经鞘瘤切除患者进行概念识别与特征描述
J Neurol Surg B Skull Base. 2024 May 11;86(3):332-341. doi: 10.1055/s-0044-1786738. eCollection 2025 Jun.
3
Engineering of Generative Artificial Intelligence and Natural Language Processing Models to Accurately Identify Arrhythmia Recurrence.

本文引用的文献

1
Evaluation of antithrombotic use and COVID-19 outcomes in a nationwide atrial fibrillation cohort.评估全国范围内心房颤动队列中抗血栓治疗的使用情况和 COVID-19 结局。
Heart. 2022 May 25;108(12):923-931. doi: 10.1136/heartjnl-2021-320325.
2
Multi-domain clinical natural language processing with MedCAT: The Medical Concept Annotation Toolkit.多领域临床自然语言处理与 MedCAT:医学概念标注工具包。
Artif Intell Med. 2021 Jul;117:102083. doi: 10.1016/j.artmed.2021.102083. Epub 2021 May 1.
3
Data gaps in electronic health record (EHR) systems: An audit of problem list completeness during the COVID-19 pandemic.
用于准确识别心律失常复发的生成式人工智能和自然语言处理模型的工程设计。
Circ Arrhythm Electrophysiol. 2025 Jan;18(1):e013023. doi: 10.1161/CIRCEP.124.013023. Epub 2024 Dec 16.
4
Early evaluation of a natural language processing tool to improve access to educational resources for surgical patients.早期评估自然语言处理工具,以改善手术患者获取教育资源的途径。
Eur Spine J. 2024 Jul;33(7):2545-2552. doi: 10.1007/s00586-024-08315-5. Epub 2024 May 30.
5
Neural machine translation of clinical text: an empirical investigation into multilingual pre-trained language models and transfer-learning.临床文本的神经机器翻译:对多语言预训练语言模型和迁移学习的实证研究。
Front Digit Health. 2024 Feb 26;6:1211564. doi: 10.3389/fdgth.2024.1211564. eCollection 2024.
6
Optimizing artificial intelligence in sepsis management: Opportunities in the present and looking closely to the future.优化脓毒症管理中的人工智能:当前机遇与展望未来
J Intensive Med. 2023 Nov 29;4(1):34-45. doi: 10.1016/j.jointm.2023.10.001. eCollection 2024 Jan.
7
New Horizons in artificial intelligence in the healthcare of older people.人工智能在老年人医疗保健领域的新进展。
Age Ageing. 2023 Dec 1;52(12). doi: 10.1093/ageing/afad219.
8
Long Covid symptoms and diagnosis in primary care: A cohort study using structured and unstructured data in The Health Improvement Network primary care database.长新冠症状和初级保健诊断:使用健康改善网络初级保健数据库中的结构化和非结构化数据的队列研究。
PLoS One. 2023 Sep 26;18(9):e0290583. doi: 10.1371/journal.pone.0290583. eCollection 2023.
9
Routine healthcare data for adverse events after spinal surgery.脊柱手术后不良事件的常规医疗保健数据。
Acta Neurochir (Wien). 2023 Dec;165(12):3991-3992. doi: 10.1007/s00701-023-05783-4. Epub 2023 Sep 2.
10
Breaking Bias: The Role of Artificial Intelligence in Improving Clinical Decision-Making.打破偏见:人工智能在改善临床决策中的作用。
Cureus. 2023 Mar 20;15(3):e36415. doi: 10.7759/cureus.36415. eCollection 2023 Mar.
电子健康记录 (EHR) 系统中的数据空白:COVID-19 大流行期间问题清单完整性的审核。
Int J Med Inform. 2021 Jun;150:104452. doi: 10.1016/j.ijmedinf.2021.104452. Epub 2021 Apr 1.
4
Angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers are not associated with severe COVID-19 infection in a multi-site UK acute hospital trust.血管紧张素转换酶抑制剂和血管紧张素 II 受体阻滞剂与英国多地点急性医院信托的严重 COVID-19 感染无关。
Eur J Heart Fail. 2020 Jun;22(6):967-974. doi: 10.1002/ejhf.1924. Epub 2020 Jul 7.
5
Protected Health Information filter (Philter): accurately and securely de-identifying free-text clinical notes.受保护的健康信息过滤器(Philter):准确且安全地去除自由文本临床记录中的身份标识信息。
NPJ Digit Med. 2020 Apr 14;3:57. doi: 10.1038/s41746-020-0258-y. eCollection 2020.
6
Natural Language Processing for Mimicking Clinical Trial Recruitment in Critical Care: A Semi-Automated Simulation Based on the LeoPARDS Trial.自然语言处理在模拟重症监护临床试验招募中的应用:基于 LeoPARDS 试验的半自动化模拟。
IEEE J Biomed Health Inform. 2020 Oct;24(10):2950-2959. doi: 10.1109/JBHI.2020.2977925. Epub 2020 Mar 9.
7
Artificial intelligence approaches using natural language processing to advance EHR-based clinical research.利用自然语言处理技术的人工智能方法来推进基于电子健康记录的临床研究。
J Allergy Clin Immunol. 2020 Feb;145(2):463-469. doi: 10.1016/j.jaci.2019.12.897. Epub 2019 Dec 26.
8
A frame semantic overview of NLP-based information extraction for cancer-related EHR notes.基于框架语义的自然语言处理信息抽取在癌症相关电子病历中的应用综述。
J Biomed Inform. 2019 Dec;100:103301. doi: 10.1016/j.jbi.2019.103301. Epub 2019 Oct 4.
9
The Evolving Use of Electronic Health Records (EHR) for Research.电子健康记录(EHR)在研究中的应用不断发展。
Semin Radiat Oncol. 2019 Oct;29(4):354-361. doi: 10.1016/j.semradonc.2019.05.010.
10
CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital.CogStack-在大型国民保健制度基金会信托医院中部署集成信息检索和提取服务的经验。
BMC Med Inform Decis Mak. 2018 Jun 25;18(1):47. doi: 10.1186/s12911-018-0623-9.