通过语言模型和本体论实现更便于患者理解的临床记录。

Towards more patient friendly clinical notes through language models and ontologies.

机构信息

Babylon Health, London, UK.

出版信息

AMIA Annu Symp Proc. 2022 Feb 21;2021:881-890. eCollection 2021.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8861686/

Abstract

Clinical notes are an efficient way to record patient information but are notoriously hard to decipher for non-experts. Automatically simplifying medical text can empower patients with valuable information about their health, while saving clinicians time. We present a novel approach to automated simplification of medical text based on word frequencies and language modelling, grounded on medical ontologies enriched with layman terms. We release a new dataset of pairs of publicly available medical sentences and a version of them simplified by clinicians. Also, we define a novel text simplification metric and evaluation framework, which we use to conduct a large-scale human evaluation of our method against the state of the art. Our method based on a language model trained on medical forum data generates simpler sentences while preserving both grammar and the original meaning, surpassing the current state of the art.

摘要

临床笔记是记录患者信息的有效方式，但对于非专业人员来说，这些笔记通常很难理解。自动简化医学文本可以为患者提供有关其健康状况的有价值信息，同时为临床医生节省时间。我们提出了一种基于词汇频率和语言模型的新型医学文本自动简化方法，该方法基于医学本体论和外行人术语。我们发布了一个新的数据集，其中包含一对公开可用的医学句子及其由临床医生简化的版本。此外，我们还定义了一种新的文本简化度量和评估框架，我们使用该框架对我们的方法与现有技术进行了大规模的人工评估。我们的方法基于在医学论坛数据上训练的语言模型生成更简单的句子，同时保留语法和原始含义，超过了现有技术的水平。

相似文献

Towards more patient friendly clinical notes through language models and ontologies.

AMIA Annu Symp Proc. 2022 Feb 21;2021:881-890. eCollection 2021.

A comparison of word embeddings for the biomedical natural language processing.

J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.

Estimating redundancy in clinical text.

J Biomed Inform. 2021 Dec;124:103938. doi: 10.1016/j.jbi.2021.103938. Epub 2021 Oct 23.

Aligned-Layer Text Search in Clinical Notes.

Stud Health Technol Inform. 2017;245:629-633.

Automated identification of wound information in clinical notes of patients with heart diseases: Developing and validating a natural language processing application.

Int J Nurs Stud. 2016 Dec;64:25-31. doi: 10.1016/j.ijnurstu.2016.09.013. Epub 2016 Sep 19.

Syntactic parsing of clinical text: guideline and corpus development with handling ill-formed sentences.

J Am Med Inform Assoc. 2013 Nov-Dec;20(6):1168-77. doi: 10.1136/amiajnl-2013-001810. Epub 2013 Aug 1.

Domain adaption of parsing for operative notes.

J Biomed Inform. 2015 Apr;54:1-9. doi: 10.1016/j.jbi.2015.01.016. Epub 2015 Feb 7.

Voice-dictated versus typed-in clinician notes: linguistic properties and the potential implications on natural language processing.

AMIA Annu Symp Proc. 2011;2011:1630-8. Epub 2011 Oct 22.

Identifying Diabetes in Clinical Notes in Hebrew: A Novel Text Classification Approach Based on Word Embedding.

Stud Health Technol Inform. 2019 Aug 21;264:393-397. doi: 10.3233/SHTI190250.

User evaluation of the effects of a text simplification algorithm using term familiarity on perception, understanding, learning, and information retention.

J Med Internet Res. 2013 Jul 31;15(7):e144. doi: 10.2196/jmir.2569.

引用本文的文献

MedReadCtrl: Personalizing medical text generation with readability-controlled instruction learning.

medRxiv. 2025 Jul 11:2025.07.09.25331239. doi: 10.1101/2025.07.09.25331239.

Improving Clinical Documentation with Artificial Intelligence: A Systematic Review.

Perspect Health Inf Manag. 2024 Jun 1;21(2):1d. eCollection 2024 Summer-Fall.

[Teaching Concept Hanover : Digitally integrated teaching for medical students at the University Clinic for Ophthalmology of the Hanover Medical School].

Ophthalmologie. 2025 Mar;122(3):201-209. doi: 10.1007/s00347-024-02170-x. Epub 2025 Jan 15.

A framework for human evaluation of large language models in healthcare derived from literature review.

NPJ Digit Med. 2024 Sep 28;7(1):258. doi: 10.1038/s41746-024-01258-7.

本文引用的文献

Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources.

Nucleic Acids Res. 2019 Jan 8;47(D1):D1018-D1027. doi: 10.1093/nar/gky1105.

Plain-language medical vocabulary for precision diagnosis.

Nat Genet. 2018 Apr;50(4):474-476. doi: 10.1038/s41588-018-0096-x.

MIMIC-III, a freely accessible critical care database.

Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.

Amazon's Mechanical Turk: A New Source of Inexpensive, Yet High-Quality, Data?

Perspect Psychol Sci. 2011 Jan;6(1):3-5. doi: 10.1177/1745691610393980. Epub 2011 Feb 3.

Entity linking for biomedical literature.

BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S4. doi: 10.1186/1472-6947-15-S1-S4. Epub 2015 May 20.

A classification of errors in lay comprehension of medical documents.

J Biomed Inform. 2012 Dec;45(6):1151-63. doi: 10.1016/j.jbi.2012.07.012. Epub 2012 Aug 20.

2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.

Exploring and developing consumer health vocabularies.

J Am Med Inform Assoc. 2006 Jan-Feb;13(1):24-9. doi: 10.1197/jamia.M1761. Epub 2005 Oct 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过语言模型和本体论实现更便于患者理解的临床记录。

Towards more patient friendly clinical notes through language models and ontologies.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献