MedReadMe：医学领域细粒度句子可读性的系统研究。

MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain.

作者信息

Jiang Chao, Xu Wei

机构信息

College of Computing, Georgia Institute of Technology.

出版信息

Proc Conf Empir Methods Nat Lang Process. 2024 Nov;2024:17293-17319. doi: 10.18653/v1/2024.emnlp-main.958.

DOI:10.18653/v1/2024.emnlp-main.958

PMID:40612445

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12225841/

Abstract

Medical texts are notoriously challenging to read. Properly measuring their readability is the first step towards making them more accessible. In this paper, we present a systematic study on fine-grained readability measurements in the medical domain at both sentence-level and span-level. We introduce a new dataset MedReadMe, which consists of manually annotated readability ratings and fine-grained complex span annotation for 4,520 sentences, featuring two novel "Google-Easy" and "Google-Hard" categories. It supports our quantitative analysis, which covers 650 linguistic features and automatic complex word and jargon identification. Enabled by our high-quality annotation, we benchmark and improve several state-of-the-art sentence-level readability metrics for the medical domain specifically, which include unsupervised, supervised, and prompting-based methods using recently developed large language models (LLMs). Informed by our fine-grained complex span annotation, we find that adding a single feature, capturing the number of jargon spans, into existing readability formulas can significantly improve their correlation with human judgments. We will publicly release the dataset and code.

摘要

医学文本向来极难读懂。准确衡量其易读性是使其更易于理解的第一步。在本文中，我们对医学领域句子层面和跨度层面的细粒度易读性测量进行了系统研究。我们引入了一个新的数据集MedReadMe，它包含对4520个句子的人工标注易读性评分和细粒度复杂跨度标注，具有两个新颖的“谷歌易读”和“谷歌难读”类别。它支持我们的定量分析，该分析涵盖650种语言特征以及自动复杂词和行话识别。借助我们高质量的标注，我们专门对医学领域的几种最先进的句子层面易读性指标进行了基准测试和改进，其中包括使用最近开发的大语言模型（LLMs）的无监督、有监督和基于提示的方法。基于我们细粒度的复杂跨度标注，我们发现，在现有的易读性公式中添加一个捕捉行话跨度数量的单一特征，可以显著提高它们与人类判断的相关性。我们将公开发布数据集和代码。

相似文献

MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain.MedReadMe：医学领域细粒度句子可读性的系统研究。

Proc Conf Empir Methods Nat Lang Process. 2024 Nov;2024:17293-17319. doi: 10.18653/v1/2024.emnlp-main.958.

Artificial Intelligence Shows Limited Success in Improving Readability Levels of Spanish-language Orthopaedic Patient Education Materials.人工智能在提高西班牙语骨科患者教育材料的可读性方面成效有限。

Clin Orthop Relat Res. 2025 Feb 11. doi: 10.1097/CORR.0000000000003413.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病：网络荟萃分析。

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验：定性证据综合。

Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状Meta分析。

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。

Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.

Evaluating and Improving Syndrome Differentiation Thinking Ability in Large Language Models: Method Development Study.评估和提高大语言模型中的辨证思维能力：方法开发研究

JMIR Med Inform. 2025 Jun 20;13:e75103. doi: 10.2196/75103.

The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂（GLP-1 RAs）减肥效果的网状Meta分析的数量、质量及结果：一项范围综述

Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.

Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。

Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。

Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.

本文引用的文献

Personalized Jargon Identification for Enhanced Interdisciplinary Communication.用于加强跨学科交流的个性化术语识别

Proc Conf. 2024 Jun;2024:4535-4550. doi: 10.18653/v1/2024.naacl-long.255.

Fine-tuning large neural language models for biomedical natural language processing.针对生物医学自然语言处理对大型神经语言模型进行微调。

Patterns (N Y). 2023 Apr 14;4(4):100729. doi: 10.1016/j.patter.2023.100729.

MedJEx: A Medical Jargon Extraction Model with Wiki's Hyperlink Span and Contextualized Masked Language Model Score.MedJEx：一种具有维基百科超链接跨度和上下文掩码语言模型评分的医学术语提取模型。

Proc Conf Empir Methods Nat Lang Process. 2022 Dec;2022:11733-11751.

Readability and quality of online information on total ankle arthroplasty.全踝关节置换术在线信息的可读性与质量

Foot (Edinb). 2023 Mar;54:101985. doi: 10.1016/j.foot.2023.101985. Epub 2023 Feb 21.

Quality and readability of online information on plantar fasciitis and calcaneal spur.足底筋膜炎和跟骨骨刺的在线信息的质量和可读性。

Rheumatol Int. 2022 Nov;42(11):1965-1972. doi: 10.1007/s00296-022-05165-6. Epub 2022 Jun 28.

Paragraph-level Simplification of Medical Texts.医学文本的段落级简化

Proc Conf. 2021 Jun;2021:4972-4984. doi: 10.18653/v1/2021.naacl-main.395.

Readability and Variability Among Online Resources for Patella Dislocation: What Patients Are Reading.髌脱位在线资源的可读性和可变性：患者正在阅读的内容。

Orthopedics. 2022 Mar-Apr;45(2):e62-e66. doi: 10.3928/01477447-20220105-09. Epub 2022 Jan 12.

Readability of Patient Education Materials From High-Impact Medical Journals: A 20-Year Analysis.高影响力医学期刊中患者教育材料的可读性：一项为期20年的分析。

J Patient Exp. 2021 Mar 3;8:2374373521998847. doi: 10.1177/2374373521998847. eCollection 2021.

Readability, content, and quality of COVID-19 patient education materials from academic medical centers in the United States.美国学术医学中心的 COVID-19 患者教育材料的可读性、内容和质量。

Am J Infect Control. 2021 Jun;49(6):690-693. doi: 10.1016/j.ajic.2020.11.023. Epub 2020 Nov 28.

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT：一种用于生物医学文本挖掘的预训练生物医学语言表示模型。

Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验