Cohen Trevor, Xu Weizhe, Guo Yue, Pakhomov Serguei, Leroy Gondy
Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA.
Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA.
J Biomed Inform. 2025 Jan;161:104758. doi: 10.1016/j.jbi.2024.104758. Epub 2024 Dec 9.
Health literacy is a prerequisite to informed health-related decision making. To facilitate understanding of information, text should be presented at an appropriate reading level for the reader. Cognitive studies suggest that the coherence of a text - the interconnectedness between the ideas it expresses - is especially important for low-knowledge readers, who lack the background knowledge to draw inferences from text that is implicitly connected only. Prior work in cognitive science has yielded automated methods to estimate coherence. These methods estimate the proximity between text representations in a semantic vector space, with the underlying idea that units of text that are poorly connected will be further apart in this space. In addition, recent work with large language models (LLMs) has produced probabilistic methodological analogues that have yet to be evaluated for this purpose. This work concerns the relationship between these automated measures and layperson comprehension of biomedical text. To characterize this relationship, we applied a range of automated measures of text coherence to a set of text snippets, some of which were deliberately modified to improve their accessibility in a series of reading comprehension experiments. Results indicate significant associations between reader comprehension - as estimated using multiple-choice questions - and LLM-derived coherence metrics. Interventions designed to improve the comprehensibility of passages also improved their coherence, as measured with the best-performing LLM-derived models and shown by improved reader understanding of the text. These findings support the utility of LLM-derived measures of text coherence as a means to identify gaps in connectedness that make biomedical text difficult for laypeople to understand, with the potential to inform both manual and automated methods to improve the accessibility of the biomedical literature.
健康素养是做出明智的健康相关决策的前提条件。为便于理解信息,文本应以适合读者阅读水平的方式呈现。认知研究表明,文本的连贯性——即其表达的观点之间的相互联系——对于低知识水平的读者尤为重要,他们缺乏背景知识,无法从仅隐含关联的文本中进行推理。认知科学领域的先前工作已经产生了估计连贯性的自动化方法。这些方法估计语义向量空间中文本表示之间的接近度,其基本思想是,连接性差的文本单元在这个空间中会相距更远。此外,最近关于大语言模型(LLM)的研究产生了概率性的方法类似物,但尚未针对此目的进行评估。这项工作关注这些自动化测量与外行人对生物医学文本的理解之间的关系。为了描述这种关系,我们将一系列文本连贯性的自动化测量方法应用于一组文本片段,其中一些文本片段经过刻意修改,以在一系列阅读理解实验中提高其易读性。结果表明,读者理解(通过多项选择题估计)与基于大语言模型得出的连贯性指标之间存在显著关联。旨在提高段落可理解性的干预措施也提高了它们的连贯性,这是通过表现最佳的基于大语言模型得出的模型测量的,并且读者对文本的理解得到了改善也证明了这一点。这些发现支持了基于大语言模型得出的文本连贯性测量方法的实用性,它可以作为一种手段来识别导致生物医学文本让外行人难以理解的连接性差距,有可能为改进生物医学文献易读性的人工和自动化方法提供信息。