• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面向消费者健康问题答案的问题驱动式总结。

Question-driven summarization of answers to consumer health questions.

机构信息

Lister Hill National Center for Biomedical Communications, U.S. National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

出版信息

Sci Data. 2020 Oct 2;7(1):322. doi: 10.1038/s41597-020-00667-z.

DOI:10.1038/s41597-020-00667-z
PMID:33009402
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7532186/
Abstract

Automatic summarization of natural language is a widely studied area in computer science, one that is broadly applicable to anyone who needs to understand large quantities of information. In the medical domain, automatic summarization has the potential to make health information more accessible to people without medical expertise. However, to evaluate the quality of summaries generated by summarization algorithms, researchers first require gold standard, human generated summaries. Unfortunately there is no available data for the purpose of assessing summaries that help consumers of health information answer their questions. To address this issue, we present the MEDIQA-Answer Summarization dataset, the first dataset designed for question-driven, consumer-focused summarization. It contains 156 health questions asked by consumers, answers to these questions, and manually generated summaries of these answers. The dataset's unique structure allows it to be used for at least eight different types of summarization evaluations. We also benchmark the performance of baseline and state-of-the-art deep learning approaches on the dataset, demonstrating how it can be used to evaluate automatically generated summaries.

摘要

自然语言自动摘要在计算机科学领域是一个备受研究的领域,广泛适用于任何需要理解大量信息的人。在医学领域,自动摘要有可能使没有医学专业知识的人更容易获得健康信息。然而,为了评估摘要算法生成的摘要的质量,研究人员首先需要人工生成的黄金标准摘要。不幸的是,目前没有可用的数据来评估帮助健康信息消费者回答问题的摘要。为了解决这个问题,我们提出了 MEDIQA-Answer Summarization 数据集,这是第一个专门为面向问题、面向消费者的摘要而设计的数据集。它包含 156 个由消费者提出的健康问题、这些问题的答案以及这些答案的手动生成的摘要。该数据集的独特结构使其至少可以用于八种不同类型的摘要评估。我们还在数据集上对基准和最先进的深度学习方法的性能进行了基准测试,展示了如何使用它来评估自动生成的摘要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77c2/7532186/56603d711350/41597_2020_667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77c2/7532186/50bbd699f8b7/41597_2020_667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77c2/7532186/478c01715ad4/41597_2020_667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77c2/7532186/56603d711350/41597_2020_667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77c2/7532186/50bbd699f8b7/41597_2020_667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77c2/7532186/478c01715ad4/41597_2020_667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77c2/7532186/56603d711350/41597_2020_667_Fig3_HTML.jpg

相似文献

1
Question-driven summarization of answers to consumer health questions.面向消费者健康问题答案的问题驱动式总结。
Sci Data. 2020 Oct 2;7(1):322. doi: 10.1038/s41597-020-00667-z.
2
SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions.SemBioNLQA:一个语义生物医学问答系统,用于检索自然语言问题的准确和理想答案。
Artif Intell Med. 2020 Jan;102:101767. doi: 10.1016/j.artmed.2019.101767. Epub 2019 Nov 28.
3
Consumer health information and question answering: helping consumers find answers to their health-related information needs.消费者健康信息与问答:帮助消费者寻找与其健康相关的信息需求的答案。
J Am Med Inform Assoc. 2020 Feb 1;27(2):194-201. doi: 10.1093/jamia/ocz152.
4
Question-aware transformer models for consumer health question summarization.基于问句感知的 Transformer 模型在消费者健康问句总结中的应用
J Biomed Inform. 2022 Apr;128:104040. doi: 10.1016/j.jbi.2022.104040. Epub 2022 Mar 6.
5
Automatically classifying question types for consumer health questions.自动对消费者健康问题的问题类型进行分类。
AMIA Annu Symp Proc. 2014 Nov 14;2014:1018-27. eCollection 2014.
6
Evaluation of an automated knowledge-based textual summarization system for longitudinal clinical data, in the intensive care domain.评估一个自动化的基于知识的文本摘要系统在重症监护领域的纵向临床数据中的应用。
Artif Intell Med. 2017 Oct;82:20-33. doi: 10.1016/j.artmed.2017.09.001. Epub 2017 Sep 27.
7
Clinical Context-Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation.基于深度神经网络的临床相关生物医学文本摘要:模型开发与验证。
J Med Internet Res. 2020 Oct 23;22(10):e19810. doi: 10.2196/19810.
8
Toward automated consumer question answering: automatically separating consumer questions from professional questions in the healthcare domain.迈向自动化消费者问答:在医疗保健领域自动区分消费者问题和专业问题。
J Biomed Inform. 2011 Dec;44(6):1032-8. doi: 10.1016/j.jbi.2011.08.008. Epub 2011 Aug 12.
9
An Ensemble Method for Spelling Correction in Consumer Health Questions.一种用于消费者健康问题拼写纠正的集成方法。
AMIA Annu Symp Proc. 2015 Nov 5;2015:727-36. eCollection 2015.
10
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.BIOASQ大规模生物医学语义索引与问答竞赛概述。
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.

引用本文的文献

1
A Dataset of Medical Questions Paired with Automatically Generated Answers and Evidence-supported References.一个包含医学问题以及自动生成答案和有证据支持的参考文献的数据集。
Sci Data. 2025 Jun 19;12(1):1035. doi: 10.1038/s41597-025-05233-z.
2
MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning.MedAdapter:大型语言模型在医学推理方面的高效测试时自适应
Proc Conf Empir Methods Nat Lang Process. 2024 Nov;2024:22294-22314. doi: 10.18653/v1/2024.emnlp-main.1244.
3
Comparing the performance of ChatGPT-3.5-Turbo, ChatGPT-4, and Google Bard with Iranian students in pre-internship comprehensive exams.
将 ChatGPT-3.5-Turbo、ChatGPT-4 和 Google Bard 与伊朗医学生在实习前综合考试中的表现进行比较。
Sci Rep. 2024 Nov 18;14(1):28456. doi: 10.1038/s41598-024-79335-w.
4
Multifaceted Natural Language Processing Task-Based Evaluation of Bidirectional Encoder Representations From Transformers Models for Bilingual (Korean and English) Clinical Notes: Algorithm Development and Validation.基于转换器模型的双向编码器表示的多方面自然语言处理任务评估在双语(韩语和英语)临床笔记中的应用:算法开发和验证。
JMIR Med Inform. 2024 Oct 30;12:e52897. doi: 10.2196/52897.
5
Ascle-A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study.Ascle-A 是一个用于医疗文本生成的 Python 自然语言处理工具包:开发和评估研究。
J Med Internet Res. 2024 Oct 3;26:e60601. doi: 10.2196/60601.
6
Existing Barriers Faced by and Future Design Recommendations for Direct-to-Consumer Health Care Artificial Intelligence Apps: Scoping Review.直接面向消费者的医疗保健人工智能应用程序所面临的现有障碍和未来设计建议:范围综述。
J Med Internet Res. 2023 Dec 18;25:e50342. doi: 10.2196/50342.
7
Artificial Intelligence in Childcare: Assessing the Performance and Acceptance of ChatGPT Responses.人工智能在儿童保育中的应用:评估ChatGPT回复的性能与可接受性
Cureus. 2023 Aug 31;15(8):e44484. doi: 10.7759/cureus.44484. eCollection 2023 Aug.
8
CHQ- SocioEmo: Identifying Social and Emotional Support Needs in Consumer-Health Questions.CHQ- 社会情感:识别消费者健康问题中的社会和情感支持需求。
Sci Data. 2023 May 27;10(1):329. doi: 10.1038/s41597-023-02203-1.
9
quEHRy: a question answering system to query electronic health records.QueHRy:一个问答系统,用于查询电子健康记录。
J Am Med Inform Assoc. 2023 May 19;30(6):1091-1102. doi: 10.1093/jamia/ocad050.
10
How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.ChatGPT在美国医师执照考试(USMLE)中的表现如何?大语言模型对医学教育和知识评估的影响。
JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.