• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度排序递归自动编码器的列表式学习对生物医学问答对进行排序。

List-wise learning to rank biomedical question-answer pairs with deep ranking recursive autoencoders.

机构信息

Department of Computer Science and Technology, School of Mechanical Electronic and Information Engineering, China University of Mining and Technology Beijing, Beijing, China.

Alibaba Group, Hangzhou, China.

出版信息

PLoS One. 2020 Nov 9;15(11):e0242061. doi: 10.1371/journal.pone.0242061. eCollection 2020.

DOI:10.1371/journal.pone.0242061
PMID:33166367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7652278/
Abstract

Biomedical question answering (QA) represents a growing concern among industry and academia due to the crucial impact of biomedical information. When mapping and ranking candidate snippet answers within relevant literature, current QA systems typically refer to information retrieval (IR) techniques: specifically, query processing approaches and ranking models. However, these IR-based approaches are insufficient to consider both syntactic and semantic relatedness and thus cannot formulate accurate natural language answers. Recently, deep learning approaches have become well-known for learning optimal semantic feature representations in natural language processing tasks. In this paper, we present a deep ranking recursive autoencoders (rankingRAE) architecture for ranking question-candidate snippet answer pairs (Q-S) to obtain the most relevant candidate answers for biomedical questions extracted from the potentially relevant documents. In particular, we convert the task of ranking candidate answers to several simultaneous binary classification tasks for determining whether a question and a candidate answer are relevant. The compositional words and their random initialized vectors of concatenated Q-S pairs are fed into recursive autoencoders to learn the optimal semantic representations in an unsupervised way, and their semantic relatedness is classified through supervised learning. Unlike several existing methods to directly choose the top-K candidates with highest probabilities, we take the influence of different ranking results into consideration. Consequently, we define a listwise "ranking error" for loss function computation to penalize inappropriate answer ranking for each question and to eliminate their influence. The proposed architecture is evaluated with respect to the BioASQ 2013-2018 Six-year Biomedical Question Answering benchmarks. Compared with classical IR models, other deep representation models, as well as some state-of-the-art systems for these tasks, the experimental results demonstrate the robustness and effectiveness of rankingRAE.

摘要

生物医学问答 (QA) 由于生物医学信息的关键影响,在工业界和学术界引起了越来越多的关注。在映射和对相关文献中的候选片段答案进行排名时,当前的 QA 系统通常参考信息检索 (IR) 技术:具体来说,查询处理方法和排名模型。但是,这些基于 IR 的方法不足以考虑语法和语义相关性,因此无法形成准确的自然语言答案。最近,深度学习方法在自然语言处理任务中学习最佳语义特征表示方面变得众所周知。在本文中,我们提出了一种深度排名递归自动编码器 (rankingRAE) 架构,用于对问题-候选片段答案对 (Q-S) 进行排名,以从潜在相关文档中获取与生物医学问题最相关的候选答案。具体来说,我们将对候选答案进行排名的任务转换为几个同时的二进制分类任务,以确定问题和候选答案是否相关。将 Q-S 对的组合词及其随机初始化向量作为输入,通过递归自动编码器以无监督的方式学习最佳语义表示,然后通过监督学习对它们的语义相关性进行分类。与直接选择具有最高概率的前 K 个候选者的几种现有方法不同,我们考虑了不同排名结果的影响。因此,我们定义了一个列表级别的“排名错误”,用于计算损失函数,以惩罚每个问题中不合适的答案排名,并消除其影响。该架构在 2013-2018 年生物 ASQ 六年度生物医学问答基准上进行了评估。与经典 IR 模型、其他深度表示模型以及这些任务的一些最新系统相比,实验结果证明了 rankingRAE 的稳健性和有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/811d9e51d8d7/pone.0242061.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/ca95ffa95609/pone.0242061.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/c45c6c5ec0a0/pone.0242061.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/bf8349162292/pone.0242061.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/811d9e51d8d7/pone.0242061.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/ca95ffa95609/pone.0242061.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/c45c6c5ec0a0/pone.0242061.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/bf8349162292/pone.0242061.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9907/7652278/811d9e51d8d7/pone.0242061.g004.jpg

相似文献

1
List-wise learning to rank biomedical question-answer pairs with deep ranking recursive autoencoders.基于深度排序递归自动编码器的列表式学习对生物医学问答对进行排序。
PLoS One. 2020 Nov 9;15(11):e0242061. doi: 10.1371/journal.pone.0242061. eCollection 2020.
2
SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions.SemBioNLQA:一个语义生物医学问答系统,用于检索自然语言问题的准确和理想答案。
Artif Intell Med. 2020 Jan;102:101767. doi: 10.1016/j.artmed.2019.101767. Epub 2019 Nov 28.
3
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.BIOASQ大规模生物医学语义索引与问答竞赛概述。
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.
4
A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering.一种基于机器学习的生物医学问答中问题类型分类方法。
Methods Inf Med. 2017 May 18;56(3):209-216. doi: 10.3414/ME16-01-0116. Epub 2017 Mar 31.
5
Large scale biomedical texts classification: a kNN and an ESA-based approaches.大规模生物医学文本分类:基于k近邻算法和基于词嵌入语义分析的方法。
J Biomed Semantics. 2016 Jun 16;7:40. doi: 10.1186/s13326-016-0073-1.
6
Word embeddings and external resources for answer processing in biomedical factoid question answering.词向量和外部资源在生物医学事实问答中的答案处理
J Biomed Inform. 2019 Apr;92:103118. doi: 10.1016/j.jbi.2019.103118. Epub 2019 Feb 10.
7
A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.一种基于概率信息检索模型和统一医学语言系统(UMLS)概念的生物医学问答中的段落检索方法。
J Biomed Inform. 2017 Apr;68:96-103. doi: 10.1016/j.jbi.2017.03.001. Epub 2017 Mar 7.
8
Biomedical question answering using semantic relations.基于语义关系的生物医学问答
BMC Bioinformatics. 2015 Jan 16;16(1):6. doi: 10.1186/s12859-014-0365-3.
9
A question-entailment approach to question answering.问题蕴涵方法在问答中的应用。
BMC Bioinformatics. 2019 Oct 22;20(1):511. doi: 10.1186/s12859-019-3119-4.
10
Health assistant: answering your questions anytime from biomedical literature.健康助手:随时随地回答您关于生物医学文献的问题。
Bioinformatics. 2019 Oct 15;35(20):4129-4139. doi: 10.1093/bioinformatics/btz195.

引用本文的文献

1
Question answering systems for health professionals at the point of care-a systematic review.在护理点为医疗保健专业人员提供问答系统——系统评价。
J Am Med Inform Assoc. 2024 Apr 3;31(4):1009-1024. doi: 10.1093/jamia/ocae015.
2
Demystifying the Role of Natural Language Processing (NLP) in Smart City Applications: Background, Motivation, Recent Advances, and Future Research Directions.揭开自然语言处理(NLP)在智慧城市应用中的作用:背景、动机、最新进展及未来研究方向
Wirel Pers Commun. 2023;130(2):857-908. doi: 10.1007/s11277-023-10312-8. Epub 2023 Mar 16.

本文引用的文献

1
A question-entailment approach to question answering.问题蕴涵方法在问答中的应用。
BMC Bioinformatics. 2019 Oct 22;20(1):511. doi: 10.1186/s12859-019-3119-4.
2
Clinical text classification with rule-based features and knowledge-guided convolutional neural networks.基于规则特征和知识引导卷积神经网络的临床文本分类。
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):71. doi: 10.1186/s12911-019-0781-4.
3
Word embeddings and external resources for answer processing in biomedical factoid question answering.词向量和外部资源在生物医学事实问答中的答案处理
J Biomed Inform. 2019 Apr;92:103118. doi: 10.1016/j.jbi.2019.103118. Epub 2019 Feb 10.
4
A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.一种基于概率信息检索模型和统一医学语言系统(UMLS)概念的生物医学问答中的段落检索方法。
J Biomed Inform. 2017 Apr;68:96-103. doi: 10.1016/j.jbi.2017.03.001. Epub 2017 Mar 7.
5
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.BIOASQ大规模生物医学语义索引与问答竞赛概述。
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.
6
Knowledge Discovery and interactive Data Mining in Bioinformatics--State-of-the-Art, future challenges and research directions.生物信息学中的知识发现与交互式数据挖掘——现状、未来挑战及研究方向
BMC Bioinformatics. 2014;15 Suppl 6(Suppl 6):I1. doi: 10.1186/1471-2105-15-S6-I1. Epub 2014 May 16.
7
The MiPACQ clinical question answering system.MiPACQ临床问答系统。
AMIA Annu Symp Proc. 2011;2011:171-80. Epub 2011 Oct 22.
8
Automatically extracting information needs from complex clinical questions.从复杂的临床问题中自动提取信息需求。
J Biomed Inform. 2010 Dec;43(6):962-71. doi: 10.1016/j.jbi.2010.07.007. Epub 2010 Jul 27.
9
Biomedical question answering: a survey.生物医学问答:综述。
Comput Methods Programs Biomed. 2010 Jul;99(1):1-24. doi: 10.1016/j.cmpb.2009.10.003. Epub 2009 Nov 13.
10
A cognitive evaluation of four online search engines for answering definitional questions posed by physicians.对四个在线搜索引擎回答医生提出的定义性问题的认知评估。
Pac Symp Biocomput. 2007:328-39.