Arabzadeh Negar, Bagheri Ebrahim
University of Waterloo, Waterloo, ON, Canada.
Toronto Metropolitan University, Toronto, ON, Canada.
J Biomed Inform. 2023 Oct;146:104486. doi: 10.1016/j.jbi.2023.104486. Epub 2023 Sep 16.
Large neural-based Pre-trained Language Models (PLMs) have recently gained much attention due to their noteworthy performance on many downstream Information Retrieval (IR) and Natural Language Processing (NLP) tasks. PLMs can be categorized as either general-purpose, trained on resources such as large-scale Web corpora, or domain-specific, trained on in-domain or mixed-domain corpora. While domain-specific PLMs have shown promising performance on domain-specific tasks, they are significantly more computationally expensive than general-purpose PLMs because they must be either retrained or trained from scratch. The objective of our work in this paper is to explore whether general-purpose PLMs can achieve performance competitive with domain-specific PLMs without the need for expensive domain-specific retraining. Focusing specifically on the recent BioASQ Biomedical Question Answering task, we show how different general-purpose PLMs exhibit synergistic behaviour in terms of performance, which can lead to notable overall performance improvements when they are used in tandem. More concretely, given a set of general-purpose PLMs, we propose a self-supervised method for training a classifier that systematically selects, on a per-input basis, the PLM most likely to answer the question correctly. We show that through such a selection strategy, the performance of general-purpose PLMs can become competitive with domain-specific PLMs while remaining computationally light, since there is no need to retrain the large language model itself. We run experiments on the BioASQ dataset, a large-scale biomedical question-answering benchmark.
We show that our proposed selection strategy yields statistically significant performance improvements for general-purpose language models: an average of 16.7% when using only lighter models such as DistilBERT and DistilRoBERTa, and 14.2% when using relatively larger models such as BERT and RoBERTa. As a result, their performance becomes competitive with domain-specific large language models such as PubMedBERT.
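The core idea of the abstract, routing each question to the general-purpose PLM most likely to answer it correctly, with labels derived self-supervised from which PLM succeeded on past questions, can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the featurizer, the classifier, and all names (`toy_featurize`, `SelectorClassifier`, the model ids) are assumptions standing in for real PLM encoders and a learned classifier.

```python
from collections import defaultdict

def toy_featurize(question):
    # Stand-in for a real text encoder: a bag of lowercased tokens.
    return set(question.lower().split())

class SelectorClassifier:
    """Illustrative nearest-profile router over token overlap."""
    def __init__(self):
        self.profiles = defaultdict(set)

    def fit(self, questions, correct_model_ids):
        # Self-supervised labels: for each training question, the id of
        # the general-purpose PLM that answered it correctly.
        for q, model_id in zip(questions, correct_model_ids):
            self.profiles[model_id] |= toy_featurize(q)

    def select(self, question):
        # Route to the PLM whose past successes overlap most with this input.
        feats = toy_featurize(question)
        return max(self.profiles,
                   key=lambda model_id: len(self.profiles[model_id] & feats))

# Toy training data: which model answered each question correctly.
train_qs = ["what gene causes cystic fibrosis",
            "which drug treats hypertension",
            "what gene is linked to huntington disease"]
train_labels = ["bert", "roberta", "bert"]

router = SelectorClassifier()
router.fit(train_qs, train_labels)
print(router.select("what gene causes sickle cell disease"))  # → bert
```

The routing step is cheap relative to retraining a PLM: at inference time only the selector runs before a single general-purpose model is queried, which is what keeps the approach computationally light.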