Improving Biomedical ReQA With Consistent NLI-Transfer and Post-Whitening.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):1864-1875. doi: 10.1109/TCBB.2022.3219375. Epub 2023 Jun 5.

DOI: 10.1109/TCBB.2022.3219375
PMID: 36331640
Abstract

Retrieval Question Answering (ReQA) is an essential mechanism of information sharing that aims to find the answer to a posed question among large-scale candidates. Currently, the most efficient solution is the Dual-Encoder, which has shown great potential in the general domain but remains under-studied for biomedical ReQA. Obtaining a robust Dual-Encoder from biomedical datasets is challenging, because the scarce annotated data are insufficient to train the model, which results in over-fitting. In this work, we first build ReQA BioASQ datasets for retrieving answers to biomedical questions, which can facilitate the corresponding research. On that basis, we propose a framework to solve the over-fitting issue for robust biomedical answer retrieval. Under the proposed framework, we first pre-train the Dual-Encoder on the natural language inference (NLI) task before training on biomedical ReQA, where we appropriately change the pre-training objective of NLI to improve the consistency between NLI and biomedical ReQA, which significantly improves the transferability. Moreover, to eliminate the feature redundancies of the Dual-Encoder, consistent post-whitening is proposed to conduct decorrelation on the training and trained sentence embeddings. With extensive experiments, the proposed framework achieves promising results and exhibits significant improvement compared with various competitive methods.
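To make the decorrelation idea in the abstract concrete, the following is a minimal sketch of a generic whitening transform applied to sentence embeddings, followed by similarity ranking as a Dual-Encoder would do at retrieval time. It is not the paper's consistent post-whitening procedure, whose details the abstract does not give. The function names (`fit_whitening`, `apply_whitening`, `rank_candidates`), the use of cosine similarity, and the synthetic embeddings are all assumptions made for illustration.

```python
import numpy as np


def fit_whitening(embeddings: np.ndarray, eps: float = 1e-12):
    """Estimate a whitening transform (mean, W) from row-wise embeddings."""
    mu = embeddings.mean(axis=0, keepdims=True)
    centered = embeddings - mu
    cov = centered.T @ centered / embeddings.shape[0]
    # cov is symmetric positive semi-definite, so its SVD is U diag(s) U^T.
    u, s, _ = np.linalg.svd(cov)
    # Scaling each principal direction by 1/sqrt(variance) gives identity covariance.
    w = u @ np.diag(1.0 / np.sqrt(s + eps))
    return mu, w


def apply_whitening(embeddings: np.ndarray, mu: np.ndarray, w: np.ndarray) -> np.ndarray:
    """Center the embeddings and project them with the whitening matrix."""
    return (embeddings - mu) @ w


def rank_candidates(question_vec: np.ndarray, candidate_vecs: np.ndarray) -> np.ndarray:
    """Return candidate indices sorted by descending cosine similarity (assumed metric)."""
    q = question_vec / np.linalg.norm(question_vec)
    c = candidate_vecs / np.linalg.norm(candidate_vecs, axis=1, keepdims=True)
    return np.argsort(-(c @ q))


# Synthetic stand-ins for encoder outputs (hypothetical data, not from the paper):
# 500 answer embeddings with 8 deliberately correlated feature dimensions.
rng = np.random.default_rng(0)
mixing = rng.normal(size=(8, 8))
answers = rng.normal(size=(500, 8)) @ mixing
questions = answers[:5] + 0.01 * rng.normal(size=(5, 8))

# Fit the transform on the answer embeddings, then apply the same transform
# to both sides so questions and answers live in one decorrelated space.
mu, w = fit_whitening(answers)
white_answers = apply_whitening(answers, mu, w)
white_questions = apply_whitening(questions, mu, w)

# After whitening, the feature covariance should be close to the identity.
cov_after = np.cov(white_answers, rowvar=False, bias=True)
ranking = rank_candidates(white_questions[0], white_answers)
```

Fitting the mean and matrix once and reusing them for questions and answers keeps the two sides of the encoder comparable; how the paper keeps whitening consistent between training and inference is not specified in the abstract.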

Cited by

1
Question answering systems for health professionals at the point of care-a systematic review.
J Am Med Inform Assoc. 2024 Apr 3;31(4):1009-1024. doi: 10.1093/jamia/ocae015.