• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于零样本生物医学文本分类的上下文感知对比表示学习

Context-Aware Contrastive Representation Learning for Zero-Shot Biomedical Text Classification.

作者信息

Mukherjee Ratri, Jha Kishlay

机构信息

Department of Electrical and Computer Engineering University of Iowa, IA, USA.

出版信息

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2024 Dec;2024:3611-3614. doi: 10.1109/bibm62325.2024.10822585.

DOI:10.1109/bibm62325.2024.10822585
PMID:40103667
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11916847/
Abstract

Biomedical text classification refers to the task of annotating a biomedical text with its relevant labels from a candidate label set. Most of the existing approach operate in a fully supervised setting and thus heavily rely on human-annotated training data which is both labor-intensive and monetarily expensive. To address this, we propose to formulate biomedical text classification under the zero-shot learning (ZSL) paradigm that does not require any labeled training data and only relies on label surface names for training and inference. Specifically, we propose a new context-aware contrastive learning technique for ZSL that fully exploits the context information present in the biomedical text to generate semantically enriched feature representations needed for accurate zero-shot biomedical text classification. Unlike existing contrastive learning approaches that typically employ random text segmentation strategies to generate contrastive pairs, our approach utilizes the context information inherently present in biomedical text to generate semantically meaningful contrastive pairs. Extensive experiments on the largest available biomedical corpus validates the effectiveness of the proposed approach.

摘要

生物医学文本分类是指从候选标签集中为生物医学文本标注相关标签的任务。现有的大多数方法都在完全监督的环境下运行,因此严重依赖人工标注的训练数据,这既耗费人力又成本高昂。为了解决这个问题,我们建议在零样本学习(ZSL)范式下进行生物医学文本分类,该范式不需要任何有标签的训练数据,仅依赖标签表面名称进行训练和推理。具体而言,我们为ZSL提出了一种新的上下文感知对比学习技术,该技术充分利用生物医学文本中存在的上下文信息,以生成准确的零样本生物医学文本分类所需的语义丰富的特征表示。与现有的通常采用随机文本分割策略来生成对比对的对比学习方法不同,我们的方法利用生物医学文本中固有的上下文信息来生成语义有意义的对比对。在最大可用生物医学语料库上进行的大量实验验证了所提出方法的有效性。

相似文献

1
Context-Aware Contrastive Representation Learning for Zero-Shot Biomedical Text Classification.用于零样本生物医学文本分类的上下文感知对比表示学习
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2024 Dec;2024:3611-3614. doi: 10.1109/bibm62325.2024.10822585.
2
Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。
Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.
3
Multi-Label Zero-Shot Learning Via Contrastive Label-Based Attention.基于对比标签注意力的多标签零样本学习
Int J Neural Syst. 2025 Mar;35(3):2550010. doi: 10.1142/S0129065725500108. Epub 2025 Jan 23.
4
A veracity dissemination consistency-based few-shot fake news detection framework by synergizing adversarial and contrastive self-supervised learning.一种基于真实性传播一致性的少样本假新闻检测框架,通过协同对抗性和对比性自监督学习实现。
Sci Rep. 2024 Aug 22;14(1):19470. doi: 10.1038/s41598-024-70039-9.
5
Aligning Semantic in Brain and Language: A Curriculum Contrastive Method for Electroencephalography-to-Text Generation.脑与语言的语义对齐:一种用于脑电文本生成的课程对比方法。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:3874-3883. doi: 10.1109/TNSRE.2023.3314642. Epub 2023 Oct 9.
6
Exploiting instance-label dynamics through reciprocal anchored contrastive learning for few-shot relation extraction.通过互惠锚定对比学习利用实例标签动态进行少样本关系抽取。
Neural Netw. 2025 Jul;187:107259. doi: 10.1016/j.neunet.2025.107259. Epub 2025 Feb 25.
7
From Pixel to Patch: Synthesize Context-Aware Features for Zero-Shot Semantic Segmentation.从像素到图像块:为零样本语义分割合成上下文感知特征。
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):7689-7703. doi: 10.1109/TNNLS.2022.3145962. Epub 2023 Oct 5.
8
HC L: Hybrid and Cooperative Contrastive Learning for Cross-Lingual Spoken Language Understanding.HCL:用于跨语言口语理解的混合协作对比学习
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8094-8105. doi: 10.1109/TPAMI.2024.3402746. Epub 2024 Nov 6.
9
sCL-ST: Supervised Contrastive Learning With Semantic Transformations for Multiple Lead ECG Arrhythmia Classification.sCL-ST:基于语义转换的监督对比学习在多导联 ECG 心律失常分类中的应用。
IEEE J Biomed Health Inform. 2023 Jun;27(6):2818-2828. doi: 10.1109/JBHI.2023.3246241. Epub 2023 Jun 5.
10
Multi-label zero-shot human action recognition via joint latent ranking embedding.基于联合潜在排序嵌入的多标签零镜头人体动作识别。
Neural Netw. 2020 Feb;122:1-23. doi: 10.1016/j.neunet.2019.09.029. Epub 2019 Oct 21.

本文引用的文献

1
BERTMeSH: deep contextual representation learning for large-scale high-performance MeSH indexing with full text.BERTMeSH:基于深度上下文表示学习的大规模高性能 MeSH 索引与全文检索
Bioinformatics. 2021 May 5;37(5):684-692. doi: 10.1093/bioinformatics/btaa837.
2
MeSHProbeNet: a self-attentive probe net for MeSH indexing.MeSHProbeNet:一种用于 MeSH 索引的自注意探针网络。
Bioinformatics. 2019 Oct 1;35(19):3794-3802. doi: 10.1093/bioinformatics/btz142.
3
SemaTyP: a knowledge graph based literature mining method for drug discovery.SemaTyP:一种基于知识图谱的药物发现文献挖掘方法。
BMC Bioinformatics. 2018 May 30;19(1):193. doi: 10.1186/s12859-018-2167-5.
4
Evaluation and comparison of bioinformatic tools for the enrichment analysis of metabolomics data.代谢组学数据富集分析的生物信息学工具评估与比较。
BMC Bioinformatics. 2018 Jan 2;19(1):1. doi: 10.1186/s12859-017-2006-0.
5
DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.深度医学主题词表:用于改进大规模医学主题词表索引的深度语义表示。
Bioinformatics. 2016 Jun 15;32(12):i70-i79. doi: 10.1093/bioinformatics/btw294.
6
PubMed and beyond: a survey of web tools for searching biomedical literature.PubMed 及其他:生物医学文献检索网络工具调查。
Database (Oxford). 2011 Jan 18;2011:baq036. doi: 10.1093/database/baq036. Print 2011.
7
ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.语境:一种从临床报告中确定否定、体验者和时间状态的算法。
J Biomed Inform. 2009 Oct;42(5):839-51. doi: 10.1016/j.jbi.2009.05.002. Epub 2009 May 10.
8
Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.生物医学文本到UMLS元词表的有效映射:MetaMap程序
Proc AMIA Symp. 2001:17-21.
9
Medical Subject Headings (MeSH).医学主题词表(MeSH)。
Bull Med Libr Assoc. 2000 Jul;88(3):265-6.
10
Dietary linoleic acid and salt-induced hypertension.膳食亚油酸与盐诱导的高血压
Can J Physiol Pharmacol. 1981 Aug;59(8):872-5. doi: 10.1139/y81-130.