• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于生物医学事件触发检测的基于有效类型标签的协同表示学习

Effective type label-based synergistic representation learning for biomedical event trigger detection.

作者信息

Hao Anran, Yuan Haohan, Hui Siu Cheung, Su Jian

机构信息

School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, Singapore.

Aural & Language Intelligence, Institute for Infocomm Research, Agency for Science, Technology and Research, 1 Fusionopolis Way, Singapore, Singapore.

出版信息

BMC Bioinformatics. 2024 Jul 31;25(1):251. doi: 10.1186/s12859-024-05851-1.

DOI:10.1186/s12859-024-05851-1
PMID:39085787
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11293144/
Abstract

BACKGROUND

Detecting event triggers in biomedical texts, which contain domain knowledge and context-dependent terms, is more challenging than in general-domain texts. Most state-of-the-art models rely mainly on external resources such as linguistic tools and knowledge bases to improve system performance. However, they lack effective mechanisms to obtain semantic clues from label specification and sentence context. Given its success in image classification, label representation learning is a promising approach to enhancing biomedical event trigger detection models by leveraging the rich semantics of pre-defined event type labels.

RESULTS

In this paper, we propose the Biomedical Label-based Synergistic representation Learning (BioLSL) model, which effectively utilizes event type labels by learning their correlation with trigger words and enriches the representation contextually. The BioLSL model consists of three modules. Firstly, the Domain-specific Joint Encoding module employs a transformer-based, domain-specific pre-trained architecture to jointly encode input sentences and pre-defined event type labels. Secondly, the Label-based Synergistic Representation Learning module learns the semantic relationships between input texts and event type labels, and generates a Label-Trigger Aware Representation (LTAR) and a Label-Context Aware Representation (LCAR) for enhanced semantic representations. Finally, the Trigger Classification module makes structured predictions, where each label is predicted with respect to its neighbours. We conduct experiments on three benchmark BioNLP datasets, namely MLEE, GE09, and GE11, to evaluate our proposed BioLSL model. Results show that BioLSL has achieved state-of-the-art performance, outperforming the baseline models.

CONCLUSIONS

The proposed BioLSL model demonstrates good performance for biomedical event trigger detection without using any external resources. This suggests that label representation learning and context-aware enhancement are promising directions for improving the task. The key enhancement is that BioLSL effectively learns to construct semantic linkages between the event mentions and type labels, which provide the latent information of label-trigger and label-context relationships in biomedical texts. Moreover, additional experiments on BioLSL show that it performs exceptionally well with limited training data under the data-scarce scenarios.

摘要

背景

在包含领域知识和上下文相关术语的生物医学文本中检测事件触发因素,比在通用领域文本中更具挑战性。大多数最先进的模型主要依赖语言工具和知识库等外部资源来提高系统性能。然而,它们缺乏从标签规范和句子上下文中获取语义线索的有效机制。鉴于标签表示学习在图像分类中的成功,它是一种通过利用预定义事件类型标签的丰富语义来增强生物医学事件触发检测模型的有前途的方法。

结果

在本文中,我们提出了基于生物医学标签的协同表示学习(BioLSL)模型,该模型通过学习事件类型标签与触发词的相关性来有效利用它们,并在上下文上丰富表示。BioLSL模型由三个模块组成。首先,特定领域联合编码模块采用基于Transformer的特定领域预训练架构来联合编码输入句子和预定义事件类型标签。其次,基于标签的协同表示学习模块学习输入文本与事件类型标签之间的语义关系,并生成标签触发感知表示(LTAR)和标签上下文感知表示(LCAR)以增强语义表示。最后,触发分类模块进行结构化预测,其中每个标签相对于其邻居进行预测。我们在三个基准生物自然语言处理数据集,即MLEE、GE09和GE11上进行实验,以评估我们提出的BioLSL模型。结果表明,BioLSL取得了最先进的性能,优于基线模型。

结论

所提出的BioLSL模型在不使用任何外部资源的情况下,在生物医学事件触发检测方面表现出良好的性能。这表明标签表示学习和上下文感知增强是改进该任务的有前途的方向。关键增强点在于BioLSL有效地学习构建事件提及与类型标签之间的语义联系,这提供了生物医学文本中标签-触发和标签-上下文关系的潜在信息。此外,对BioLSL的额外实验表明,在数据稀缺的情况下,它在有限的训练数据下表现异常出色。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/e48726e5b6e7/12859_2024_5851_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/2ca811c4ad0d/12859_2024_5851_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/7fc77d531fda/12859_2024_5851_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/9350ffac243e/12859_2024_5851_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/2256ca26e98c/12859_2024_5851_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/d0717aaa0466/12859_2024_5851_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/e48726e5b6e7/12859_2024_5851_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/2ca811c4ad0d/12859_2024_5851_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/7fc77d531fda/12859_2024_5851_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/9350ffac243e/12859_2024_5851_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/2256ca26e98c/12859_2024_5851_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/d0717aaa0466/12859_2024_5851_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a025/11293144/e48726e5b6e7/12859_2024_5851_Fig6_HTML.jpg

相似文献

1
Effective type label-based synergistic representation learning for biomedical event trigger detection.用于生物医学事件触发检测的基于有效类型标签的协同表示学习
BMC Bioinformatics. 2024 Jul 31;25(1):251. doi: 10.1186/s12859-024-05851-1.
2
Pipelined biomedical event extraction rivaling joint learning.流水线生物医学事件抽取媲美联合学习。
Methods. 2024 Jun;226:9-18. doi: 10.1016/j.ymeth.2024.04.003. Epub 2024 Apr 9.
3
Contextual label sensitive gated network for biomedical event trigger extraction.基于上下文标签敏感门控网络的生物医学事件触发抽取
J Biomed Inform. 2019 Jul;95:103221. doi: 10.1016/j.jbi.2019.103221. Epub 2019 Jun 5.
4
Bidirectional long short-term memory with CRF for detecting biomedical event trigger in FastText semantic space.双向长短时记忆模型与条件随机场结合,用于在 FastText 语义空间中检测生物医学事件触发词。
BMC Bioinformatics. 2018 Dec 21;19(Suppl 20):507. doi: 10.1186/s12859-018-2543-1.
5
Large scale biomedical texts classification: a kNN and an ESA-based approaches.大规模生物医学文本分类:基于k近邻算法和基于词嵌入语义分析的方法。
J Biomed Semantics. 2016 Jun 16;7:40. doi: 10.1186/s13326-016-0073-1.
6
Biological Event Trigger Identification with Noise Contrastive Estimation.基于噪声对比估计的生物事件触发识别。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1549-1559. doi: 10.1109/TCBB.2017.2710048.
7
Learning entity-oriented representation for biomedical relation extraction.学习面向实体的表示用于生物医学关系抽取。
J Biomed Inform. 2023 Nov;147:104527. doi: 10.1016/j.jbi.2023.104527. Epub 2023 Oct 16.
8
Improving biomedical Named Entity Recognition with additional external contexts.利用额外的外部语境提高生物医学命名实体识别的性能。
J Biomed Inform. 2024 Aug;156:104674. doi: 10.1016/j.jbi.2024.104674. Epub 2024 Jun 11.
9
Biomedical event trigger detection by dependency-based word embedding.基于依存关系的词嵌入进行生物医学事件触发检测
BMC Med Genomics. 2016 Aug 10;9 Suppl 2(Suppl 2):45. doi: 10.1186/s12920-016-0203-8.
10
Unsupervised Event Graph Representation and Similarity Learning on Biomedical Literature.基于生物医学文献的无监督事件图表示和相似性学习。
Sensors (Basel). 2021 Dec 21;22(1):3. doi: 10.3390/s22010003.

本文引用的文献

1
Child-Sum EATree-LSTMs: enhanced attentive Child-Sum Tree-LSTMs for biomedical event extraction.儿童和总和 EATree-LSTMs:用于生物医学事件抽取的增强型注意儿童和总和树-LSTMs。
BMC Bioinformatics. 2023 Jun 15;24(1):253. doi: 10.1186/s12859-023-05336-7.
2
Predicting MiRNA-Disease Associations by Graph Representation Learning Based on Jumping Knowledge Networks.基于跳跃知识网络的图表示学习预测微小RNA-疾病关联
IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):2629-2638. doi: 10.1109/TCBB.2022.3196394. Epub 2023 Oct 9.
3
A span-graph neural model for overlapping entity relation extraction in biomedical texts.
一种用于生物医学文献中重叠实体关系抽取的图神经网络模型。
Bioinformatics. 2021 Jul 12;37(11):1581-1589. doi: 10.1093/bioinformatics/btaa993.
4
DeepEventMine: end-to-end neural nested event extraction from biomedical texts.DeepEventMine:从生物医学文本中进行端到端的神经嵌套事件提取。
Bioinformatics. 2020 Dec 8;36(19):4910-4917. doi: 10.1093/bioinformatics/btaa540.
5
Enriching contextualized language model from knowledge graph for biomedical information extraction.从知识图谱中丰富上下文相关的语言模型以进行生物医学信息抽取。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa110.
6
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
7
Contextual label sensitive gated network for biomedical event trigger extraction.基于上下文标签敏感门控网络的生物医学事件触发抽取
J Biomed Inform. 2019 Jul;95:103221. doi: 10.1016/j.jbi.2019.103221. Epub 2019 Jun 5.
8
Bidirectional long short-term memory with CRF for detecting biomedical event trigger in FastText semantic space.双向长短时记忆模型与条件随机场结合,用于在 FastText 语义空间中检测生物医学事件触发词。
BMC Bioinformatics. 2018 Dec 21;19(Suppl 20):507. doi: 10.1186/s12859-018-2543-1.
9
A Two-Stage Biomedical Event Trigger Detection Method Integrating Feature Selection and Word Embeddings.一种结合特征选择和词嵌入的两阶段生物医学事件触发检测方法。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jul-Aug;15(4):1325-1332. doi: 10.1109/TCBB.2017.2715016. Epub 2017 Jun 13.
10
TEES 2.2: Biomedical Event Extraction for Diverse Corpora.TEES 2.2:针对不同语料库的生物医学事件提取
BMC Bioinformatics. 2015;16 Suppl 16(Suppl 16):S4. doi: 10.1186/1471-2105-16-S16-S4. Epub 2015 Oct 30.