• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MeSHProbeNet:一种用于 MeSH 索引的自注意探针网络。

MeSHProbeNet: a self-attentive probe net for MeSH indexing.

机构信息

Department of Computer Science, University of Virginia, Charlottesville, VA, USA.

Department of Information and Communication Engineering, Beijing University of Technology, Beijing, China.

出版信息

Bioinformatics. 2019 Oct 1;35(19):3794-3802. doi: 10.1093/bioinformatics/btz142.

DOI:10.1093/bioinformatics/btz142
PMID:30851089
Abstract

MOTIVATION

MEDLINE is the primary bibliographic database maintained by National Library of Medicine (NLM). MEDLINE citations are indexed with Medical Subject Headings (MeSH), which is a controlled vocabulary curated by the NLM experts. This greatly facilitates the applications of biomedical research and knowledge discovery. Currently, MeSH indexing is manually performed by human experts. To reduce the time and monetary cost associated with manual annotation, many automatic MeSH indexing systems have been proposed to assist manual annotation, including DeepMeSH and NLM's official model Medical Text Indexer (MTI). However, the existing models usually rely on the intermediate results of other models and suffer from efficiency issues. We propose an end-to-end framework, MeSHProbeNet (formerly named as xgx), which utilizes deep learning and self-attentive MeSH probes to index MeSH terms. Each MeSH probe enables the model to extract one specific aspect of biomedical knowledge from an input article, thus comprehensive biomedical information can be extracted with different MeSH probes and interpretability can be achieved at word level. MeSH terms are finally recommended with a unified classifier, making MeSHProbeNet both time efficient and space efficient.

RESULTS

MeSHProbeNet won the first place in the latest batch of Task A in the 2018 BioASQ challenge. The result on the last test set of the challenge is reported in this paper. Compared with other state-of-the-art models, such as MTI and DeepMeSH, MeSHProbeNet achieves the highest scores in all the F-measures, including Example Based F-Measure, Macro F-Measure, Micro F-Measure, Hierarchical F-Measure and Lowest Common Ancestor F-measure. We also intuitively show how MeSHProbeNet is able to extract comprehensive biomedical knowledge from an input article.

摘要

动机

MEDLINE 是由美国国立医学图书馆(NLM)维护的主要书目数据库。MEDLINE 引文使用医学主题词(MeSH)进行索引,MeSH 是由 NLM 专家策划的受控词汇。这极大地方便了生物医学研究和知识发现的应用。目前,MeSH 索引是由人类专家手动完成的。为了降低与手动注释相关的时间和金钱成本,已经提出了许多自动 MeSH 索引系统来辅助手动注释,包括 DeepMeSH 和 NLM 的官方模型 Medical Text Indexer(MTI)。然而,现有的模型通常依赖于其他模型的中间结果,并存在效率问题。我们提出了一个端到端的框架,MeSHProbeNet(以前称为 xgx),它利用深度学习和自注意 MeSH 探针来索引 MeSH 术语。每个 MeSH 探针使模型能够从输入文章中提取一个特定的生物医学知识方面,因此可以使用不同的 MeSH 探针提取全面的生物医学信息,并在单词级别实现可解释性。MeSH 术语最终通过统一的分类器进行推荐,使 MeSHProbeNet 既高效又节省空间。

结果

MeSHProbeNet 在 2018 年 BioASQ 挑战赛的最新一轮任务 A 中获得第一名。本文报告了该挑战赛最后一个测试集的结果。与其他最先进的模型(如 MTI 和 DeepMeSH)相比,MeSHProbeNet 在所有 F 度量中(包括基于示例的 F 度量、宏 F 度量、微 F 度量、层次 F 度量和最低公共祖先 F 度量)均取得了最高分数。我们还直观地展示了 MeSHProbeNet 如何从输入文章中提取全面的生物医学知识。

相似文献

1
MeSHProbeNet: a self-attentive probe net for MeSH indexing.MeSHProbeNet:一种用于 MeSH 索引的自注意探针网络。
Bioinformatics. 2019 Oct 1;35(19):3794-3802. doi: 10.1093/bioinformatics/btz142.
2
FullMeSH: improving large-scale MeSH indexing with full text.全文 MeSH:利用全文提高大规模 MeSH 标引的质量。
Bioinformatics. 2020 Mar 1;36(5):1533-1541. doi: 10.1093/bioinformatics/btz756.
3
MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence.医学主题词表(MeSH)标注器:通过整合多种证据提高大规模医学主题词表索引的准确性。
Bioinformatics. 2015 Jun 15;31(12):i339-47. doi: 10.1093/bioinformatics/btv237.
4
DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.深度医学主题词表:用于改进大规模医学主题词表索引的深度语义表示。
Bioinformatics. 2016 Jun 15;32(12):i70-i79. doi: 10.1093/bioinformatics/btw294.
5
MeSH indexing based on automatically generated summaries.基于自动生成的摘要进行 MeSH 标引。
BMC Bioinformatics. 2013 Jun 26;14:208. doi: 10.1186/1471-2105-14-208.
6
MeSHLabeler and DeepMeSH: Recent Progress in Large-Scale MeSH Indexing.医学主题词标注器与深度医学主题词:大规模医学主题词标引的最新进展
Methods Mol Biol. 2018;1807:203-209. doi: 10.1007/978-1-4939-8561-6_15.
7
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.BIOASQ大规模生物医学语义索引与问答竞赛概述。
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.
8
Automated indexing using NLM's Medical Text Indexer (MTI) compared to human indexing in Medline: a pilot study.使用 NLM 的医学文本索引器 (MTI) 进行自动索引与 Medline 中的人工索引相比:一项试点研究。
J Med Libr Assoc. 2023 Jul 10;111(3):684-694. doi: 10.5195/jmla.2023.1588.
9
A recent advance in the automatic indexing of the biomedical literature.生物医学文献自动标引的最新进展。
J Biomed Inform. 2009 Oct;42(5):814-23. doi: 10.1016/j.jbi.2008.12.007. Epub 2008 Dec 30.
10
Reflective random indexing for semi-automatic indexing of the biomedical literature.基于反射随机索引的生物医学文献半自动索引方法。
J Biomed Inform. 2010 Oct;43(5):694-700. doi: 10.1016/j.jbi.2010.04.001. Epub 2010 Apr 9.

引用本文的文献

1
Enhancing automated indexing of publication types and study designs in biomedical literature using full-text features.利用全文特征增强生物医学文献中出版物类型和研究设计的自动索引。
medRxiv. 2025 Apr 28:2025.04.23.25326300. doi: 10.1101/2025.04.23.25326300.
2
Context-Aware Contrastive Representation Learning for Zero-Shot Biomedical Text Classification.用于零样本生物医学文本分类的上下文感知对比表示学习
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2024 Dec;2024:3611-3614. doi: 10.1109/bibm62325.2024.10822585.
3
Weightage Identified Network of Keywords Technique: A Structured Approach in Identifying Keywords for Systematic Reviews.
加权识别关键词网络技术:一种用于系统评价中识别关键词的结构化方法。
Healthc Inform Res. 2025 Jan;31(1):48-56. doi: 10.4258/hir.2025.31.1.48. Epub 2025 Jan 31.
4
LitCovid ensemble learning for COVID-19 multi-label classification.LitCovid 用于 COVID-19 多标签分类的集成学习。
Database (Oxford). 2022 Nov 25;2022. doi: 10.1093/database/baac103.
5
Multi-probe attention neural network for COVID-19 semantic indexing.多探针注意力神经网络用于 COVID-19 语义索引。
BMC Bioinformatics. 2022 Jun 29;23(1):259. doi: 10.1186/s12859-022-04803-x.
6
Thesaurus-based word embeddings for automated biomedical literature classification.基于词库的词嵌入用于自动化生物医学文献分类。
Neural Comput Appl. 2022;34(2):937-950. doi: 10.1007/s00521-021-06053-z. Epub 2021 May 11.
7
Automatic MeSH Indexing: Revisiting the Subheading Attachment Problem.自动主题词标引:重新审视副主题词附着问题。
AMIA Annu Symp Proc. 2021 Jan 25;2020:1031-1040. eCollection 2020.
8
FullMeSH: improving large-scale MeSH indexing with full text.全文 MeSH:利用全文提高大规模 MeSH 标引的质量。
Bioinformatics. 2020 Mar 1;36(5):1533-1541. doi: 10.1093/bioinformatics/btz756.