Suppr超能文献

自动主题词标引:重新审视副主题词附着问题。

Automatic MeSH Indexing: Revisiting the Subheading Attachment Problem.

机构信息

Lister Hill National Center for Biomedical Communications, National Library of Medicine, Bethesda, MD.

出版信息

AMIA Annu Symp Proc. 2021 Jan 25;2020:1031-1040. eCollection 2020.

Abstract

This year less than 200 National Library of Medicine indexers expect to index 1 million articles, and this would not be possible without the assistance of the Medical Text Indexer (MTI) system. MTI is an automated indexing system that provides MeSH main heading/subheading pair recommendations to assist indexers with their heavy workload. Over the years, a lot of research effort has focused on improving main heading prediction performance, but automated fine-grained indexing with main heading/subheading pairs has received much less attention. This work revisits the subheading attachment problem, and demonstrates very significant performance improvements using modern Convolutional Neural Network classifiers. The best performing method is shown to outperform the current MTI implementation with a 3.7% absolute improvement in precision, and a 27.6% absolute improvement in recall. We also conducted a manual review of false positive predictions, and 70% were found to be acceptable indexing.

摘要

今年,不到 200 名美国国家医学图书馆标引员预计将对 100 万篇文章进行标引,如果没有 Medical Text Indexer (MTI) 系统的协助,这是不可能实现的。MTI 是一个自动化标引系统,提供 MeSH 主要标题/副标题对的建议,以帮助标引员完成繁重的工作。多年来,大量的研究工作都集中在提高主要标题预测性能上,但对使用主要标题/副标题对的自动化细粒度标引关注较少。这项工作重新审视了副标题附加问题,并使用现代卷积神经网络分类器证明了非常显著的性能改进。结果表明,表现最好的方法比当前的 MTI 实现提高了 3.7%的绝对精度,提高了 27.6%的绝对召回率。我们还对假阳性预测进行了手动审查,发现 70%的预测是可以接受的索引。

相似文献

1
Automatic MeSH Indexing: Revisiting the Subheading Attachment Problem.
AMIA Annu Symp Proc. 2021 Jan 25;2020:1031-1040. eCollection 2020.
3
A recent advance in the automatic indexing of the biomedical literature.
J Biomed Inform. 2009 Oct;42(5):814-23. doi: 10.1016/j.jbi.2008.12.007. Epub 2008 Dec 30.
4
MeSH indexing based on automatically generated summaries.
BMC Bioinformatics. 2013 Jun 26;14:208. doi: 10.1186/1471-2105-14-208.
5
The NLM Indexing Initiative's Medical Text Indexer.
Stud Health Technol Inform. 2004;107(Pt 1):268-72.
6
12 years on - Is the NLM medical text indexer still useful and relevant?
J Biomed Semantics. 2017 Feb 23;8(1):8. doi: 10.1186/s13326-017-0113-5.
7
A bottom-up approach to MEDLINE indexing recommendations.
AMIA Annu Symp Proc. 2011;2011:1583-92. Epub 2011 Oct 22.
8
Automatic inference of indexing rules for MEDLINE.
BMC Bioinformatics. 2008 Nov 19;9 Suppl 11(Suppl 11):S11. doi: 10.1186/1471-2105-9-S11-S11.
9
MEDRank: using graph-based concept ranking to index biomedical texts.
Int J Med Inform. 2011 Jun;80(6):431-41. doi: 10.1016/j.ijmedinf.2011.02.008. Epub 2011 Mar 25.
10
Automated indexing using NLM's Medical Text Indexer (MTI) compared to human indexing in Medline: a pilot study.
J Med Libr Assoc. 2023 Jul 10;111(3):684-694. doi: 10.5195/jmla.2023.1588.

引用本文的文献

2
Artificial intelligence in clinical and translational science: Successes, challenges and opportunities.
Clin Transl Sci. 2022 Feb;15(2):309-321. doi: 10.1111/cts.13175. Epub 2021 Oct 30.

本文引用的文献

1
A High Recall Classifier for Selecting Articles for MEDLINE Indexing.
AMIA Annu Symp Proc. 2020 Mar 4;2019:727-734. eCollection 2019.
2
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
3
MeSHProbeNet: a self-attentive probe net for MeSH indexing.
Bioinformatics. 2019 Oct 1;35(19):3794-3802. doi: 10.1093/bioinformatics/btz142.
4
12 years on - Is the NLM medical text indexer still useful and relevant?
J Biomed Semantics. 2017 Feb 23;8(1):8. doi: 10.1186/s13326-017-0113-5.
5
DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.
Bioinformatics. 2016 Jun 15;32(12):i70-i79. doi: 10.1093/bioinformatics/btw294.
6
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.
7
A recent advance in the automatic indexing of the biomedical literature.
J Biomed Inform. 2009 Oct;42(5):814-23. doi: 10.1016/j.jbi.2008.12.007. Epub 2008 Dec 30.
8
PubMed related articles: a probabilistic topic-based model for content similarity.
BMC Bioinformatics. 2007 Oct 30;8:423. doi: 10.1186/1471-2105-8-423.
9
Indexing consistency in MEDLINE.
Bull Med Libr Assoc. 1983 Apr;71(2):176-83.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验