MeSH Up: effective MeSH text classification for improved document retrieval.

Authors

Trieschnigg Dolf, Pezik Piotr, Lee Vivian, de Jong Franciska, Kraaij Wessel, Rebholz-Schuhmann Dietrich

Affiliation

European Bioinformatics Institute, Hinxton, UK.

Publication details

Bioinformatics. 2009 Jun 1;25(11):1412-8. doi: 10.1093/bioinformatics/btp249. Epub 2009 Apr 17.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2682526/

Abstract

MOTIVATION

Controlled vocabularies such as the Medical Subject Headings (MeSH) thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing biomedical information by reducing the ambiguity inherent to free-text data. Different methods of automating the assignment of MeSH concepts have been proposed to replace manual annotation, but they are either limited to a small subset of MeSH or have only been compared with a limited number of other systems.

RESULTS

We compare the performance of six MeSH classification systems [MetaMap, EAGL, a language model-based approach, a vector space model-based approach, a K-Nearest Neighbor (KNN) approach and MTI] in terms of reproducing and complementing manual MeSH annotations. A KNN system clearly outperforms the other published approaches and scales well with large amounts of text using the full MeSH thesaurus. Our measurements demonstrate to what extent manual MeSH annotations can be reproduced and how they can be complemented by automatic annotations. We also show that a statistically significant improvement can be obtained in information retrieval (IR) when the text of a user's query is automatically annotated with MeSH concepts, compared to using the original textual query alone.

CONCLUSIONS

The annotation of biomedical texts using controlled vocabularies such as MeSH can be automated to improve text-only IR. Furthermore, the automatic MeSH annotation system we propose is highly scalable and it generates improvements in IR comparable with those observed for manual annotations.
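To make the KNN idea from the results concrete, the following is a minimal sketch of nearest-neighbor MeSH assignment. It is not the authors' implementation. It assumes a generic scheme in which the k most similar already-annotated documents are retrieved by TF-IDF cosine similarity and each of their MeSH terms is scored by the summed similarity of the neighbors that carry it. The toy corpus, the function names (`knn_mesh`, `tfidf`, `build_idf`) and the parameters `k` and `top_n` are all hypothetical and chosen for illustration only.

```python
import math
import re
from collections import Counter, defaultdict

# Toy annotated corpus (hypothetical): (text, manually assigned MeSH terms).
CORPUS = [
    ("insulin resistance in type 2 diabetes patients",
     ["Diabetes Mellitus, Type 2", "Insulin Resistance"]),
    ("glucose control and insulin therapy in diabetes",
     ["Diabetes Mellitus", "Insulin"]),
    ("gene expression profiling of breast tumors",
     ["Breast Neoplasms", "Gene Expression Profiling"]),
    ("microarray analysis of gene expression in yeast",
     ["Gene Expression Profiling", "Saccharomyces cerevisiae"]),
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_idf(token_lists):
    """Smoothed inverse document frequency for every token in the corpus."""
    n = len(token_lists)
    df = Counter(tok for toks in token_lists for tok in set(toks))
    return {tok: math.log((1 + n) / (1 + c)) + 1.0 for tok, c in df.items()}


def tfidf(tokens, idf):
    """L2-normalized TF-IDF vector; tokens unseen in the corpus are dropped."""
    counts = Counter(tokens)
    vec = {t: c * idf[t] for t, c in counts.items() if t in idf}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def cosine(a, b):
    """Dot product of two normalized sparse vectors."""
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def knn_mesh(query, corpus, k=2, top_n=3):
    """Rank MeSH terms for `query` using its k nearest annotated documents."""
    token_lists = [tokenize(text) for text, _ in corpus]
    idf = build_idf(token_lists)
    doc_vecs = [tfidf(toks, idf) for toks in token_lists]
    q = tfidf(tokenize(query), idf)

    # Keep the k most similar documents.
    neighbors = sorted(
        ((cosine(q, v), i) for i, v in enumerate(doc_vecs)), reverse=True
    )[:k]

    # Each neighbor votes for its MeSH terms with weight equal to its similarity.
    scores = defaultdict(float)
    for sim, i in neighbors:
        for term in corpus[i][1]:
            scores[term] += sim

    # Highest score first; ties are broken alphabetically.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, score in ranked[:top_n] if score > 0]


if __name__ == "__main__":
    print(knn_mesh("gene expression in breast cancer", CORPUS, k=2, top_n=2))
```

In this sketch the same function could annotate either a document or a user's query; the returned terms could then be appended to the textual query before retrieval, which is one plausible reading of the query annotation setup described in the abstract.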
