
Improving MeSH classification of biomedical articles using citation contexts.

Affiliation

Department of Computer Science and Software Engineering, The University of Melbourne, Victoria 3010, Australia.

Publication information

J Biomed Inform. 2011 Oct;44(5):881-96. doi: 10.1016/j.jbi.2011.05.007. Epub 2011 Jun 12.

DOI: 10.1016/j.jbi.2011.05.007
PMID: 21683802

Abstract

Medical Subject Headings (MeSH) are used to index the majority of databases generated by the National Library of Medicine. Essentially, MeSH terms are designed to make information, such as scientific articles, more retrievable and accessible to users of systems such as PubMed. This paper proposes a novel method for automating the assignment of MeSH terms to biomedical publications that takes advantage of citation references to these publications. Our findings show that analysing the citation references that point to a document can provide a useful source of terms that are not present in the document itself. The use of these citation contexts, as they are known, can thus provide a richer document feature representation, which in turn can improve text mining and information retrieval applications, in our case MeSH term classification. We also explore new methods of selecting and utilising citation contexts. In particular, we assess the effect of weighting the importance of citation terms (found in the citation contexts) according to two aspects: (i) the section of the paper they appear in and (ii) their distance to the citation marker. We conduct intrinsic and extrinsic evaluations of citation term quality. For the intrinsic evaluation, we rely on the UMLS Metathesaurus conceptual database to explore the semantic characteristics of the mined citation terms. We also analyse the "informativeness" of these terms using a class-entropy measure. For the extrinsic evaluation, we run a series of automatic document classification experiments over MeSH terms. Our experimental evaluation shows that citation contexts contain terms that are related to the original document, and that integrating this knowledge results in better classification performance than two state-of-the-art MeSH classification systems: MeSHUP and MTI. Our experiments also demonstrate that considering the Section and Distance factors can lead to statistically significant improvements in citation feature quality, thus opening the way for better document feature representation in other biomedical text processing applications.
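
The two weighting aspects and the class-entropy measure mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no formulas, so the section weights, the `1 / (1 + decay * distance)` decay, the function names, and the single-label simplification of MeSH classification are all assumptions made here for illustration.

```python
import math
from collections import Counter, defaultdict

# Hypothetical section weights; the abstract does not report actual values.
SECTION_WEIGHTS = {
    "introduction": 0.5,
    "methods": 1.0,
    "results": 1.0,
    "discussion": 0.75,
}


def weight_citation_terms(contexts, decay=0.5, default_section_weight=0.5):
    """Score citation terms by section and by distance to the citation marker.

    contexts: iterable of (section, tokens, marker_index) tuples, where
    tokens is the tokenised citation context and marker_index is the
    position of the citation marker inside tokens.
    """
    scores = defaultdict(float)
    for section, tokens, marker_index in contexts:
        section_weight = SECTION_WEIGHTS.get(section, default_section_weight)
        for i, token in enumerate(tokens):
            if i == marker_index:
                continue  # the marker itself is not a term
            distance = abs(i - marker_index)
            # Assumed decay: terms closer to the marker contribute more.
            scores[token.lower()] += section_weight / (1.0 + decay * distance)
    return dict(scores)


def class_entropy(term, labelled_docs):
    """Entropy (in bits) of the class labels of documents containing term.

    labelled_docs: iterable of (set_of_terms, label) pairs. One label per
    document is a simplification; MeSH indexing is multi-label in practice.
    Lower entropy means the term is concentrated in fewer classes.
    """
    counts = Counter(label for terms, label in labelled_docs if term in terms)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


if __name__ == "__main__":
    context = ["we", "used", "the", "classifier", "[CIT]", "for", "indexing"]
    print(weight_citation_terms([("methods", context, 4)]))
    docs = [({"a", "b"}, "X"), ({"a"}, "Y"), ({"b"}, "X")]
    print(class_entropy("a", docs), class_entropy("b", docs))
```

Under these assumptions, a term next to the citation marker in a Methods section scores higher than the same term several tokens away or in an Introduction, and a term that occurs only in documents of one class has zero entropy.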


Similar articles

1. Improving MeSH classification of biomedical articles using citation contexts.
   J Biomed Inform. 2011 Oct;44(5):881-96. doi: 10.1016/j.jbi.2011.05.007. Epub 2011 Jun 12.
2. DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.
   Bioinformatics. 2016 Jun 15;32(12):i70-i79. doi: 10.1093/bioinformatics/btw294.
3. Ranking documents with a thesaurus.
   J Am Soc Inf Sci. 1989 Sep;40(5):304-10. doi: 10.1002/(SICI)1097-4571(198909)40:5<304::AID-ASI2>3.0.CO;2-6.
4. Using UMLS to map from a library to a clinical classification: Improving the functionality of a digital library.
   Stud Health Technol Inform. 2006;121:86-95.
5. Reflective random indexing for semi-automatic indexing of the biomedical literature.
   J Biomed Inform. 2010 Oct;43(5):694-700. doi: 10.1016/j.jbi.2010.04.001. Epub 2010 Apr 9.
6. The use of PubMed/Medline in psychiatry. 1: Presentation of NLM and PubMed.
   Nord J Psychiatry. 2006;60(4):299-304. doi: 10.1080/08039480600790390.
7. A knowledge-driven approach to biomedical document conceptualization.
   Artif Intell Med. 2010 Jun;49(2):67-78. doi: 10.1016/j.artmed.2010.02.005. Epub 2010 Apr 3.
8. Searching the literature using medical subject headings versus text word with PubMed.
   Laryngoscope. 2006 Feb;116(2):336-40. doi: 10.1097/01.mlg.0000195371.72887.a2.
9. Automatic Assignment of Non-Leaf MeSH Terms to Biomedical Articles.
   AMIA Annu Symp Proc. 2015 Nov 5;2015:697-706. eCollection 2015.
10. Application of a Medical Text Indexer to an online dermatology atlas.
    Stud Health Technol Inform. 2004;107(Pt 1):287-91.

Cited by

1. Candidate Key Proteins of Tinnitus in the Auditory and Motor Systems of the Thalamus.
   Int J Mol Sci. 2025 Jun 17;26(12):5804. doi: 10.3390/ijms26125804.
2. Enhancing Knowledge Graph Extraction and Validation From Scholarly Publications Using Bibliographic Metadata.
   Front Res Metr Anal. 2021 May 28;6:694307. doi: 10.3389/frma.2021.694307. eCollection 2021.
3. Network-based approach highlighting interplay among anti-hypertensives: target coding-genes: diseases.
   Sci Rep. 2020 Nov 19;10(1):20152. doi: 10.1038/s41598-020-76605-1.
4. Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research.
   BMC Med Inform Decis Mak. 2016 May 23;16:54. doi: 10.1186/s12911-016-0298-z.
5. Surveillance for the prevention of chronic diseases through information association.
   BMC Med Genomics. 2014 Jan 30;7:7. doi: 10.1186/1755-8794-7-7.
6. Do peers see more in a paper than its authors?
   Adv Bioinformatics. 2012;2012:750214. doi: 10.1155/2012/750214. Epub 2012 Nov 27.