• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医学主题词表现状:通过学习排序实现PubMed规模的自动医学主题词表索引编制。

MeSH Now: automatic MeSH indexing at PubMed scale via learning to rank.

作者信息

Mao Yuqing, Lu Zhiyong

机构信息

Nanjing University of Chinese Medicine, 138 Xianlin Avenue, Nanjing, Jiangsu, 210023, China.

National Center for Biotechnology Information (NCBI), 8600 Rockville Pike, Bethesda, MD, 20894, USA.

出版信息

J Biomed Semantics. 2017 Apr 17;8(1):15. doi: 10.1186/s13326-017-0123-3.

DOI:10.1186/s13326-017-0123-3
PMID:28412964
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5392968/
Abstract

BACKGROUND

MeSH indexing is the task of assigning relevant MeSH terms based on a manual reading of scholarly publications by human indexers. The task is highly important for improving literature retrieval and many other scientific investigations in biomedical research. Unfortunately, given its manual nature, the process of MeSH indexing is both time-consuming (new articles are not immediately indexed until 2 or 3 months later) and costly (approximately ten dollars per article). In response, automatic indexing by computers has been previously proposed and attempted but remains challenging. In order to advance the state of the art in automatic MeSH indexing, a community-wide shared task called BioASQ was recently organized.

METHODS

We propose MeSH Now, an integrated approach that first uses multiple strategies to generate a combined list of candidate MeSH terms for a target article. Through a novel learning-to-rank framework, MeSH Now then ranks the list of candidate terms based on their relevance to the target article. Finally, MeSH Now selects the highest-ranked MeSH terms via a post-processing module.

RESULTS

We assessed MeSH Now on two separate benchmarking datasets using traditional precision, recall and F-score metrics. In both evaluations, MeSH Now consistently achieved over 0.60 in F-score, ranging from 0.610 to 0.612. Furthermore, additional experiments show that MeSH Now can be optimized by parallel computing in order to process MEDLINE documents on a large scale.

CONCLUSIONS

We conclude that MeSH Now is a robust approach with state-of-the-art performance for automatic MeSH indexing and that MeSH Now is capable of processing PubMed scale documents within a reasonable time frame.

AVAILABILITY

http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/MeSHNow/ .

摘要

背景

医学主题词(MeSH)标引是人工标引员通过人工阅读学术出版物来分配相关MeSH词的任务。该任务对于改善生物医学研究中的文献检索及许多其他科学研究非常重要。不幸的是,鉴于其人工性质,MeSH标引过程既耗时(新文章在2至3个月后才会被立即标引)又昂贵(每篇文章约10美元)。作为回应,此前已提出并尝试通过计算机进行自动标引,但仍具有挑战性。为了推动自动MeSH标引技术的发展,最近组织了一项名为BioASQ的全社区共享任务。

方法

我们提出了MeSH Now,这是一种综合方法,首先使用多种策略为目标文章生成候选MeSH词的组合列表。然后,通过一个新颖的排序学习框架,MeSH Now根据候选词与目标文章的相关性对其列表进行排序。最后,MeSH Now通过后处理模块选择排名最高的MeSH词。

结果

我们使用传统的精确率、召回率和F值指标在两个单独的基准数据集上评估了MeSH Now。在这两项评估中,MeSH Now的F值始终超过0.60,范围从0.610到0.612。此外,额外的实验表明,MeSH Now可以通过并行计算进行优化,以便大规模处理MEDLINE文档。

结论

我们得出结论,MeSH Now是一种强大的自动MeSH标引方法,具有先进的性能,并且MeSH Now能够在合理的时间范围内处理PubMed规模的文档。

可用性

http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/MeSHNow/ 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a39/5392968/835ecd180e49/13326_2017_123_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a39/5392968/89447283de9f/13326_2017_123_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a39/5392968/835ecd180e49/13326_2017_123_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a39/5392968/89447283de9f/13326_2017_123_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a39/5392968/835ecd180e49/13326_2017_123_Fig2_HTML.jpg

相似文献

1
MeSH Now: automatic MeSH indexing at PubMed scale via learning to rank.医学主题词表现状:通过学习排序实现PubMed规模的自动医学主题词表索引编制。
J Biomed Semantics. 2017 Apr 17;8(1):15. doi: 10.1186/s13326-017-0123-3.
2
Recommending MeSH terms for annotating biomedical articles.推荐用于标注生物医学文章的 MeSH 术语。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):660-7. doi: 10.1136/amiajnl-2010-000055. Epub 2011 May 25.
3
MeSH indexing based on automatically generated summaries.基于自动生成的摘要进行 MeSH 标引。
BMC Bioinformatics. 2013 Jun 26;14:208. doi: 10.1186/1471-2105-14-208.
4
Influence of automated indexing in Medical Subject Headings (MeSH) selection for pharmacy practice journals.自动化索引对药学实践期刊的医学主题词(MeSH)选择的影响。
Res Social Adm Pharm. 2024 Sep;20(9):911-917. doi: 10.1016/j.sapharm.2024.06.003. Epub 2024 Jun 12.
5
A recent advance in the automatic indexing of the biomedical literature.生物医学文献自动标引的最新进展。
J Biomed Inform. 2009 Oct;42(5):814-23. doi: 10.1016/j.jbi.2008.12.007. Epub 2008 Dec 30.
6
Fine-grained indexing of the biomedical literature: MeSH subheading attachment for a MEDLINE indexing tool.生物医学文献的细粒度索引:用于MEDLINE索引工具的医学主题词副主题词附加
AMIA Annu Symp Proc. 2007 Oct 11;2007:553-7.
7
MEDRank: using graph-based concept ranking to index biomedical texts.MEDRank:基于图的概念排序在生物医学文本索引中的应用。
Int J Med Inform. 2011 Jun;80(6):431-41. doi: 10.1016/j.ijmedinf.2011.02.008. Epub 2011 Mar 25.
8
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.BIOASQ大规模生物医学语义索引与问答竞赛概述。
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.
9
NLM-Chem-BC7: manually annotated full-text resources for chemical entity annotation and indexing in biomedical articles.NLM-Chem-BC7:用于生物医学文章中化学实体注释和索引的人工标注全文资源。
Database (Oxford). 2022 Dec 1;2022. doi: 10.1093/database/baac102.
10
Automatic inference of indexing rules for MEDLINE.医学文献数据库(MEDLINE)索引规则的自动推理
BMC Bioinformatics. 2008 Nov 19;9 Suppl 11(Suppl 11):S11. doi: 10.1186/1471-2105-9-S11-S11.

引用本文的文献

1
The Perspective on Secondary Research Practices: A Cross-Sectional Analysis.二次研究实践的视角:一项横断面分析
Healthcare (Basel). 2025 Apr 17;13(8):927. doi: 10.3390/healthcare13080927.
2
Algorithmic indexing in MEDLINE frequently overlooks important concepts and may compromise literature search results.MEDLINE中的算法索引经常会忽略重要概念,可能会影响文献检索结果。
J Med Libr Assoc. 2025 Jan 14;113(1):39-48. doi: 10.5195/jmla.2025.1936.
3
Automatic Classification and Visualization of Text Data on Rare Diseases.罕见病文本数据的自动分类与可视化

本文引用的文献

1
Large-scale online semantic indexing of biomedical articles via an ensemble of multi-label classification models.通过多标签分类模型集成对生物医学文章进行大规模在线语义索引。
J Biomed Semantics. 2017 Sep 22;8(1):43. doi: 10.1186/s13326-017-0150-0.
2
Author Name Disambiguation for PubMed.PubMed的作者姓名消歧
J Assoc Inf Sci Technol. 2014 Apr;65(4):765-781. doi: 10.1002/asi.23063. Epub 2013 Nov 21.
3
Leveraging output term co-occurrence frequencies and latent associations in predicting medical subject headings.利用输出术语共现频率和潜在关联来预测医学主题词
J Pers Med. 2024 May 20;14(5):545. doi: 10.3390/jpm14050545.
4
What's next? Forecasting scientific research trends.接下来是什么?预测科学研究趋势。
Heliyon. 2023 Dec 15;10(1):e23781. doi: 10.1016/j.heliyon.2023.e23781. eCollection 2024 Jan 15.
5
ChatGPT in academic writing: Maximizing its benefits and minimizing the risks.ChatGPT 在学术写作中的应用:最大化其益处,最小化其风险。
Indian J Ophthalmol. 2023 Dec 1;71(12):3600-3606. doi: 10.4103/IJO.IJO_718_23. Epub 2023 Nov 20.
6
NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data.NeuroBridge:一个用于发现长尾神经影像数据的原型平台。
Front Neuroinform. 2023 Aug 31;17:1215261. doi: 10.3389/fninf.2023.1215261. eCollection 2023.
7
CureSCi Metadata Catalog-Making sickle cell studies findable.CureSCi 元数据目录——使镰状细胞研究可查找。
PLoS One. 2022 Dec 12;17(12):e0256248. doi: 10.1371/journal.pone.0256248. eCollection 2022.
8
Chemical identification and indexing in PubMed full-text articles using deep learning and heuristics.使用深度学习和启发式方法在 PubMed 全文文章中进行化学物质的识别和标引。
Database (Oxford). 2022 Jul 1;2022. doi: 10.1093/database/baac047.
9
Multi-probe attention neural network for COVID-19 semantic indexing.多探针注意力神经网络用于 COVID-19 语义索引。
BMC Bioinformatics. 2022 Jun 29;23(1):259. doi: 10.1186/s12859-022-04803-x.
10
Death and invasive mechanical ventilation risk in hospitalized COVID-19 patients treated with anti-SARS-CoV-2 monoclonal antibodies and/or antiviral agents: A systematic review and network meta-analysis protocol.标题:抗 SARS-CoV-2 单克隆抗体和/或抗病毒药物治疗的住院 COVID-19 患者死亡和有创机械通气风险的系统评价和网络荟萃分析方案 解析:原文的标题较长,在翻译时,采用“正题+副题”的形式,正题突出研究对象“抗 SARS-CoV-2 单克隆抗体和/或抗病毒药物治疗的住院 COVID-19 患者”,副题则是对研究内容的补充说明。
PLoS One. 2022 Jun 17;17(6):e0270196. doi: 10.1371/journal.pone.0270196. eCollection 2022.
Data Knowl Eng. 2014 Nov;94(B):189-201. doi: 10.1016/j.datak.2014.09.002. Epub 2014 Sep 18.
4
DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.深度医学主题词表:用于改进大规模医学主题词表索引的深度语义表示。
Bioinformatics. 2016 Jun 15;32(12):i70-i79. doi: 10.1093/bioinformatics/btw294.
5
MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence.医学主题词表(MeSH)标注器:通过整合多种证据提高大规模医学主题词表索引的准确性。
Bioinformatics. 2015 Jun 15;31(12):i339-47. doi: 10.1093/bioinformatics/btv237.
6
Stochastic Gradient Descent and the Prediction of MeSH for PubMed Records.随机梯度下降与PubMed记录的医学主题词预测
AMIA Annu Symp Proc. 2014 Nov 14;2014:1198-207. eCollection 2014.
7
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.BIOASQ大规模生物医学语义索引与问答竞赛概述。
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.
8
Three journal similarity metrics and their application to biomedical journals.三种期刊相似性指标及其在生物医学期刊中的应用。
PLoS One. 2014 Dec 23;9(12):e115681. doi: 10.1371/journal.pone.0115681. eCollection 2014.
9
LabeledIn: cataloging labeled indications for human drugs.领英:编目人用药品的标记适应症。
J Biomed Inform. 2014 Dec;52:448-56. doi: 10.1016/j.jbi.2014.08.004. Epub 2014 Aug 23.
10
Overview of the gene ontology task at BioCreative IV.生物创意IV基因本体任务概述。
Database (Oxford). 2014 Aug 25;2014. doi: 10.1093/database/bau086. Print 2014.