J Med Libr Assoc. 2023 Jul 10;111(3):684-694. doi: 10.5195/jmla.2023.1588.
In 2002, the National Library of Medicine (NLM) introduced semi-automated indexing of Medline using the Medical Text Indexer (MTI). In 2021, NLM announced that it would fully automate Medline indexing with an improved MTI by mid-2022. Using a sample of Medline records from 2000, this pilot study examines how the outputs of an early, public version of MTI compare to records created by human indexers.
The sample consists of twenty Medline records from 2000, before the MTI was introduced as a MeSH term recommender. We identified twenty higher- and lower-impact biomedical journals based on Journal Impact Factor (JIF) and examined the indexing of their papers by feeding the papers' PubMed records into the Interactive MTI tool.
In the sample, we found key differences between automated and human-indexed Medline records: MTI assigned more terms and used them more accurately for citations in the higher JIF group, and MTI tended to rank the Male check tag more highly than the Female check tag and to omit Aged check tags. Sometimes MTI chose more specific terms than human indexers but was inconsistent in applying specificity principles.
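A comparison of this kind can be sketched in a few lines of Python. This is only an illustrative sketch, not the study's procedure: the function name `compare_terms`, the choice to treat the human-assigned terms as the reference set, and the example term lists are all assumptions made here.

```python
def compare_terms(human_terms, mti_terms):
    """Compare human-assigned and MTI-suggested MeSH terms for one citation.

    Assumption: the human-indexed terms are treated as the reference set.
    """
    human = set(human_terms)
    mti = set(mti_terms)
    shared = human & mti
    return {
        "shared": sorted(shared),
        "human_only": sorted(human - mti),  # terms MTI omitted
        "mti_only": sorted(mti - human),    # terms only MTI suggested
        # Share of MTI terms that the human indexer also assigned
        "precision": len(shared) / len(mti) if mti else 0.0,
        # Share of human-assigned terms that MTI recovered
        "recall": len(shared) / len(human) if human else 0.0,
    }


# Hypothetical term lists for illustration only; not data from the study.
human = ["Humans", "Female", "Aged", "Hypertension"]
mti = ["Humans", "Male", "Hypertension", "Blood Pressure"]
result = compare_terms(human, mti)
print(result["human_only"])  # check tags present only in the human record
print(result["mti_only"])    # terms present only in the MTI output
```

Listing the `human_only` and `mti_only` terms per citation makes patterns such as an omitted Aged check tag, or a Male tag suggested where the human indexer chose Female, easy to spot across a sample.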
NLM's transition to fully automated indexing of the biomedical literature could introduce or perpetuate inconsistencies and biases in Medline. Librarians and searchers should assess changes to index terms and their impact on PubMed's mapping features for a range of topics. Future research should evaluate how well automated indexing supports finding clinical information effectively and performing systematic searches.