Névéol Aurélie, Shooshan Sonya E, Mork James G, Aronson Alan R
U.S. National Library of Medicine, Bethesda, Maryland; National Institutes of Health, Lister Hill Center for Biomedical Communications.
AMIA Annu Symp Proc. 2007 Oct 11;2007:553-7.
This paper reports on the latest results of an Indexing Initiative effort addressing the automatic attachment of subheadings to MeSH main headings recommended by the NLM's Medical Text Indexer.
Several linguistic and statistical approaches are used to retrieve and attach the subheadings. Continuing collaboration with NLM indexers also provided insight into how automatic methods can better enhance indexing practice.
The methods were evaluated on a corpus of 50,000 MEDLINE citations. For main heading/subheading pair recommendations, the best precision is obtained with a post-processing rule method (58%) while the best recall is obtained by pooling all methods (64%). For stand-alone subheading recommendations, the best performance is obtained with the PubMed Related Citations algorithm.
Significant progress has been made in terms of subheading coverage. After further evaluation, some of this work may be integrated in the MEDLINE indexing workflow.
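The evaluation reported above compares precision for a single method against recall for the pooled output of all methods. A minimal sketch of how such figures could be computed for one citation follows. It assumes that recommendations and reference indexing are both represented as sets of (main heading, subheading) pairs and that pooling means taking the union of the methods' outputs. The function names and the toy data are hypothetical illustrations, not the authors' implementation or results.

```python
from typing import Iterable, Set, Tuple

# A recommendation is a (main heading, subheading) pair.
Pair = Tuple[str, str]


def precision_recall(recommended: Set[Pair], reference: Set[Pair]) -> Tuple[float, float]:
    """Return (precision, recall) of recommended pairs against reference indexing."""
    correct = len(recommended & reference)
    precision = correct / len(recommended) if recommended else 0.0
    recall = correct / len(reference) if reference else 0.0
    return precision, recall


def pool(*method_outputs: Iterable[Pair]) -> Set[Pair]:
    """Pool several methods by taking the union of their recommendations (assumed)."""
    pooled: Set[Pair] = set()
    for output in method_outputs:
        pooled.update(output)
    return pooled


# Toy data for a single citation (hypothetical, for illustration only).
reference = {
    ("Neoplasms", "drug therapy"),
    ("Neoplasms", "genetics"),
    ("Liver", "pathology"),
}
rule_based = {("Neoplasms", "drug therapy")}
statistical = {("Neoplasms", "genetics"), ("Liver", "metabolism")}

rule_precision, rule_recall = precision_recall(rule_based, reference)
pooled_precision, pooled_recall = precision_recall(pool(rule_based, statistical), reference)
```

In this toy case the conservative rule-based set has higher precision, while the pooled set has higher recall at the cost of one incorrect pair, which mirrors the trade-off the abstract describes.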