Gay Clifford W, Kayaalp Mehmet, Aronson Alan R
Lister Hill National Center for Biomedical Communications, National Library of Medicine, Bethesda, MD 20894, USA.
AMIA Annu Symp Proc. 2005;2005:271-5.
The main application of U.S. National Library of Medicine's Medical Text Indexer (MTI) is to provide indexing recommendations to the Library's indexing staff. The current input to MTI consists of the titles and abstracts of articles to be indexed. This study reports on an extension of MTI to the full text of articles appearing in online medical journals that are indexed for Medline. Using a collection of 17 journal issues containing 500 articles, we report on the effectiveness of the contribution of terms by the whole article and also by each section. We obtain the best results using a model consisting of the sections Results, Results and Discussion, and Conclusions together with the article's title and abstract, the captions of tables and figures, and sections that have no titles. The resulting model provides indexing significantly better (7.4%) than what is currently achieved using only titles and abstracts.
美国国立医学图书馆的医学文本索引器(MTI)的主要应用是为该图书馆的索引编制人员提供索引建议。MTI目前的输入包括待索引文章的标题和摘要。本研究报告了MTI扩展到为Medline编制索引的在线医学期刊中文章全文的情况。我们使用包含500篇文章的17期期刊合集,报告了整篇文章以及各部分术语贡献的有效性。我们使用一个由“结果”“结果与讨论”“结论”部分以及文章标题、摘要、表格和图表标题以及无标题部分组成的模型获得了最佳结果。由此产生的模型提供的索引效果比目前仅使用标题和摘要时显著更好(提高了7.4%)。