Suppr超能文献

单词、概念或两者兼而有之:自动化信息检索的最佳索引单元。

Words, concepts, or both: optimal indexing units for automated information retrieval.

作者信息

Hersh W R, Hickam D H, Leone T J

机构信息

Biomedical Information Communication Center, Oregon Health Sciences University, Portland.

出版信息

Proc Annu Symp Comput Appl Med Care. 1992:644-8.

Abstract

What is the best way to represent the content of documents in an information retrieval system? This study compares the retrieval effectiveness of five different methods for automated (machine-assigned) indexing using three test collections. The consistently best methods are those that use indexing based on the words that occur in the available text of each document. Methods used to map text into concepts from a controlled vocabulary showed no advantage over the word-based methods. This study also looked at an approach to relevance feedback which showed benefit for both word-based and concept-based methods.

摘要

在信息检索系统中,呈现文档内容的最佳方式是什么?本研究使用三个测试集比较了五种不同的自动(机器分配)索引方法的检索效果。始终表现最佳的方法是那些基于每个文档可用文本中出现的单词进行索引的方法。用于将文本映射到来自受控词汇表的概念的方法与基于单词的方法相比没有优势。本研究还研究了一种相关反馈方法,该方法对基于单词的方法和基于概念的方法都有好处。

相似文献

5
A comparison of retrieval effectiveness for three methods of indexing medical literature.
Am J Med Sci. 1992 May;303(5):292-300. doi: 10.1097/00000441-199205000-00004.
7
Font adaptive word indexing of modern printed documents.
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1187-99. doi: 10.1109/TPAMI.2006.162.
8
The SAPHIRE server: a new algorithm and implementation.
Proc Annu Symp Comput Appl Med Care. 1995:858-62.
9
An automatic indexing method for medical documents.
Proc Annu Symp Comput Appl Med Care. 1991:1011-7.

引用本文的文献

1
Integrating query of relational and textual data in clinical databases: a case study.
J Am Med Inform Assoc. 2003 Jan-Feb;10(1):21-38. doi: 10.1197/jamia.m1133.
2
UMLS concept indexing for production databases: a feasibility study.
J Am Med Inform Assoc. 2001 Jan-Feb;8(1):80-91. doi: 10.1136/jamia.2001.0080080.
5
Towards new measures of information retrieval evaluation.
Proc Annu Symp Comput Appl Med Care. 1994:895-9. doi: 10.1145/215206.215355.

本文引用的文献

1
A method of comparing the areas under receiver operating characteristic curves derived from the same cases.
Radiology. 1983 Sep;148(3):839-43. doi: 10.1148/radiology.148.3.6878708.
2
Indexing consistency in MEDLINE.
Bull Med Libr Assoc. 1983 Apr;71(2):176-83.
4
A comparison of retrieval effectiveness for three methods of indexing medical literature.
Am J Med Sci. 1992 May;303(5):292-300. doi: 10.1097/00000441-199205000-00004.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验