A performance and failure analysis of SAPHIRE with a MEDLINE test collection.

Authors

Hersh W R, Hickam D H, Haynes R B, McKibbon K A

Affiliation

Biomedical Information Communication Center, Oregon Health Sciences University, Portland 97201, USA.

Publication

J Am Med Inform Assoc. 1994 Jan-Feb;1(1):51-60. doi: 10.1136/jamia.1994.95236136.

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC116184/

Abstract

OBJECTIVE

Assess the performance of the SAPHIRE automated information retrieval system.

DESIGN

Comparative study of automated and human searching of a MEDLINE test collection.

MEASUREMENTS

Recall and precision of SAPHIRE were compared with those attributes of novice physicians, expert physicians, and librarians for a test collection of 75 queries and 2,334 citations. Failure analysis assessed the efficacy of the Metathesaurus as a concept vocabulary; the reasons for retrieval of nonrelevant articles and nonretrieval of relevant articles; and the effect of changing the weighting formula for relevance ranking of retrieved articles.
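The two measures named above can be illustrated with a minimal sketch. It assumes the standard definitions (recall = relevant citations retrieved / all relevant citations; precision = relevant citations retrieved / all citations retrieved). The function name and the citation IDs are hypothetical and are not data from the study.

```python
def recall_precision(retrieved, relevant):
    """Return (recall, precision) for a single query.

    retrieved: citation IDs returned by a searcher or system
    relevant:  citation IDs judged relevant for the query
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)  # relevant citations that were retrieved
    recall = hits / len(relevant) if relevant else 0.0
    precision = hits / len(retrieved) if retrieved else 0.0
    return recall, precision


# Hypothetical citation IDs for one query (illustration only).
retrieved = ["c1", "c2", "c3", "c4"]
relevant = ["c2", "c4", "c7", "c8", "c9"]

recall, precision = recall_precision(retrieved, relevant)
print(f"recall={recall:.2f} precision={precision:.2f}")
```

In a comparison like the one described, these per-query values would typically be averaged over all queries for each searcher group; the abstract does not say how the averaging was done.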

RESULTS

Recall and precision of SAPHIRE were comparable to those of both physician groups, but less than those of librarians.

CONCLUSION

The current version of the Metathesaurus, as utilized by SAPHIRE, was unable to represent the conceptual content of one-fourth of physician-generated MEDLINE queries. The most likely cause for retrieval of nonrelevant articles was the presence of some or all of the search terms in the article, with frequencies high enough to lead to retrieval. The most likely cause for nonretrieval of relevant articles was the absence of the actual terms from the query, with synonyms or hierarchically related terms present instead. There were significant variations in performance when SAPHIRE's concept-weighting formulas were modified.
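The abstract does not give SAPHIRE's weighting formula. As a rough illustration of why the choice of formula affects ranking, and of how a document that uses only a synonym can be missed, here is a sketch using a generic frequency-times-inverse-document-frequency weight. The formula, the function name, and the toy documents are all assumptions made for illustration, not the system's actual method.

```python
import math
from collections import Counter


def rank_by_concept_weight(query_concepts, docs):
    """Rank documents by a generic TF-IDF-style concept weight.

    docs maps a document ID to the list of concepts found in it.
    This is an assumed, textbook-style weighting, not SAPHIRE's formula.
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for concepts in docs.values():
        doc_freq.update(set(concepts))  # number of documents containing each concept

    scores = {}
    for doc_id, concepts in docs.items():
        counts = Counter(concepts)
        score = 0.0
        for concept in set(query_concepts):
            if counts[concept]:
                idf = math.log(n_docs / doc_freq[concept]) + 1.0
                score += counts[concept] * idf
        if score > 0:  # documents matching no query concept are not retrieved
            scores[doc_id] = score
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy collection (hypothetical). d3 uses only a synonym of the query concept.
docs = {
    "d1": ["hypertension", "hypertension", "diuretics"],
    "d2": ["hypertension"],
    "d3": ["high blood pressure"],
}
ranking = rank_by_concept_weight(["hypertension", "diuretics"], docs)
print(ranking)
```

Here d1 ranks first because the query concepts occur most often in it, and d3 is not retrieved at all because it contains only a synonym that this sketch does not map to the query concept. Changing how frequency or rarity is weighted changes the scores and can change the ordering.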
