Using Medline queries to generate image retrieval tasks for benchmarking.

Authors

Müller Henning, Kalpathy-Cramer Jayashree, Hersh William, Geissbuhler Antoine

Affiliation

Medical Informatics Service, University & Hospitals of Geneva, Geneva, Switzerland.

Publication

Stud Health Technol Inform. 2008;136:523-8.

PMID:18487784

Abstract

Medical visual information retrieval has been a very active research area over the past ten years, as an increasing number of images are produced digitally and made available in the electronic patient record. Tools are required to give access to these images and to exploit the information inherently stored in medical cases that include images. To compare the image retrieval techniques of research prototypes on the same data and tasks, ImageCLEF was started in 2003, and a medical task was added in 2004. Every year since then, a database has been distributed, tasks have been developed, and systems have been compared on realistic search tasks and large databases. In 2007, a set of almost 68,000 images was distributed among the 38 research groups registered for the medical retrieval task. Realistic query topics were developed from a Medline log file containing the queries performed on PubMed during 24 hours. Most queries could not be used directly as search topics because they do not contain image-related themes, but a few thousand do. Other types of queries had to be filtered out as well, as many of the stated information needs are very vague; evaluation, on the other hand, requires clear and focused topics to obtain a limited number of relevant documents and to limit ambiguity in the evaluation process. In the end, 30 queries were developed, and 13 research groups submitted a total of 149 runs using a large variety of techniques, from textual to purely visual retrieval and multi-modal approaches.
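
The query filtering described in the abstract might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' procedure: the abstract does not give the actual filter criteria, so the term list `IMAGE_TERMS`, the function names, and the `min_words` threshold (a crude stand-in for discarding vague queries) are all hypothetical.

```python
import re

# Hypothetical imaging-related terms; the abstract does not list the real criteria.
IMAGE_TERMS = ("x-ray", "xray", "ct", "mri", "ultrasound",
               "radiograph", "image", "images", "imaging", "scan")

# Match any term as a whole word, case-insensitively.
_PATTERN = re.compile(
    r"\b(?:" + "|".join(re.escape(t) for t in IMAGE_TERMS) + r")\b",
    re.IGNORECASE,
)


def is_image_related(query: str) -> bool:
    """Return True if the query mentions at least one assumed imaging term."""
    return _PATTERN.search(query) is not None


def candidate_topics(queries, min_words=2):
    """Deduplicate queries and keep those that look image-related.

    min_words is an assumed, very rough proxy for dropping vague queries;
    manual review would still be needed to arrive at focused topics.
    """
    seen = set()
    kept = []
    for query in queries:
        # Normalize case and whitespace so repeated queries collapse to one.
        normalized = " ".join(query.lower().split())
        if normalized in seen:
            continue
        seen.add(normalized)
        if len(normalized.split()) >= min_words and is_image_related(normalized):
            kept.append(normalized)
    return kept
```

A filter like this would only narrow a day's log down to candidates; selecting the final 30 topics, as the abstract notes, required clear and focused information needs, which a keyword match alone cannot judge.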
