Névéol Aurélie, Deserno Thomas M, Darmoni Stéfan J, Güld Mark Oliver, Aronson Alan R
U.S. National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894. E-mail:
J Am Soc Inf Sci Technol. 2008 Sep 18;60(1):123-134. doi: 10.1002/asi.20955.
One of the most significant recent advances in health information systems has been the shift from paper to electronic documents. While research on automatic text and image processing has taken separate paths, there is a growing need for joint efforts, particularly for electronic health records and biomedical literature databases. This work aims at comparing text-based versus image-based access to multimodal medical documents using state-of-the-art methods of processing text and image components. A collection of 180 medical documents containing an image accompanied by a short text describing it was divided into training and test sets. Content-based image analysis and natural language processing techniques are applied individually and combined for multimodal document analysis. The evaluation consists of an indexing task and a retrieval task based on the "gold standard" codes manually assigned to corpus documents. The performance of text-based and image-based access, as well as combined document features, is compared. Image analysis proves more adequate for both the indexing and retrieval of the images. In the indexing task, multimodal analysis outperforms both independent image and text analysis. This experiment shows that text describing images can be usefully analyzed in the framework of a hybrid text/image retrieval system.
健康信息系统最近最重要的进展之一是从纸质文档向电子文档的转变。虽然对自动文本和图像处理的研究各自发展,但越来越需要共同努力,特别是在电子健康记录和生物医学文献数据库方面。这项工作旨在使用最先进的文本和图像组件处理方法,比较基于文本和基于图像的多模态医学文档访问方式。收集了180份包含图像及描述该图像的简短文本的医学文档,并将其分为训练集和测试集。基于内容的图像分析和自然语言处理技术分别应用并结合用于多模态文档分析。评估包括基于手动分配给语料库文档的“黄金标准”代码的索引任务和检索任务。比较了基于文本和基于图像的访问性能以及组合文档特征。图像分析在图像索引和检索方面都更适用。在索引任务中,多模态分析优于独立的图像和文本分析。该实验表明,在混合文本/图像检索系统框架中,可以有效地分析描述图像的文本。