Kurtz Camille, Beaulieu Christopher F, Napel Sandy, Rubin Daniel L
Department of Radiology, School of Medicine, Stanford University, USA; LIPADE (EA 2517), University Paris Descartes, France.
Department of Radiology, School of Medicine, Stanford University, USA.
J Biomed Inform. 2014 Jun;49:227-44. doi: 10.1016/j.jbi.2014.02.018. Epub 2014 Mar 12.
Computer-assisted image retrieval applications could assist radiologist interpretations by identifying similar images in large archives as a means to providing decision support. However, the semantic gap between low-level image features and their high level semantics may impair the system performances. Indeed, it can be challenging to comprehensively characterize the images using low-level imaging features to fully capture the visual appearance of diseases on images, and recently the use of semantic terms has been advocated to provide semantic descriptions of the visual contents of images. However, most of the existing image retrieval strategies do not consider the intrinsic properties of these terms during the comparison of the images beyond treating them as simple binary (presence/absence) features. We propose a new framework that includes semantic features in images and that enables retrieval of similar images in large databases based on their semantic relations. It is based on two main steps: (1) annotation of the images with semantic terms extracted from an ontology, and (2) evaluation of the similarity of image pairs by computing the similarity between the terms using the Hierarchical Semantic-Based Distance (HSBD) coupled to an ontological measure. The combination of these two steps provides a means of capturing the semantic correlations among the terms used to characterize the images that can be considered as a potential solution to deal with the semantic gap problem. We validate this approach in the context of the retrieval and the classification of 2D regions of interest (ROIs) extracted from computed tomographic (CT) images of the liver. Under this framework, retrieval accuracy of more than 0.96 was obtained on a 30-images dataset using the Normalized Discounted Cumulative Gain (NDCG) index that is a standard technique used to measure the effectiveness of information retrieval algorithms when a separate reference standard is available. Classification results of more than 95% were obtained on a 77-images dataset. For comparison purpose, the use of the Earth Mover's Distance (EMD), which is an alternative distance metric that considers all the existing relations among the terms, led to results retrieval accuracy of 0.95 and classification results of 93% with a higher computational cost. The results provided by the presented framework are competitive with the state-of-the-art and emphasize the usefulness of the proposed methodology for radiology image retrieval and classification.
计算机辅助图像检索应用程序可以通过在大型档案中识别相似图像来辅助放射科医生进行解读,以此提供决策支持。然而,低级图像特征与其高级语义之间的语义鸿沟可能会损害系统性能。实际上,使用低级成像特征全面表征图像以充分捕捉图像上疾病的视觉外观可能具有挑战性,最近有人主张使用语义术语来提供图像视觉内容的语义描述。然而,大多数现有的图像检索策略在图像比较过程中并未考虑这些术语的内在属性,只是将它们视为简单的二元(存在/不存在)特征。我们提出了一个新框架,该框架将语义特征纳入图像,并能够基于语义关系在大型数据库中检索相似图像。它基于两个主要步骤:(1)使用从本体中提取的语义术语对图像进行标注,以及(2)通过使用基于层次语义的距离(HSBD)结合本体度量来计算术语之间的相似度,从而评估图像对的相似度。这两个步骤的结合提供了一种捕捉用于表征图像的术语之间语义相关性的方法,可被视为解决语义鸿沟问题的潜在方案。我们在从肝脏计算机断层扫描(CT)图像中提取的二维感兴趣区域(ROI)的检索和分类背景下验证了这种方法。在此框架下,使用归一化折损累计增益(NDCG)指数在一个30幅图像的数据集上获得了超过0.96的检索准确率,NDCG指数是在有单独参考标准时用于衡量信息检索算法有效性的标准技术。在一个77幅图像的数据集上获得了超过95%的分类结果。为作比较,使用推土机距离(EMD),它是一种考虑术语之间所有现有关系的替代距离度量,导致检索准确率为0.95,分类结果为93%,但计算成本更高。所提出框架提供的结果与现有技术具有竞争力,并强调了所提出方法在放射学图像检索和分类中的实用性。