Learning semantic and visual similarity for endomicroscopy video retrieval.
Affiliation
Mauna Kea Technologies, 75010 Paris, France.
Publication information
IEEE Trans Med Imaging. 2012 Jun;31(6):1276-88. doi: 10.1109/TMI.2012.2188301. Epub 2012 Feb 16.
Content-based image retrieval (CBIR) is a valuable computer vision technique which is increasingly being applied in the medical community for diagnosis support. However, traditional CBIR systems only deliver visual outputs, i.e., images having a similar appearance to the query, which are not directly interpretable by physicians. Our objective is to provide a system for endomicroscopy video retrieval which delivers both visual and semantic outputs that are consistent with each other. In a previous study, we developed an adapted bag-of-visual-words method for endomicroscopy retrieval, called "Dense-Sift," that computes a visual signature for each video. In this paper, we present a novel approach to complement visual similarity learning with semantic knowledge extraction, in the field of in vivo endomicroscopy. We first leverage a semantic ground truth based on eight binary concepts in order to transform these visual signatures into semantic signatures that reflect how much the presence of each semantic concept is expressed by the visual words describing the videos. Using cross-validation, we demonstrate that, in terms of semantic detection, our intuitive Fisher-based method transforming visual-word histograms into semantic estimations outperforms support vector machine (SVM) methods with statistical significance. In a second step, we propose to improve retrieval relevance by learning an adjusted similarity distance from a perceived similarity ground truth. As a result, our distance learning method yields a statistically significant improvement in correlation with the perceived similarity. We also demonstrate that, in terms of perceived similarity, the recall performance of the semantic signatures is close to that of the visual signatures and significantly better than that of several state-of-the-art CBIR methods. The semantic signatures are thus able to communicate high-level medical knowledge while remaining consistent with the low-level visual signatures and being much more compact.
In our resulting retrieval system, we use visual signatures for perceived similarity learning and retrieval, and semantic signatures to output additional information, expressed in the endoscopist's own language, which provides a relevant semantic translation of the visual retrieval outputs.
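The pipeline described above — projecting a bag-of-visual-words histogram onto per-concept scores, then comparing signatures with a distance fitted to perceived similarity — can be sketched as follows. This is a minimal illustrative sketch, not the paper's exact method: the vocabulary size, the per-word concept weights (which the paper derives with a Fisher-based approach from the eight-concept ground truth), and the uniform distance weights are all hypothetical stand-ins.

```python
import numpy as np

N_WORDS = 12      # toy visual vocabulary size (hypothetical)
N_CONCEPTS = 8    # eight binary semantic concepts, as in the abstract

rng = np.random.default_rng(0)

# Hypothetical per-word concept weights; in the paper these would be learned
# from annotated training videos (here: random placeholders).
word_to_concept = rng.random((N_WORDS, N_CONCEPTS))

def semantic_signature(histogram):
    """Project an L1-normalized visual-word histogram onto concept scores."""
    h = histogram / histogram.sum()
    return h @ word_to_concept          # shape: (N_CONCEPTS,)

def weighted_distance(s1, s2, weights):
    """Weighted Euclidean distance; the weights would be fitted to a perceived
    similarity ground truth (here: uniform placeholders)."""
    return float(np.sqrt(np.sum(weights * (s1 - s2) ** 2)))

# Toy visual-word histograms for two videos.
hist_a = rng.integers(1, 20, N_WORDS).astype(float)
hist_b = rng.integers(1, 20, N_WORDS).astype(float)

sig_a = semantic_signature(hist_a)
sig_b = semantic_signature(hist_b)
w = np.ones(N_CONCEPTS)
print(weighted_distance(sig_a, sig_b, w))
```

Note that the semantic signature has only eight entries — one per concept — which is why it is far more compact than the visual-word histogram while still supporting retrieval.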