• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用对比多模态图像表示的跨模态子图像检索。

Cross-modality sub-image retrieval using contrastive multimodal image representations.

作者信息

Breznik Eva, Wetzer Elisabeth, Lindblad Joakim, Sladoje Nataša

机构信息

Department of Information Technology, Uppsala University, 751 05, Uppsala, Sweden.

Department of Biomedical Engineering and Health Systems, Royal Institute of Technology, 141 52, Stockholm, Sweden.

出版信息

Sci Rep. 2024 Aug 13;14(1):18798. doi: 10.1038/s41598-024-68800-1.

DOI:10.1038/s41598-024-68800-1
PMID:39138271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11322435/
Abstract

In tissue characterization and cancer diagnostics, multimodal imaging has emerged as a powerful technique. Thanks to computational advances, large datasets can be exploited to discover patterns in pathologies and improve diagnosis. However, this requires efficient and scalable image retrieval methods. Cross-modality image retrieval is particularly challenging, since images of similar (or even the same) content captured by different modalities might share few common structures. We propose a new application-independent content-based image retrieval (CBIR) system for reverse (sub-)image search across modalities, which combines deep learning to generate representations (embedding the different modalities in a common space) with robust feature extraction and bag-of-words models for efficient and reliable retrieval. We illustrate its advantages through a replacement study, exploring a number of feature extractors and learned representations, as well as through comparison to recent (cross-modality) CBIR methods. For the task of (sub-)image retrieval on a (publicly available) dataset of brightfield and second harmonic generation microscopy images, the results show that our approach is superior to all tested alternatives. We discuss the shortcomings of the compared methods and observe the importance of equivariance and invariance properties of the learned representations and feature extractors in the CBIR pipeline. Code is available at: https://github.com/MIDA-group/CrossModal_ImgRetrieval .

摘要

在组织表征和癌症诊断中,多模态成像已成为一种强大的技术。由于计算技术的进步,可以利用大型数据集来发现病理模式并改善诊断。然而,这需要高效且可扩展的图像检索方法。跨模态图像检索尤其具有挑战性,因为不同模态捕获的相似(甚至相同)内容的图像可能几乎没有共同结构。我们提出了一种新的与应用无关的基于内容的图像检索(CBIR)系统,用于跨模态的反向(子)图像搜索,该系统将深度学习用于生成表示(将不同模态嵌入到一个公共空间中)与强大的特征提取和词袋模型相结合,以实现高效可靠的检索。我们通过一项替换研究展示了它的优势,探索了多种特征提取器和学习到的表示,还与最近的(跨模态)CBIR方法进行了比较。对于在明场和二次谐波产生显微镜图像的(公开可用)数据集上进行(子)图像检索的任务,结果表明我们的方法优于所有测试的替代方法。我们讨论了比较方法的缺点,并观察了在CBIR管道中学习到的表示和特征提取器的等变性和不变性属性的重要性。代码可在以下网址获取:https://github.com/MIDA-group/CrossModal_ImgRetrieval 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/73f879638815/41598_2024_68800_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/b165d1de7df1/41598_2024_68800_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/1b7b819db684/41598_2024_68800_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/38326922a06a/41598_2024_68800_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/73f879638815/41598_2024_68800_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/b165d1de7df1/41598_2024_68800_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/1b7b819db684/41598_2024_68800_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/38326922a06a/41598_2024_68800_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bab5/11322435/73f879638815/41598_2024_68800_Fig4_HTML.jpg

相似文献

1
Cross-modality sub-image retrieval using contrastive multimodal image representations.使用对比多模态图像表示的跨模态子图像检索。
Sci Rep. 2024 Aug 13;14(1):18798. doi: 10.1038/s41598-024-68800-1.
2
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training.高效的基于令牌的图像-文本检索与一致的多模态对比训练。
IEEE Trans Image Process. 2023;32:3622-3633. doi: 10.1109/TIP.2023.3286710. Epub 2023 Jul 3.
3
Scalable Database Indexing and Fast Image Retrieval Based on Deep Learning and Hierarchically Nested Structure Applied to Remote Sensing and Plant Biology.基于深度学习和层次嵌套结构的可扩展数据库索引与快速图像检索及其在遥感和植物生物学中的应用
J Imaging. 2019 Mar 1;5(3):33. doi: 10.3390/jimaging5030033.
4
Medical Image Retrieval: A Multimodal Approach.医学图像检索:一种多模态方法。
Cancer Inform. 2015 Jul 22;13(Suppl 3):125-36. doi: 10.4137/CIN.S14053. eCollection 2014.
5
Learning image representations for content-based image retrieval of radiotherapy treatment plans.学习用于放射治疗计划基于内容的图像检索的图像表示。
Phys Med Biol. 2023 May 3;68(9). doi: 10.1088/1361-6560/accdb0.
6
Bidirectional feature matching based on deep pairwise contrastive learning for multiparametric MRI image synthesis.基于深度成对对比学习的双向特征匹配在多参数 MRI 图像合成中的应用。
Phys Med Biol. 2023 Jun 15;68(12). doi: 10.1088/1361-6560/acda78.
7
A Novel Trademark Image Retrieval System Based on Multi-Feature Extraction and Deep Networks.一种基于多特征提取和深度网络的新型商标图像检索系统。
J Imaging. 2022 Sep 2;8(9):238. doi: 10.3390/jimaging8090238.
8
Content Based Image Retrieval by Using Color Descriptor and Discrete Wavelet Transform.基于颜色描述符和离散小波变换的图像检索。
J Med Syst. 2018 Jan 25;42(3):44. doi: 10.1007/s10916-017-0880-7.
9
Is image-to-image translation the panacea for multimodal image registration? A comparative study.图像到图像的翻译是否是多模态图像配准的万能药?一项对比研究。
PLoS One. 2022 Nov 28;17(11):e0276196. doi: 10.1371/journal.pone.0276196. eCollection 2022.
10
Saliency-Enhanced Content-Based Image Retrieval for Diagnosis Support in Dermatology Consultation: Reader Study.用于皮肤科会诊诊断支持的显著增强型基于内容的图像检索:读者研究
JMIR Dermatol. 2023 Aug 24;6:e42129. doi: 10.2196/42129.

引用本文的文献

1
Applications of Artificial Intelligence, Deep Learning, and Machine Learning to Support the Analysis of Microscopic Images of Cells and Tissues.人工智能、深度学习和机器学习在支持细胞与组织微观图像分析中的应用。
J Imaging. 2025 Feb 15;11(2):59. doi: 10.3390/jimaging11020059.
2
Multi-modal medical image classification using deep residual network and genetic algorithm.基于深度残差网络和遗传算法的多模态医学图像分类。
PLoS One. 2023 Jun 29;18(6):e0287786. doi: 10.1371/journal.pone.0287786. eCollection 2023.

本文引用的文献

1
Influence of hematoxylin and eosin staining on the quantitative analysis of second harmonic generation imaging of fixed tissue sections.苏木精和伊红染色对固定组织切片二次谐波产生成像定量分析的影响
Biomed Opt Express. 2021 Aug 25;12(9):5829-5843. doi: 10.1364/BOE.428701. eCollection 2021 Sep 1.
2
Deep triplet hashing network for case-based medical image retrieval.基于实例的医学图像检索的深度三重哈希网络。
Med Image Anal. 2021 Apr;69:101981. doi: 10.1016/j.media.2021.101981. Epub 2021 Feb 3.
3
Deep Bayesian Hashing With Center Prior for Multi-Modal Neuroimage Retrieval.
基于中心先验的深度贝叶斯哈希的多模态神经影像检索
IEEE Trans Med Imaging. 2021 Feb;40(2):503-513. doi: 10.1109/TMI.2020.3030752. Epub 2021 Feb 2.
4
Digital pathology and computational image analysis in nephropathology.数字病理学和肾脏病学中的计算图像分析。
Nat Rev Nephrol. 2020 Nov;16(11):669-685. doi: 10.1038/s41581-020-0321-6. Epub 2020 Aug 26.
5
Interactive thyroid whole slide image diagnostic system using deep representation.基于深度表示的交互式甲状腺全切片图像诊断系统
Comput Methods Programs Biomed. 2020 Oct;195:105630. doi: 10.1016/j.cmpb.2020.105630. Epub 2020 Jun 27.
6
2DKD: a toolkit for content-based local image search.2DKD:一种基于内容的局部图像搜索工具包。
Source Code Biol Med. 2020 Feb 10;15:1. doi: 10.1186/s13029-020-0077-1. eCollection 2020.
7
Deep Learning-Based Retrieval System for Gigapixel Histopathology Cases and the Open Access Literature.基于深度学习的千兆像素组织病理学病例及开放获取文献检索系统
J Pathol Inform. 2019 Jul 1;10:19. doi: 10.4103/jpi.jpi_88_18. eCollection 2019.
8
Similar image search for histopathology: SMILY.用于组织病理学的相似图像搜索:SMILY。
NPJ Digit Med. 2019 Jun 21;2:56. doi: 10.1038/s41746-019-0131-z. eCollection 2019.
9
Second-harmonic generation imaging of cancer.癌症的二次谐波产生成像
Methods Cell Biol. 2014;123:531-46. doi: 10.1016/B978-0-12-420138-5.00028-8.
10
Comparison of image patches using local moment invariants.基于局部矩不变量的图像块比较。
IEEE Trans Image Process. 2014 May;23(5):2369-79. doi: 10.1109/TIP.2014.2315923.