• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于视觉手性线索的镜像检测

Mirror Detection With the Visual Chirality Cue.

作者信息

Tan Xin, Lin Jiaying, Xu Ke, Chen Pan, Ma Lizhuang, Lau Rynson W H

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3492-3504. doi: 10.1109/TPAMI.2022.3181030. Epub 2023 Feb 3.

DOI:10.1109/TPAMI.2022.3181030
PMID:35687623
Abstract

Mirror detection is challenging because the visual appearances of mirrors change depending on those of their surroundings. As existing mirror detection methods are mainly based on extracting contextual contrast and relational similarity between mirror and non-mirror regions, they may fail to identify a mirror region if these assumptions are violated. Inspired by a recent study of applying a CNN to help distinguish whether an image is flipped or not based on the visual chirality property, in this paper, we rethink this image-level visual chirality property and reformulate it as a learnable pixel level cue for mirror detection. Specifically, we first propose a novel flipping-convolution-flipping (FCF) transformation to model visual chirality as learnable commutative residual. We then propose a novel visual chirality embedding (VCE) module to exploit this commutative residual in multi-scale feature maps, to embed the visual chirality features into our mirror detection model. Besides, we also propose a visual chirality-guided edge detection (CED) module to integrate the visual chirality features with contextual features for detection refinement. Extensive experiments show that the proposed method outperforms state-of-the-art methods on three benchmark datasets.

摘要

镜像检测具有挑战性,因为镜子的视觉外观会根据其周围环境而变化。由于现有的镜像检测方法主要基于提取镜像区域和非镜像区域之间的上下文对比度和关系相似度,如果这些假设不成立,它们可能无法识别镜像区域。受最近一项关于应用卷积神经网络(CNN)基于视觉手性属性帮助区分图像是否翻转的研究启发,在本文中,我们重新思考这种图像级别的视觉手性属性,并将其重新表述为用于镜像检测的可学习像素级线索。具体而言,我们首先提出一种新颖的翻转-卷积-翻转(FCF)变换,将视觉手性建模为可学习的交换残差。然后,我们提出一种新颖的视觉手性嵌入(VCE)模块,以利用多尺度特征图中的这种交换残差,将视觉手性特征嵌入到我们的镜像检测模型中。此外,我们还提出了一种视觉手性引导的边缘检测(CED)模块,将视觉手性特征与上下文特征集成以进行检测细化。大量实验表明,所提出的方法在三个基准数据集上优于现有方法。

相似文献

1
Mirror Detection With the Visual Chirality Cue.基于视觉手性线索的镜像检测
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3492-3504. doi: 10.1109/TPAMI.2022.3181030. Epub 2023 Feb 3.
2
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
3
CSFwinformer: Cross-Space-Frequency Window Transformer for Mirror Detection.
IEEE Trans Image Process. 2024;33:1853-1867. doi: 10.1109/TIP.2024.3372468. Epub 2024 Mar 12.
4
Conditional Feature Embedding by Visual Clue Correspondence Graph for Person Re-Identification.基于视觉线索对应图的条件特征嵌入用于行人重识别
IEEE Trans Image Process. 2022;31:6188-6199. doi: 10.1109/TIP.2022.3206617. Epub 2022 Sep 28.
5
HARNU-Net: Hierarchical Attention Residual Nested U-Net for Change Detection in Remote Sensing Images.HARNU-Net:用于遥感图像变化检测的分层注意力残差嵌套 U-Net
Sensors (Basel). 2022 Jun 19;22(12):4626. doi: 10.3390/s22124626.
6
MDAN: Mirror Difference Aware Network for Brain Stroke Lesion Segmentation.MDAN:用于脑卒中风损分割的镜像差异感知网络。
IEEE J Biomed Health Inform. 2022 Apr;26(4):1628-1639. doi: 10.1109/JBHI.2021.3113460. Epub 2022 Apr 14.
7
Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation.语境翻译嵌入的视觉关系检测和场景图生成。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3820-3832. doi: 10.1109/TPAMI.2020.2992222. Epub 2021 Oct 1.
8
Referring Segmentation in Images and Videos With Cross-Modal Self-Attention Network.基于跨模态自注意力网络的图像和视频指代分割
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3719-3732. doi: 10.1109/TPAMI.2021.3054384. Epub 2022 Jun 3.
9
Feature Aggregation and Propagation Network for Camouflaged Object Detection.用于伪装目标检测的特征聚合与传播网络
IEEE Trans Image Process. 2022;31:7036-7047. doi: 10.1109/TIP.2022.3217695. Epub 2022 Nov 14.
10
Guided multi-scale refinement network for camouflaged object detection.用于伪装目标检测的引导式多尺度细化网络。
Multimed Tools Appl. 2023;82(4):5785-5801. doi: 10.1007/s11042-022-13274-4. Epub 2022 Jul 30.

引用本文的文献

1
Improving Prediction of Cervical Cancer Using KNN Imputed SMOTE Features and Multi-Model Ensemble Learning Approach.使用K近邻插补合成少数过采样技术特征和多模型集成学习方法改善宫颈癌预测
Cancers (Basel). 2023 Sep 4;15(17):4412. doi: 10.3390/cancers15174412.
2
Enhancing Skin Cancer Detection and Classification in Dermoscopic Images through Concatenated MobileNetV2 and Xception Models.通过串联MobileNetV2和Xception模型增强皮肤镜图像中的皮肤癌检测与分类
Bioengineering (Basel). 2023 Aug 19;10(8):979. doi: 10.3390/bioengineering10080979.
3
The analysis and optimization of thermal sensation of train drivers under occupational thermal exposure.
职业热暴露下司机热感觉的分析与优化。
Front Public Health. 2023 Jun 9;11:1164817. doi: 10.3389/fpubh.2023.1164817. eCollection 2023.
4
The Application of Electroencephalogram in Driving Safety: Current Status and Future Prospects.脑电图在驾驶安全中的应用:现状与未来展望。
Front Psychol. 2022 Jul 22;13:919695. doi: 10.3389/fpsyg.2022.919695. eCollection 2022.