
Cross-Domain Visual Matching via Generalized Similarity Measure and Feature Learning.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1089-1102. doi: 10.1109/TPAMI.2016.2567386. Epub 2016 May 12.

DOI:10.1109/TPAMI.2016.2567386
PMID:27187945
Abstract

Cross-domain visual data matching is one of the fundamental problems in many real-world vision tasks, e.g., matching persons across ID photos and surveillance videos. Conventional approaches to this problem usually involve two steps: i) projecting samples from different domains into a common space, and ii) computing (dis-)similarity in this space based on a certain distance. In this paper, we present a novel pairwise similarity measure that advances existing models by i) expanding traditional linear projections into affine transformations and ii) fusing affine Mahalanobis distance and Cosine similarity by a data-driven combination. Moreover, we unify our similarity measure with feature representation learning via deep convolutional neural networks. Specifically, we incorporate the similarity measure matrix into the deep architecture, enabling end-to-end model optimization. We extensively evaluate our generalized similarity model in several challenging cross-domain matching tasks: person re-identification under different views and face verification over different modalities (i.e., faces from still images and videos, older and younger faces, and sketch and photo portraits). The experimental results demonstrate superior performance of our model over other state-of-the-art methods.
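
The two ingredients named in the abstract (domain-specific affine transformations, and a fusion of a distance term with cosine similarity) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the names `affine_project` and `fused_similarity` and the weights `w_cos` and `w_dist` are hypothetical, the squared Euclidean distance between affinely projected samples stands in for the affine Mahalanobis distance, and the fixed weights stand in for the combination that the paper learns from data.

```python
import numpy as np


def affine_project(x, W, b):
    """Map a sample into the common space with an affine transform W @ x + b."""
    return W @ x + b


def fused_similarity(x, y, Wx, bx, Wy, by, w_cos=1.0, w_dist=1.0):
    """Score a cross-domain pair (x, y); higher means more similar.

    Each domain has its own affine transform (assumed, illustrative).
    The score is a weighted cosine similarity minus a weighted squared
    distance, both computed in the common space.
    """
    u = affine_project(x, Wx, bx)
    v = affine_project(y, Wy, by)
    # Squared Euclidean distance after projection (Mahalanobis-style term).
    dist_sq = float(np.sum((u - v) ** 2))
    # Cosine similarity; the small constant guards against division by zero.
    cos = float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12))
    return w_cos * cos - w_dist * dist_sq


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical feature sizes: 4-D and 6-D inputs mapped to a 3-D common space.
    Wx, bx = rng.normal(size=(3, 4)), rng.normal(size=3)
    Wy, by = rng.normal(size=(3, 6)), rng.normal(size=3)
    x, y = rng.normal(size=4), rng.normal(size=6)
    print(fused_similarity(x, y, Wx, bx, Wy, by))
```

In an end-to-end setting such as the one the abstract describes, the transforms and the fusion weights would be trainable parameters optimized jointly with the feature extractor rather than fixed arrays.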


Similar articles

1
Cross-Domain Visual Matching via Generalized Similarity Measure and Feature Learning.
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1089-1102. doi: 10.1109/TPAMI.2016.2567386. Epub 2016 May 12.
2
Person Re-Identification by Camera Correlation Aware Feature Augmentation.
IEEE Trans Pattern Anal Mach Intell. 2018 Feb;40(2):392-408. doi: 10.1109/TPAMI.2017.2666805. Epub 2017 Feb 9.
3
Person Re-identification by Multi-hypergraph Fusion.
IEEE Trans Neural Netw Learn Syst. 2017 Nov;28(11):2763-2774. doi: 10.1109/TNNLS.2016.2602082.
4
A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.
Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.
5
Video-Based Person Re-Identification by an End-To-End Learning Architecture with Hybrid Deep Appearance-Temporal Feature.
Sensors (Basel). 2018 Oct 29;18(11):3669. doi: 10.3390/s18113669.
6
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
7
Vehicle Re-Identification by Deep Hidden Multi-View Inference.
IEEE Trans Image Process. 2018 Jul;27(7):3275-3287. doi: 10.1109/TIP.2018.2819820.
8
Unsupervised Person Re-identification via Cross-camera Similarity Exploration.
IEEE Trans Image Process. 2020 Apr 1. doi: 10.1109/TIP.2020.2982826.
9
Person re-identification over camera networks using multi-task distance metric learning.
IEEE Trans Image Process. 2014 Aug;23(8):3656-70. doi: 10.1109/TIP.2014.2331755. Epub 2014 Jun 18.
10
Person Re-Identification by Contour Sketch Under Moderate Clothing Change.
IEEE Trans Pattern Anal Mach Intell. 2021 Jun;43(6):2029-2046. doi: 10.1109/TPAMI.2019.2960509. Epub 2021 May 11.

Cited by

1
MIC: Breast Cancer Multi-label Diagnostic Framework Based on Multi-modal Fusion Interaction.
J Imaging Inform Med. 2025 Jan 6. doi: 10.1007/s10278-024-01361-x.
2
An Audiovisual Correlation Matching Method Based on Fine-Grained Emotion and Feature Fusion.
Sensors (Basel). 2024 Aug 31;24(17):5681. doi: 10.3390/s24175681.
3
Image local structure information learning for fine-grained visual classification.
Sci Rep. 2022 Nov 10;12(1):19205. doi: 10.1038/s41598-022-23835-0.
4
Relationship Discovery and Hierarchical Embedding for Web Service Quality Prediction.
Comput Intell Neurosci. 2022 Oct 5;2022:9240843. doi: 10.1155/2022/9240843. eCollection 2022.
5
Anomaly Detection in EEG Signals: A Case Study on Similarity Measure.
Comput Intell Neurosci. 2020 Jan 10;2020:6925107. doi: 10.1155/2020/6925107. eCollection 2020.