• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TwinsReID:基于 Twins Transformer 的多层次特征的行人重识别。

TwinsReID: Person re-identification based on twins transformer's multi-level features.

机构信息

Zhuoyue Honors College, Hangzhou Dianzi University, Hangzhou, China.

College of Automation, Hangzhou Dianzi University, Hangzhou, China.

出版信息

Math Biosci Eng. 2023 Jan;20(2):2110-2130. doi: 10.3934/mbe.2023098. Epub 2022 Nov 14.

DOI:10.3934/mbe.2023098
PMID:36899525
Abstract

In the traditional person re-identification model, the CNN network is usually used for feature extraction. When converting the feature map into a feature vector, a large number of convolution operations are used to reduce the size of the feature map. In CNN, since the receptive field of the latter layer is obtained by convolution operation on the feature map of the previous layer, the size of this local receptive field is limited, and the computational cost is large. For these problems, combined with the self-attention characteristics of Transformer, an end-to-end person re-identification model (twinsReID) is designed that integrates feature information between levels in this article. For Transformer, the output of each layer is the correlation between its previous layer and other elements. This operation is equivalent to the global receptive field because each element needs to calculate the correlation with other elements, and the calculation is simple, so its cost is small. From these perspectives, Transformer has certain advantages over CNN's convolution operation. This paper uses Twins-SVT Transformer to replace the CNN network, combines the features extracted from the two different stages and divides them into two branches. First, convolve the feature map to obtain a fine-grained feature map, perform global adaptive average pooling on the second branch to obtain the feature vector. Then divide the feature map level into two sections, perform global adaptive average pooling on each. These three feature vectors are obtained and sent to the Triplet Loss respectively. After sending the feature vectors to the fully connected layer, the output is input to the Cross-Entropy Loss and Center-Loss. The model is verified On the Market-1501 dataset in the experiments. The mAP/rank1 index reaches 85.4%/93.7%, and reaches 93.6%/94.9% after reranking. The statistics of the parameters show that the parameters of the model are less than those of the traditional CNN model.

摘要

在传统的人像再识别模型中,通常使用 CNN 网络进行特征提取。在将特征图转换为特征向量时,会使用大量卷积操作来减小特征图的大小。在 CNN 中,由于后一层的感受野是通过在前一层特征图上进行卷积操作获得的,因此这个局部感受野的大小是有限的,计算成本也很大。针对这些问题,结合 Transformer 的自注意力特点,设计了一个端到端的人像再识别模型(twinsReID),本文在这个模型中整合了各层之间的特征信息。对于 Transformer,每一层的输出都是其前一层与其他元素之间的相关性。这种操作相当于全局感受野,因为每个元素都需要与其他元素计算相关性,而且计算简单,因此其成本较小。从这些方面来看,Transformer 相对于 CNN 的卷积操作具有一定的优势。本文使用 Twins-SVT Transformer 替换了 CNN 网络,将从两个不同阶段提取的特征结合起来并分为两个分支。首先,对特征图进行卷积以获得细粒度的特征图,然后在第二分支上进行全局自适应平均池化以获得特征向量。然后将特征图的级别分为两部分,对每一部分进行全局自适应平均池化。得到这三个特征向量,分别将它们发送到三元组损失中。将特征向量发送到全连接层后,输出输入到交叉熵损失和中心损失中。在实验中,该模型在 Market-1501 数据集上进行了验证。mAP/rank1 指标达到 85.4%/93.7%,重新排序后达到 93.6%/94.9%。参数统计显示,模型的参数少于传统的 CNN 模型。

相似文献

1
TwinsReID: Person re-identification based on twins transformer's multi-level features.TwinsReID:基于 Twins Transformer 的多层次特征的行人重识别。
Math Biosci Eng. 2023 Jan;20(2):2110-2130. doi: 10.3934/mbe.2023098. Epub 2022 Nov 14.
2
Palmprint recognition based on gating mechanism and adaptive feature fusion.基于门控机制和自适应特征融合的掌纹识别
Front Neurorobot. 2023 May 26;17:1203962. doi: 10.3389/fnbot.2023.1203962. eCollection 2023.
3
A reliable and low-cost deep learning model integrating convolutional neural network and transformer structure for fine-grained classification of chicken Eimeria species.一种可靠且低成本的深度学习模型,集成卷积神经网络和变压器结构,用于鸡艾美耳球虫种的细粒度分类。
Poult Sci. 2023 Mar;102(3):102459. doi: 10.1016/j.psj.2022.102459. Epub 2022 Dec 30.
4
Towards more efficient ophthalmic disease classification and lesion location via convolution transformer.通过卷积变换器实现更高效的眼科疾病分类和病变定位
Comput Methods Programs Biomed. 2022 Jun;220:106832. doi: 10.1016/j.cmpb.2022.106832. Epub 2022 Apr 27.
5
Heterogeneous feature-aware Transformer-CNN coupling network for person re-identification.用于行人重识别的异构特征感知Transformer-CNN耦合网络
PeerJ Comput Sci. 2022 Sep 27;8:e1098. doi: 10.7717/peerj-cs.1098. eCollection 2022.
6
DBMF: Dual Branch Multiscale Feature Fusion Network for polyp segmentation.DBMF:用于息肉分割的双分支多尺度特征融合网络。
Comput Biol Med. 2022 Dec;151(Pt A):106304. doi: 10.1016/j.compbiomed.2022.106304. Epub 2022 Nov 9.
7
Dynamic Weighting Network for Person Re-Identification.用于行人重识别的动态加权网络
Sensors (Basel). 2023 Jun 14;23(12):5579. doi: 10.3390/s23125579.
8
DRFnet: Dynamic receptive field network for object detection and image recognition.DRFnet:用于目标检测和图像识别的动态感受野网络。
Front Neurorobot. 2023 Jan 10;16:1100697. doi: 10.3389/fnbot.2022.1100697. eCollection 2022.
9
A novel hybrid transformer-CNN architecture for environmental microorganism classification.一种用于环境微生物分类的新型混合变压器-CNN 架构。
PLoS One. 2022 Nov 11;17(11):e0277557. doi: 10.1371/journal.pone.0277557. eCollection 2022.
10
Detecting COVID-19 patients via MLES-Net deep learning models from X-Ray images.基于 X 光图像的 MLES-Net 深度学习模型对 COVID-19 患者的检测。
BMC Med Imaging. 2022 Jul 30;22(1):135. doi: 10.1186/s12880-022-00861-y.

引用本文的文献

1
A Multi-Attention Approach for Person Re-Identification Using Deep Learning.基于深度学习的多注意力机制行人再识别方法。
Sensors (Basel). 2023 Apr 2;23(7):3678. doi: 10.3390/s23073678.