• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过互连卷积神经网络实现端到端面部解析

End-to-end face parsing via interlinked convolutional neural networks.

作者信息

Yin Zi, Yiu Valentin, Hu Xiaolin, Tang Liang

机构信息

School of Technology, Beijing Forestry University, Beijing, 100083 China.

Department of Computer Science and Technology,State Key Laboratory of Intelligent Technology and Systems, Institute for Artificial Intelligence,Beijing National Research Center for Information Science and Technology,THBI, Tsinghua University, Beijing, 100084 China.

出版信息

Cogn Neurodyn. 2021 Feb;15(1):169-179. doi: 10.1007/s11571-020-09615-4. Epub 2020 Jul 13.

DOI:10.1007/s11571-020-09615-4
PMID:33786087
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7947053/
Abstract

Face parsing is an important computer vision task that requires accurate pixel segmentation of facial parts (such as eyes, nose, mouth, etc.), providing a basis for further face analysis, modification, and other applications. Interlinked Convolutional Neural Networks (iCNN) was proved to be an effective two-stage model for face parsing. However, the original iCNN was trained separately in two stages, limiting its performance. To solve this problem, we introduce a simple, end-to-end face parsing framework: STN-aided iCNN(STN-iCNN), which extends the iCNN by adding a Spatial Transformer Network (STN) between the two isolated stages. The STN-iCNN uses the STN to provide a trainable connection to the original two-stage iCNN pipeline, making end-to-end joint training possible. Moreover, as a by-product, STN also provides more precise cropped parts than the original cropper. Due to these two advantages, our approach significantly improves the accuracy of the original model. Our model achieved competitive performance on the Helen Dataset, the standard face parsing dataset. It also achieved superior performance on CelebAMask-HQ dataset, proving its good generalization. Our code has been released at https://github.com/aod321/STN-iCNN.

摘要

面部解析是一项重要的计算机视觉任务,它需要对面部各个部分(如眼睛、鼻子、嘴巴等)进行精确的像素分割,为进一步的面部分析、修改及其他应用提供基础。互联卷积神经网络(iCNN)被证明是一种用于面部解析的有效的两阶段模型。然而,原始的iCNN是分两个阶段单独训练的,这限制了其性能。为了解决这个问题,我们引入了一个简单的端到端面部解析框架:基于空间变换器网络辅助的iCNN(STN-iCNN),它通过在两个独立阶段之间添加一个空间变换器网络(STN)来扩展iCNN。STN-iCNN使用STN为原始的两阶段iCNN管道提供一个可训练的连接,从而实现端到端联合训练。此外,作为一个副产品,STN还能提供比原始裁剪器更精确的裁剪部分。由于这两个优点,我们的方法显著提高了原始模型的准确性。我们的模型在标准面部解析数据集海伦数据集上取得了有竞争力的性能。它在CelebAMask-HQ数据集上也取得了优异的性能,证明了其良好的泛化能力。我们的代码已在https://github.com/aod321/STN-iCNN上发布。

相似文献

1
End-to-end face parsing via interlinked convolutional neural networks.通过互连卷积神经网络实现端到端面部解析
Cogn Neurodyn. 2021 Feb;15(1):169-179. doi: 10.1007/s11571-020-09615-4. Epub 2020 Jul 13.
2
A Bayesian-optimized design for an interpretable convolutional neural network to decode and analyze the P300 response in autism.贝叶斯优化设计用于可解释卷积神经网络的自闭症 P300 反应解码和分析。
J Neural Eng. 2022 Jul 14;19(4). doi: 10.1088/1741-2552/ac7908.
3
CLRNet: Component-Level Refinement Network for Deep Face Parsing.CLRNet:用于深度人脸解析的组件级细化网络。
IEEE Trans Neural Netw Learn Syst. 2023 Mar;34(3):1439-1453. doi: 10.1109/TNNLS.2021.3105386. Epub 2023 Feb 28.
4
A Multi-Task Framework for Facial Attributes Classification through End-to-End Face Parsing and Deep Convolutional Neural Networks.基于端到端人脸解析和深度卷积神经网络的人脸属性分类多任务框架。
Sensors (Basel). 2020 Jan 7;20(2):328. doi: 10.3390/s20020328.
5
Multiscale segmentation- and error-guided iterative convolutional neural network for cerebral neuron segmentation in microscopic images.多尺度分割和误差引导迭代卷积神经网络在微观图像中的脑神经元分割。
Microsc Res Tech. 2022 Nov;85(11):3541-3552. doi: 10.1002/jemt.24206. Epub 2022 Jul 20.
6
Semi-supervised body parsing and pose estimation for enhancing infant general movement assessment.用于增强婴儿一般运动评估的半监督身体解析与姿势估计
Med Image Anal. 2023 Jan;83:102654. doi: 10.1016/j.media.2022.102654. Epub 2022 Oct 14.
7
Face-mask-aware Facial Expression Recognition based on Face Parsing and Vision Transformer.基于面部解析和视觉Transformer的口罩感知面部表情识别
Pattern Recognit Lett. 2022 Dec;164:173-182. doi: 10.1016/j.patrec.2022.11.004. Epub 2022 Nov 9.
8
Global Aggregation Then Local Distribution for Scene Parsing.用于场景解析的全局聚合然后局部分布
IEEE Trans Image Process. 2021;30:6829-6842. doi: 10.1109/TIP.2021.3099366.
9
Multiobjective Reinforcement Learning-Based Neural Architecture Search for Efficient Portrait Parsing.基于多目标强化学习的高效人像解析神经架构搜索
IEEE Trans Cybern. 2023 Feb;53(2):1158-1169. doi: 10.1109/TCYB.2021.3104866. Epub 2023 Jan 13.
10
Automatic localization of target point for subthalamic nucleus-deep brain stimulation via hierarchical attention-UNet based MRI segmentation.基于分层注意力-UNet的MRI分割实现丘脑底核深部脑刺激靶点的自动定位
Med Phys. 2023 Jan;50(1):50-60. doi: 10.1002/mp.15956. Epub 2022 Sep 9.

引用本文的文献

1
Affective EEG-based cross-session person identification using hierarchical graph embedding.基于情感脑电图的跨会话人员识别:使用层次图嵌入方法
Cogn Neurodyn. 2024 Oct;18(5):2897-2908. doi: 10.1007/s11571-024-10132-x. Epub 2024 May 29.
2
DRF-DRC: dynamic receptive field and dense residual connections for model compression.DRF-DRC:用于模型压缩的动态感受野和密集残差连接
Cogn Neurodyn. 2023 Dec;17(6):1561-1573. doi: 10.1007/s11571-022-09913-z. Epub 2022 Nov 14.
3
Image denoising and segmentation model construction based on IWOA-PCNN.基于改进鲸鱼优化算法-脉冲耦合神经网络的图像去噪与分割模型构建
Sci Rep. 2023 Nov 13;13(1):19848. doi: 10.1038/s41598-023-47089-6.
4
Facial expression recognition on partially occluded faces using component based ensemble stacked CNN.基于组件的集成堆叠卷积神经网络对部分遮挡面部的表情识别
Cogn Neurodyn. 2023 Aug;17(4):985-1008. doi: 10.1007/s11571-022-09879-y. Epub 2022 Sep 17.
5
Real-time echocardiography image analysis and quantification of cardiac indices.实时超声心动图图像分析和心功能指数的定量评估。
Med Image Anal. 2022 Aug;80:102438. doi: 10.1016/j.media.2022.102438. Epub 2022 Jun 9.
6
Trilateral Attention Network for Real-Time Cardiac Region Segmentation.用于实时心脏区域分割的三边注意力网络
IEEE Access. 2021;9:118205-118214. doi: 10.1109/access.2021.3107303. Epub 2021 Aug 24.
7
High-Resolution U-Net: Preserving Image Details for Cultivated Land Extraction.高分辨率 U-Net:保留图像细节进行耕地提取。
Sensors (Basel). 2020 Jul 22;20(15):4064. doi: 10.3390/s20154064.

本文引用的文献

1
Detecting prostate cancer using deep learning convolution neural network with transfer learning approach.使用具有迁移学习方法的深度学习卷积神经网络检测前列腺癌。
Cogn Neurodyn. 2020 Aug;14(4):523-533. doi: 10.1007/s11571-020-09587-5. Epub 2020 Apr 11.
2
EEG classification of driver mental states by deep learning.基于深度学习的驾驶员心理状态脑电图分类
Cogn Neurodyn. 2018 Dec;12(6):597-606. doi: 10.1007/s11571-018-9496-y. Epub 2018 Jul 18.
3
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
4
Banknote recognition: investigating processing and cognition framework using competitive neural network.纸币识别:使用竞争神经网络研究处理与认知框架
Cogn Neurodyn. 2017 Feb;11(1):67-79. doi: 10.1007/s11571-016-9404-2. Epub 2016 Aug 22.
5
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
6
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
7
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.
8
Receptive fields, binocular interaction and functional architecture in the cat's visual cortex.猫视觉皮层中的感受野、双眼相互作用及功能结构
J Physiol. 1962 Jan;160(1):106-54. doi: 10.1113/jphysiol.1962.sp006837.