• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于语义分割的区域感知池化的面部图像 BMI 估计。

Estimation of BMI from facial images using semantic segmentation based region-aware pooling.

机构信息

Intelligent Machine Lab, Information Technology University, Pakistan.

Machine Learning and Data Science @ the Home Depot, USA.

出版信息

Comput Biol Med. 2021 Jun;133:104392. doi: 10.1016/j.compbiomed.2021.104392. Epub 2021 Apr 15.

DOI:10.1016/j.compbiomed.2021.104392
PMID:33895458
Abstract

Body-Mass-Index (BMI) conveys important information about one's life such as health and socio-economic conditions. Large-scale automatic estimation of BMIs can help predict several societal behaviors such as health, job opportunities, friendships, and popularity. The recent works have either employed hand-crafted geometrical face features or face-level deep convolutional neural network features for face to BMI prediction. The hand-crafted geometrical face feature lack generalizability and face-level deep features don't have detailed local information. Although useful, these methods missed the detailed local information which is essential for exact BMI prediction. In this paper, we propose to use deep features that are pooled from different face regions (eye, nose, eyebrow, lips, etc.) and demonstrate that this explicit pooling from face regions can significantly boost the performance of BMI prediction. To address the problem of accurate and pixel-level face regions localization, we propose to use face semantic segmentation in our framework. Extensive experiments are performed using different Convolutional Neural Network (CNN) backbones including FaceNet and VGG-face on three publicly available datasets: VisualBMI, Bollywood and VIP attributes. Experimental results demonstrate that, as compared to the recent works, the proposed Reg-GAP gives a percentage improvement of 22.4% on VIP-attribute, 3.3% on VisualBMI, and 63.09% on the Bollywood dataset.

摘要

身体质量指数(BMI)传达了有关个人生活的重要信息,例如健康和社会经济状况。大规模自动估计 BMI 可以帮助预测健康、工作机会、友谊和受欢迎程度等几种社会行为。最近的研究工作要么使用手工制作的几何人脸特征,要么使用人脸级别的深度卷积神经网络特征进行人脸到 BMI 的预测。手工制作的几何人脸特征缺乏泛化能力,而人脸级别的深度特征则没有详细的局部信息。虽然这些方法很有用,但它们忽略了精确 BMI 预测所必需的详细局部信息。在本文中,我们提出使用从不同人脸区域(眼睛、鼻子、眉毛、嘴唇等)提取的深度特征,并证明这种从人脸区域的显式池化可以显著提高 BMI 预测的性能。为了解决准确和像素级人脸区域定位的问题,我们在框架中提出使用人脸语义分割。我们在三个公开可用的数据集上进行了广泛的实验,包括 VisualBMI、Bollywood 和 VIP 属性,使用了不同的卷积神经网络(CNN)骨干网络,包括 FaceNet 和 VGG-face。实验结果表明,与最近的研究工作相比,所提出的 Reg-GAP 在 VIP 属性上的百分比提高了 22.4%,在 VisualBMI 上提高了 3.3%,在 Bollywood 数据集上提高了 63.09%。

相似文献

1
Estimation of BMI from facial images using semantic segmentation based region-aware pooling.基于语义分割的区域感知池化的面部图像 BMI 估计。
Comput Biol Med. 2021 Jun;133:104392. doi: 10.1016/j.compbiomed.2021.104392. Epub 2021 Apr 15.
2
On Symbiosis of Attribute Prediction and Semantic Segmentation.属性预测与语义分割的共生。
IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1620-1635. doi: 10.1109/TPAMI.2019.2956039. Epub 2021 Apr 1.
3
Coupled Attribute Learning for Heterogeneous Face Recognition.用于异构人脸识别的耦合属性学习
IEEE Trans Neural Netw Learn Syst. 2020 Nov;31(11):4699-4712. doi: 10.1109/TNNLS.2019.2957285. Epub 2020 Oct 29.
4
A comparison between two semantic deep learning frameworks for the autosomal dominant polycystic kidney disease segmentation based on magnetic resonance images.基于磁共振图像的常染色体显性遗传性多囊肾病分割的两种语义深度学习框架的比较。
BMC Med Inform Decis Mak. 2019 Dec 12;19(Suppl 9):244. doi: 10.1186/s12911-019-0988-4.
5
Improving Depth Estimation by Embedding Semantic Segmentation: A Hybrid CNN Model.通过嵌入语义分割来提高深度估计:一种混合 CNN 模型。
Sensors (Basel). 2022 Feb 21;22(4):1669. doi: 10.3390/s22041669.
6
Region-Enhancing Network for Semantic Segmentation of Remote-Sensing Imagery.区域增强网络在遥感图像语义分割中的应用。
Sensors (Basel). 2021 Nov 3;21(21):7316. doi: 10.3390/s21217316.
7
Multi-Scale Squeeze U-SegNet with Multi Global Attention for Brain MRI Segmentation.多尺度挤压 U-Net 与多全局注意力融合的脑 MRI 分割方法
Sensors (Basel). 2021 May 12;21(10):3363. doi: 10.3390/s21103363.
8
A comparative study of pre-trained convolutional neural networks for semantic segmentation of breast tumors in ultrasound.用于超声乳腺肿瘤语义分割的预训练卷积神经网络的比较研究
Comput Biol Med. 2020 Nov;126:104036. doi: 10.1016/j.compbiomed.2020.104036. Epub 2020 Oct 8.
9
GC-Net: Global context network for medical image segmentation.GC-Net:用于医学图像分割的全局上下文网络。
Comput Methods Programs Biomed. 2020 Jul;190:105121. doi: 10.1016/j.cmpb.2019.105121. Epub 2019 Oct 4.
10
ADR-Net: Context extraction network based on M-Net for medical image segmentation.ADR-Net:基于M-Net的医学图像分割上下文提取网络。
Med Phys. 2020 Sep;47(9):4254-4264. doi: 10.1002/mp.14364. Epub 2020 Aug 2.

引用本文的文献

1
ARAN: Age-Restricted Anonymized Dataset of Children Images and Body Measurements.ARAN:儿童图像和身体测量的年龄受限匿名数据集。
J Imaging. 2025 Apr 30;11(5):142. doi: 10.3390/jimaging11050142.
2
3D facial imaging: a novel approach for metabolic abnormalities risk profiling.三维面部成像:一种用于代谢异常风险评估的新方法。
Sci China Life Sci. 2025 Feb 18. doi: 10.1007/s11427-024-2726-8.
3
Using Advanced Convolutional Neural Network Approaches to Reveal Patient Age, Gender, and Weight Based on Tongue Images.基于舌象的深度学习卷积神经网络模型自动分析患者年龄、性别和体重
Biomed Res Int. 2024 Aug 1;2024:5551209. doi: 10.1155/2024/5551209. eCollection 2024.
4
An efficient multi-class classification of skin cancer using optimized vision transformer.利用优化后的视觉转换器实现高效的皮肤癌多分类。
Med Biol Eng Comput. 2024 Mar;62(3):773-789. doi: 10.1007/s11517-023-02969-x. Epub 2023 Nov 23.