Intelligent Machine Lab, Information Technology University, Pakistan.
Machine Learning and Data Science @ the Home Depot, USA.
Comput Biol Med. 2021 Jun;133:104392. doi: 10.1016/j.compbiomed.2021.104392. Epub 2021 Apr 15.
Body-Mass-Index (BMI) conveys important information about one's life such as health and socio-economic conditions. Large-scale automatic estimation of BMIs can help predict several societal behaviors such as health, job opportunities, friendships, and popularity. The recent works have either employed hand-crafted geometrical face features or face-level deep convolutional neural network features for face to BMI prediction. The hand-crafted geometrical face feature lack generalizability and face-level deep features don't have detailed local information. Although useful, these methods missed the detailed local information which is essential for exact BMI prediction. In this paper, we propose to use deep features that are pooled from different face regions (eye, nose, eyebrow, lips, etc.) and demonstrate that this explicit pooling from face regions can significantly boost the performance of BMI prediction. To address the problem of accurate and pixel-level face regions localization, we propose to use face semantic segmentation in our framework. Extensive experiments are performed using different Convolutional Neural Network (CNN) backbones including FaceNet and VGG-face on three publicly available datasets: VisualBMI, Bollywood and VIP attributes. Experimental results demonstrate that, as compared to the recent works, the proposed Reg-GAP gives a percentage improvement of 22.4% on VIP-attribute, 3.3% on VisualBMI, and 63.09% on the Bollywood dataset.
身体质量指数(BMI)传达了有关个人生活的重要信息,例如健康和社会经济状况。大规模自动估计 BMI 可以帮助预测健康、工作机会、友谊和受欢迎程度等几种社会行为。最近的研究工作要么使用手工制作的几何人脸特征,要么使用人脸级别的深度卷积神经网络特征进行人脸到 BMI 的预测。手工制作的几何人脸特征缺乏泛化能力,而人脸级别的深度特征则没有详细的局部信息。虽然这些方法很有用,但它们忽略了精确 BMI 预测所必需的详细局部信息。在本文中,我们提出使用从不同人脸区域(眼睛、鼻子、眉毛、嘴唇等)提取的深度特征,并证明这种从人脸区域的显式池化可以显著提高 BMI 预测的性能。为了解决准确和像素级人脸区域定位的问题,我们在框架中提出使用人脸语义分割。我们在三个公开可用的数据集上进行了广泛的实验,包括 VisualBMI、Bollywood 和 VIP 属性,使用了不同的卷积神经网络(CNN)骨干网络,包括 FaceNet 和 VGG-face。实验结果表明,与最近的研究工作相比,所提出的 Reg-GAP 在 VIP 属性上的百分比提高了 22.4%,在 VisualBMI 上提高了 3.3%,在 Bollywood 数据集上提高了 63.09%。