Ibrahim Mohamed R, Lyons Terry
Centre for Data Analysis and Policy, University of Leeds, Leeds, United Kingdom.
Leeds Institute for Data Analytics (LIDA), Leeds, United Kingdom.
PLoS One. 2024 Jun 13;19(6):e0288670. doi: 10.1371/journal.pone.0288670. eCollection 2024.
Through our respiratory system, many viruses and diseases frequently spread and pass from one person to another. Covid-19 served as an example of how crucial it is to track down and cut back on contacts to stop its spread. There is a clear gap in finding automatic methods that can detect hand-to-face contact in complex urban scenes or indoors. In this paper, we introduce a computer vision framework, called FaceTouch, based on deep learning. It comprises deep sub-models to detect humans and analyse their actions. FaceTouch seeks to detect hand-to-face touches in the wild, such as through video chats, bus footage, or CCTV feeds. Despite partial occlusion of faces, the introduced system learns to detect face touches from the RGB representation of a given scene by utilising the representation of the body gestures such as arm movement. This has been demonstrated to be useful in complex urban scenarios beyond simply identifying hand movement and its closeness to faces. Relying on Supervised Contrastive Learning, the introduced model is trained on our collected dataset, given the absence of other benchmark datasets. The framework shows a strong validation in unseen datasets which opens the door for potential deployment.
通过我们的呼吸系统,许多病毒和疾病经常传播并在人与人之间传染。新冠疫情就是一个例子,说明追踪并减少接触以阻止其传播是多么关键。在寻找能够在复杂城市场景或室内检测手与脸接触的自动方法方面,存在明显差距。在本文中,我们介绍了一种基于深度学习的计算机视觉框架,称为FaceTouch。它由用于检测人体并分析其动作的深度子模型组成。FaceTouch旨在在自然场景中检测手与脸的接触,例如通过视频聊天、公交车监控录像或闭路电视画面。尽管面部存在部分遮挡,所介绍的系统通过利用诸如手臂运动等身体姿势的表示,从给定场景的RGB表示中学习检测脸部接触。这已被证明在复杂城市场景中很有用,而不仅仅是识别手部运动及其与脸部的接近程度。由于没有其他基准数据集,所介绍的模型依靠监督对比学习在我们收集的数据集上进行训练。该框架在未见过的数据集上显示出强大的验证效果,为潜在的部署打开了大门。