Department of Computer Science and Information Technology, Central University of Jammu, Samba, Jammu and Kashmir 181143, India.
Department of Computer Science and Information Technology, Central University of Jammu, Samba, Jammu and Kashmir 181143, India.
Artif Intell Med. 2023 May;139:102544. doi: 10.1016/j.artmed.2023.102544. Epub 2023 Apr 7.
The outbreak of COVID-19 pandemic poses new challenges to research community to investigate novel mechanisms for monitoring as well as controlling its further spread via crowded scenes. Moreover, the contemporary methods of COVID-19 preventions are enforcing strict protocols in the public places. The emergence of robust computer vision-enabled applications leverages intelligent frameworks for monitoring of the pandemic deterrence in public places. The employment of COVID-19 protocols via wearing face masks by human is an effective procedure that is implemented in several countries across the world. It is a challenging task for authorities to manually monitor these protocols particularly in densely crowded public gatherings such as, shopping malls, railway stations, airports, religious places etc. Thus, to overcome these issues, the proposed research aims to design an operative method that automatically detects the violation of face mask regulation for COVID-19 pandemic. In this research work, we expound a novel technique for COVID-19 protocol desecration via video summarization in the crowded scenes (CoSumNet). Our approach automatically yields short summaries from crowded video scenes (i.e., with and without mask human). Besides, the CoSumNet can be deployed in crowded places that may assist the controlling agencies to take appropriate actions to enforce the penalty to the protocol violators. To evaluate the efficacy of the approach, the CoSumNet is trained on a benchmark "Face Mask Detection ∼12K Images Dataset" and validated through various real-time CCTV videos. The CoSumNet demonstrates superior performance of 99.98 % and 99.92 % detection accuracy in the seen and unseen scenarios respectively. Our method offers promising performance in cross-datasets environments as well as on a variety of face masks. Furthermore, the model can convert the longer videos to short summaries in nearly 5-20 s approximately.
COVID-19 大流行的爆发给研究界带来了新的挑战,需要研究新的机制来监测和控制其在拥挤场所的进一步传播。此外,当前 COVID-19 的预防方法正在公共场所强制实施严格的协议。强大的计算机视觉应用的出现利用智能框架来监测公共场所的大流行防范。通过人类佩戴口罩来实施 COVID-19 协议是一种有效的方法,在世界上许多国家都得到了实施。对于当局来说,手动监控这些协议,特别是在购物中心、火车站、机场、宗教场所等人员密集的公共场所,是一项具有挑战性的任务。因此,为了克服这些问题,本研究旨在设计一种自动检测 COVID-19 大流行期间口罩佩戴违规行为的有效方法。在这项研究工作中,我们阐述了一种通过拥挤场景中的视频摘要来检测 COVID-19 协议违规的新方法(CoSumNet)。我们的方法可以自动从拥挤的视频场景(即有戴口罩和无戴口罩的人类)中生成简短的摘要。此外,CoSumNet 可以部署在拥挤的地方,以帮助控制机构采取适当的行动,对违反协议的人进行处罚。为了评估该方法的有效性,我们在一个基准“口罩检测 ∼12K 图像数据集”上对 CoSumNet 进行了训练,并通过各种实时闭路电视视频进行了验证。在可见和不可见场景中,CoSumNet 分别达到了 99.98%和 99.92%的检测精度。我们的方法在跨数据集环境以及各种口罩上都具有很好的性能。此外,该模型可以在大约 5-20 秒内将较长的视频转换为简短的摘要。