Alex Deepthy Mary, M Kalpana Chowdary, Mengash Hanan Abdullah, M Venkata Dasu, Kryvinska Natalia, J Chinna Babu, Kiran Ajmeera
Department of Electronics and Communication Engineering, Mangalam College of Engineering, Ettumanoor, Kerala, India.
Department of Computer Science and Engineering, MLR Institute of Technology, Hyderabad, Telangana, India.
PeerJ Comput Sci. 2024 Dec 18;10:e2603. doi: 10.7717/peerj-cs.2603. eCollection 2024.
Biometric identification, particularly ear biometrics, has gained prominence amidst the global prevalence of mask-wearing, exacerbated by the COVID-19 outbreak. This shift has highlighted the need for reliable biometric systems that can function effectively even when facial features are partially obscured. Despite numerous proposed convolutional neural network (CNN) based deep learning techniques for ear detection, achieving the expected efficiency and accuracy remains a challenge. In this manuscript, we propose a sophisticated method for ear biometric identification, named the encoder-decoder deep learning ensemble technique incorporating attention blocks. This innovative approach leverages the strengths of encoder-decoder architectures and attention mechanisms to enhance the precision and reliability of ear detection and segmentation. Specifically, our method employs an ensemble of two YSegNets, which significantly improves the performance over a single YSegNet. The use of an ensemble method is crucial in ear biometrics due to the variability and complexity of ear shapes and the potential for partial occlusions. By combining the outputs of two YSegNets, our approach can capture a wider range of features and reduce the risk of false positives and negatives, leading to more robust and accurate segmentation results. Experimental validation of the proposed method was conducted using a combination of data from the EarVN1.0, AMI, and Human Face datasets. The results demonstrate the effectiveness of our approach, achieving a segmentation framework accuracy of 98.93%. This high level of accuracy underscores the potential of our method for practical applications in biometric identification. The proposed innovative method demonstrates significant potential for individual recognition, particularly in scenarios involving large gatherings. When complemented by an effective surveillance system, our method can contribute to improved security and identification processes in public spaces. This research not only advances the field of ear biometrics but also provides a viable solution for biometric identification in the context of mask-wearing and other facial obstructions.
生物特征识别,尤其是耳部生物特征识别,在全球普遍佩戴口罩的情况下,因新冠疫情的爆发而愈发凸显其重要性。这种转变凸显了对可靠生物特征识别系统的需求,即使面部特征部分被遮挡,该系统也能有效运行。尽管已经提出了许多基于卷积神经网络(CNN)的深度学习技术用于耳部检测,但要实现预期的效率和准确性仍然是一个挑战。在本论文中,我们提出了一种用于耳部生物特征识别的复杂方法,即结合注意力模块的编码器-解码器深度学习集成技术。这种创新方法利用了编码器-解码器架构和注意力机制的优势,以提高耳部检测和分割的精度和可靠性。具体而言,我们的方法采用了两个YSegNet的集成,相较于单个YSegNet,显著提高了性能。由于耳部形状的多样性和复杂性以及部分遮挡的可能性,集成方法在耳部生物特征识别中至关重要。通过结合两个YSegNet的输出,我们的方法可以捕捉更广泛的特征,并降低误报和漏报的风险,从而得到更稳健、准确的分割结果。使用来自EarVN1.0、AMI和人脸数据集的数据组合对所提出的方法进行了实验验证。结果证明了我们方法的有效性,分割框架准确率达到了98.93%。这种高水平的准确率突出了我们的方法在生物特征识别实际应用中的潜力。所提出的创新方法在个人识别方面显示出巨大潜力,特别是在涉及大型集会的场景中。当与有效的监控系统相结合时,我们的方法可以有助于改善公共场所的安全和识别流程。这项研究不仅推动了耳部生物特征识别领域的发展,还为在佩戴口罩和其他面部遮挡情况下的生物特征识别提供了可行的解决方案。