School of Mechanical and Electronic Engineering, Suzhou University, Suzhou, Anhui 23400, China.
School of Math and Statistic, Suzhou University, Suzhou, Anhui 23400, China.
Comput Intell Neurosci. 2022 Jun 6;2022:9980928. doi: 10.1155/2022/9980928. eCollection 2022.
Multimodal tasks based on attention mechanism and language face numerous problems. Based on multimodal hierarchical attention mechanism and genetic neural network, this paper studies the application of image segmentation algorithm in data completion and 3D scene reconstruction. The algorithm refers to the process of concentrating attention that humans subjectively pay attention to and calculates the difference between each pixel in the genetic neural network test image in the color space and the average value of the target image, which solves the problem of static feature maps and dynamic feature maps of image sequences. In addition, in view of the problem that the number of attention enhancement feature extraction modules is too large and the parameters are too large, the recursive mechanism is used as the feature extraction branch, and new model parameters are not added when the network depth is increased. The simulation results show that the accuracy of the improved image saliency detection algorithm based on the attention mechanism reaches 89.7%, and the difference between the average value of the single-point pixel and the target image is reduced to 0.132, which further promotes the practicability and reliability of the image segmentation model.
基于注意力机制和语言的多模态任务面临许多问题。本文基于多模态层次注意机制和遗传神经网络,研究了图像分割算法在数据补全和 3D 场景重建中的应用。该算法借鉴了人类主观关注的集中注意力过程,并计算了遗传神经网络测试图像中每个像素在颜色空间中与目标图像平均值之间的差异,解决了图像序列的静态特征图和动态特征图的问题。此外,针对注意力增强特征提取模块数量过多、参数过大的问题,采用递归机制作为特征提取分支,在增加网络深度时不增加新的模型参数。仿真结果表明,基于注意力机制的改进图像显著检测算法的准确率达到 89.7%,单点像素的平均值与目标图像的差异减小到 0.132,进一步提高了图像分割模型的实用性和可靠性。