基于多模态神经网络模型的光环境优化自动图像处理算法。

Automatic Image Processing Algorithm for Light Environment Optimization Based on Multimodal Neural Network Model.

机构信息

College of Information Engineering, Henan Vocational College of Agricuture, Zhengzhou, Henan 451450, China.

出版信息

Comput Intell Neurosci. 2022 Jun 3;2022:5156532. doi: 10.1155/2022/5156532. eCollection 2022.

DOI:10.1155/2022/5156532

PMID:35694600

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9187444/

Abstract

In this paper, we conduct an in-depth study and analysis of the automatic image processing algorithm based on a multimodal Recurrent Neural Network (m-RNN) for light environment optimization. By analyzing the structure of m-RNN and combining the current research frontiers of image processing and natural language processing, we find out the problem of the ineffectiveness of m-RNN for some image generation descriptions, starting from both the image feature extraction part and text sequence data processing. Unlike traditional image automatic processing algorithms, this algorithm does not need to add complex rules manually. Still, it evaluates and filters through the training image collection and finally generates image automatic processing models by m-RNN. An image semantic segmentation algorithm is proposed based on multimodal attention and adaptive feature fusion. The main idea of the algorithm is to combine adaptive and feature fusion and then introduce data enhancement for small-scale multimodal light environment datasets by extracting the importance between images through multimodal attention. The model proposed in this paper can span the semantic differences of different modalities and construct feature relationships between different modalities to achieve an inferable, interpretable, and scalable feature representation of multimodal data. The automatic processing of light environment images using multimodal neural networks based on traditional algorithms eliminates manual processing and greatly reduces the time and effort of image processing.

摘要

在本文中，我们深入研究和分析了基于多模态递归神经网络（m-RNN）的自动图像处理算法，用于优化光环境。通过分析 m-RNN 的结构，并结合图像处理和自然语言处理的当前研究前沿，我们发现 m-RNN 对于某些图像生成描述的效果不佳，这一问题源于图像特征提取部分和文本序列数据处理两方面。与传统的图像自动处理算法不同，该算法不需要手动添加复杂的规则，而是通过训练图像集进行评估和过滤，最终通过 m-RNN 生成图像自动处理模型。提出了一种基于多模态注意力和自适应特征融合的图像语义分割算法。该算法的主要思想是通过多模态注意力提取图像之间的重要性，结合自适应和特征融合，然后对小规模多模态光环境数据集进行数据增强。本文提出的模型可以跨越不同模态的语义差异，构建不同模态之间的特征关系，实现多模态数据可推断、可解释和可扩展的特征表示。基于传统算法的多模态神经网络对光环境图像的自动处理消除了人工处理，大大减少了图像处理的时间和精力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e90/9187444/671e4cbec5c7/CIN2022-5156532.001.jpg

相似文献

Automatic Image Processing Algorithm for Light Environment Optimization Based on Multimodal Neural Network Model.基于多模态神经网络模型的光环境优化自动图像处理算法。

Comput Intell Neurosci. 2022 Jun 3;2022:5156532. doi: 10.1155/2022/5156532. eCollection 2022.

A multibranch and multiscale neural network based on semantic perception for multimodal medical image fusion.基于语义感知的多分支多尺度神经网络用于多模态医学图像融合。

Sci Rep. 2024 Jul 30;14(1):17609. doi: 10.1038/s41598-024-68183-3.

Medical lesion segmentation by combining multimodal images with modality weighted UNet.基于模态加权 UNet 融合多模态图像的医学病灶分割。

Med Phys. 2022 Jun;49(6):3692-3704. doi: 10.1002/mp.15610. Epub 2022 Apr 7.

Research on Image Segmentation Algorithm Based on Multimodal Hierarchical Attention Mechanism and Genetic Neural Network.基于多模态分层注意力机制和遗传神经网络的图像分割算法研究。

Comput Intell Neurosci. 2022 Jun 6;2022:9980928. doi: 10.1155/2022/9980928. eCollection 2022.

Fully automatic image colorization based on semantic segmentation technology.基于语义分割技术的全自动图像着色。

PLoS One. 2021 Nov 30;16(11):e0259953. doi: 10.1371/journal.pone.0259953. eCollection 2021.

Optimization of Artistic Image Segmentation Algorithm Based on Feed Forward Neural Network under Complex Background Environment.基于前馈神经网络的复杂背景环境下艺术图像分割算法优化。

J Environ Public Health. 2022 Sep 13;2022:9454344. doi: 10.1155/2022/9454344. eCollection 2022.

MFNet: Multimodal medical image fusion network via multi-receptive-field and multi-scale feature integration.MFNet：一种基于多感受野和多尺度特征融合的多模态医学图像融合网络。

Comput Biol Med. 2023 Jun;159:106923. doi: 10.1016/j.compbiomed.2023.106923. Epub 2023 Apr 14.

Automatic segmentation of breast cancer histological images based on dual-path feature extraction network.基于双通道特征提取网络的乳腺癌组织学图像自动分割。

Math Biosci Eng. 2022 Aug 3;19(11):11137-11153. doi: 10.3934/mbe.2022519.

Automatic Detection of Grammatical Errors in English Verbs Based on RNN Algorithm: Auxiliary Objectives for Neural Error Detection Models.基于 RNN 算法的英语动词语法错误自动检测：神经错误检测模型的辅助目标。

Comput Intell Neurosci. 2021 Oct 16;2021:6052873. doi: 10.1155/2021/6052873. eCollection 2021.

Designing Interpretable Recurrent Neural Networks for Video Reconstruction via Deep Unfolding.通过深度展开设计用于视频重建的可解释循环神经网络。

IEEE Trans Image Process. 2021;30:4099-4113. doi: 10.1109/TIP.2021.3069296. Epub 2021 Apr 8.

本文引用的文献

Multimodal genomic features predict outcome of immune checkpoint blockade in non-small-cell lung cancer.多模态基因组特征预测非小细胞肺癌免疫检查点阻断的疗效。

Nat Cancer. 2020 Jan;1(1):99-111. doi: 10.1038/s43018-019-0008-8. Epub 2020 Jan 13.

Deep Learning-Based Single-Cell Optical Image Studies.基于深度学习的单细胞光学图像研究。

Cytometry A. 2020 Mar;97(3):226-240. doi: 10.1002/cyto.a.23973. Epub 2020 Jan 25.

Impact of image preprocessing methods on reproducibility of radiomic features in multimodal magnetic resonance imaging in glioblastoma.多模态磁共振成像中影像预处理方法对胶质母细胞瘤放射组学特征可重复性的影响。

J Appl Clin Med Phys. 2020 Jan;21(1):179-190. doi: 10.1002/acm2.12795. Epub 2019 Dec 27.

Connected Vehicle as a Mobile Sensor for Real Time Queue Length at Signalized Intersections.联网车辆作为信号交叉口实时排队长度的移动传感器

Sensors (Basel). 2019 May 2;19(9):2059. doi: 10.3390/s19092059.

Origin and Evolution of Core Components Responsible for Monitoring Light Environment Changes during Plant Terrestrialization.负责监测植物陆地化过程中光环境变化的核心成分的起源与演化。

Mol Plant. 2019 Jun 3;12(6):847-862. doi: 10.1016/j.molp.2019.04.006. Epub 2019 Apr 19.

Seasonal and drought-related changes in leaf area profiles depend on height and light environment in an Amazon forest.叶片面积廓线在季节性和干旱相关变化取决于亚马逊森林的高度和光照环境。

New Phytol. 2019 May;222(3):1284-1297. doi: 10.1111/nph.15726. Epub 2019 Mar 9.

A structure fidelity approach for big data collection in wireless sensor networks.一种用于无线传感器网络中大数据收集的结构保真方法。

Sensors (Basel). 2014 Dec 25;15(1):248-73. doi: 10.3390/s150100248.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于多模态神经网络模型的光环境优化自动图像处理算法。

Automatic Image Processing Algorithm for Light Environment Optimization Based on Multimodal Neural Network Model.

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献