• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于复杂网络环境的自监督多模态语义传输机制

The self supervised multimodal semantic transmission mechanism for complex network environments.

作者信息

Zou Jiajun, Wan Zhiping, Wang Feng, Ye Shitong, Liu Shaojiang

机构信息

School of Information and Intelligence Engineering, Guangzhou Xinhua University, Dongguan, 523133, China.

Artificial Intelligence Institute of Guangzhou Huashang College, Guangzhou, 511300, China.

出版信息

Sci Rep. 2025 Aug 14;15(1):29899. doi: 10.1038/s41598-025-15162-x.

DOI:10.1038/s41598-025-15162-x
PMID:40813458
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12354918/
Abstract

With the rapid development of intelligent transportation systems, the challenge of achieving efficient and accurate multimodal traffic data transmission and collaborative processing in complex network environments with bandwidth limitations, signal interference, and high concurrency has become a key issue that needs to be addressed. This paper proposes a Self-supervised Multi-modal and Reinforcement learning-based Traffic data semantic collaboration Transmission mechanism (SMART), aiming to optimize the transmission efficiency and robustness of multimodal data through a combination of self-supervised learning and reinforcement learning. The sending end employs a self-supervised conditional variational autoencoder and Transformer-DRL-based dynamic semantic compression strategy to intelligently filter and transmit the most core semantic information from video, radar, and LiDAR data. The receiving end combines Transformer and graph neural networks for deep decoding and feature fusion of multimodal data, while also using reinforcement learning self-supervised multi-task optimization engine to collaboratively enhance multiple task scenarios (such as traffic accident detection and vehicle behavior recognition). Experimental results show that SMART significantly outperforms traditional methods in low signal-to-noise ratio, high packet loss rate, and large-scale concurrency environments, excelling in key indicators such as semantic similarity, transmission efficiency, robustness, and end-to-end latency, demonstrating its effectiveness and innovation in smart transportation scenarios.

摘要

随着智能交通系统的快速发展,在存在带宽限制、信号干扰和高并发的复杂网络环境中实现高效准确的多模态交通数据传输与协同处理面临的挑战,已成为一个亟待解决的关键问题。本文提出了一种基于自监督多模态和强化学习的交通数据语义协作传输机制(SMART),旨在通过自监督学习和强化学习相结合的方式,优化多模态数据的传输效率和鲁棒性。发送端采用自监督条件变分自编码器和基于Transformer-DRL的动态语义压缩策略,从视频、雷达和激光雷达数据中智能筛选并传输最核心的语义信息。接收端将Transformer和图神经网络相结合,对多模态数据进行深度解码和特征融合,同时还使用强化学习自监督多任务优化引擎,协同增强多种任务场景(如交通事故检测和车辆行为识别)。实验结果表明,在低信噪比、高丢包率和大规模并发环境中,SMART显著优于传统方法,在语义相似度、传输效率、鲁棒性和端到端延迟等关键指标上表现出色,证明了其在智能交通场景中的有效性和创新性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/f84da306a2f5/41598_2025_15162_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/2757286cf545/41598_2025_15162_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/b0533181f39c/41598_2025_15162_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/9d39d8357506/41598_2025_15162_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/ee617d088691/41598_2025_15162_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/cb42ca6faab3/41598_2025_15162_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/4086895799f0/41598_2025_15162_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/8f6721461642/41598_2025_15162_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/b92141451a6e/41598_2025_15162_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/f84da306a2f5/41598_2025_15162_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/2757286cf545/41598_2025_15162_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/b0533181f39c/41598_2025_15162_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/9d39d8357506/41598_2025_15162_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/ee617d088691/41598_2025_15162_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/cb42ca6faab3/41598_2025_15162_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/4086895799f0/41598_2025_15162_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/8f6721461642/41598_2025_15162_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/b92141451a6e/41598_2025_15162_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b19e/12354918/f84da306a2f5/41598_2025_15162_Fig7_HTML.jpg

相似文献

1
The self supervised multimodal semantic transmission mechanism for complex network environments.用于复杂网络环境的自监督多模态语义传输机制
Sci Rep. 2025 Aug 14;15(1):29899. doi: 10.1038/s41598-025-15162-x.
2
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
3
Cognitive decline assessment using semantic linguistic content and transformer deep learning architecture.使用语义语言内容和变压器深度学习架构评估认知能力下降。
Int J Lang Commun Disord. 2024 May-Jun;59(3):1110-1127. doi: 10.1111/1460-6984.12973. Epub 2023 Nov 16.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Short-Term Memory Impairment短期记忆障碍
6
Enhancing breast cancer detection on screening mammogram using self-supervised learning and a hybrid deep model of Swin Transformer and convolutional neural networks.使用自监督学习以及Swin Transformer和卷积神经网络的混合深度模型提高筛查乳腺钼靶片中的乳腺癌检测率。
J Med Imaging (Bellingham). 2025 Nov;12(Suppl 2):S22007. doi: 10.1117/1.JMI.12.S2.S22007. Epub 2025 May 14.
7
Long-term care plan recommendation for older adults with disabilities: a bipartite graph transformer and self-supervised approach.针对残疾老年人的长期护理计划建议:一种二分图变压器和自监督方法。
J Am Med Inform Assoc. 2025 Apr 1;32(4):689-701. doi: 10.1093/jamia/ocae327.
8
Shapley value-driven multi-modal deep reinforcement learning for complex decision-making.用于复杂决策的沙普利值驱动多模态深度强化学习
Neural Netw. 2025 Nov;191:107650. doi: 10.1016/j.neunet.2025.107650. Epub 2025 Jun 21.
9
A fake news detection model using the integration of multimodal attention mechanism and residual convolutional network.一种融合多模态注意力机制和残差卷积网络的假新闻检测模型。
Sci Rep. 2025 Jul 1;15(1):20544. doi: 10.1038/s41598-025-05702-w.
10
Cross-shaped windows transformer with self-supervised pretraining for clinically significant prostate cancer detection in bi-parametric MRI.用于双参数磁共振成像中具有临床意义的前列腺癌检测的带自监督预训练的十字形窗口变换器
Med Phys. 2025 Feb;52(2):993-1004. doi: 10.1002/mp.17546. Epub 2024 Nov 26.

本文引用的文献

1
When language and vision meet road safety: Leveraging multimodal large language models for video-based traffic accident analysis.
Accid Anal Prev. 2025 Sep;219:108077. doi: 10.1016/j.aap.2025.108077. Epub 2025 Jun 4.
2
A Survey on Semantic Communications in Internet of Vehicles.车联网中的语义通信研究
Entropy (Basel). 2025 Apr 20;27(4):445. doi: 10.3390/e27040445.
3
Cross-Modal Graph Semantic Communication Assisted by Generative AI in the Metaverse for 6G.生成式人工智能助力元宇宙中6G的跨模态图语义通信。
Research (Wash D C). 2024 Apr 29;7:0342. doi: 10.34133/research.0342. eCollection 2024.
4
Intelligent driving intelligence test for autonomous vehicles with naturalistic and adversarial environment.自动驾驶智能测试 自然与对抗环境下的自主车辆
Nat Commun. 2021 Feb 2;12(1):748. doi: 10.1038/s41467-021-21007-8.