• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在超自动化机制中使用卷积神经网络提高深度视频编码效率的策略。

Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism.

作者信息

Wang Xiaolan

机构信息

The Higher Educational Key Laboratory for Flexible Manufacturing Equipment Integration of Fujian Province (Xiamen Institute of Technology), Xiamen, 361021, China.

出版信息

Sci Rep. 2025 Jan 7;15(1):1079. doi: 10.1038/s41598-025-85602-1.

DOI:10.1038/s41598-025-85602-1
PMID:39775162
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11706951/
Abstract

With ongoing social progress, three-dimensional (3D) video is becoming increasingly prevalent in everyday life. As a key component of 3D video technology, depth video plays a crucial role by providing information about the distance and spatial distribution of objects within a scene. This study focuses on deep video encoding and proposes an efficient encoding method that integrates the Convolutional Neural Network (CNN) with a hyperautomation mechanism. First, an overview of the principles underlying CNNs and the concept of hyperautomation is presented, and the application of CNNs in the intra-frame prediction module of video encoding is explored. By incorporating the hyperautomation mechanism, this study emphasizes the potential of Artificial Intelligence to enhance encoding efficiency. Next, a CNN-based method for variable-resolution intra-frame prediction of depth video is proposed. This method utilizes a multi-level feature fusion network to reconstruct coding units. The effectiveness of the proposed variable-resolution coding technique is then evaluated by comparing its performance against the original method on the high-efficiency video coding (HEVC) test platform. The results demonstrate that, compared to the original test platform method (HTM-16.2), the proposed method achieves an average Bjøntegaard delta bit rate (BDBR) savings of 8.12% across all tested video sequences. This indicates a significant improvement in coding efficiency. Furthermore, the viewpoint BDBR loss of the variable-resolution coding method is only 0.15%, which falls within an acceptable margin of error. This suggests that the method is both stable and reliable in viewpoint coding, and it performs well across a broad range of quantization parameter settings. Additionally, compared to other encoding methods, the proposed approach exhibits superior peak signal-to-noise ratio, structural similarity index, and perceptual quality metrics. This study introduces a novel and efficient approach to 3D video compression, and the integration of CNNs with hyperautomation provides valuable insights for future innovations in video encoding.

摘要

随着社会的不断进步,三维(3D)视频在日常生活中越来越普遍。作为3D视频技术的关键组成部分,深度视频通过提供场景中物体的距离和空间分布信息发挥着至关重要的作用。本研究聚焦于深度视频编码,并提出了一种将卷积神经网络(CNN)与超自动化机制相结合的高效编码方法。首先,介绍了CNN的基本原理和超自动化的概念,并探讨了CNN在视频编码帧内预测模块中的应用。通过纳入超自动化机制,本研究强调了人工智能提高编码效率的潜力。接下来,提出了一种基于CNN的深度视频可变分辨率帧内预测方法。该方法利用多级特征融合网络来重建编码单元。然后,通过在高效视频编码(HEVC)测试平台上与原始方法比较性能,评估了所提出的可变分辨率编码技术的有效性。结果表明,与原始测试平台方法(HTM - 16.2)相比,所提出的方法在所有测试视频序列上平均节省了8.12%的Bjøntegaard 增量比特率(BDBR)。这表明编码效率有显著提高。此外,可变分辨率编码方法的视点BDBR损失仅为0.15%,处于可接受的误差范围内。这表明该方法在视点编码中既稳定又可靠,并且在广泛的量化参数设置下表现良好。此外,与其他编码方法相比,所提出的方法在峰值信噪比、结构相似性指数和感知质量指标方面表现优异。本研究介绍了一种新颖且高效的3D视频压缩方法,CNN与超自动化的集成可为视频编码的未来创新提供有价值的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/c1e22f210246/41598_2025_85602_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/dbb82046497c/41598_2025_85602_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/1949d6fcdc65/41598_2025_85602_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/e30c5cc2539d/41598_2025_85602_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/49bfc48aca45/41598_2025_85602_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/398ca1c44b86/41598_2025_85602_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/0fc70480a215/41598_2025_85602_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/d310217281db/41598_2025_85602_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/7e704e5ce6dc/41598_2025_85602_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/b3e0a0dcafa8/41598_2025_85602_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/c1e22f210246/41598_2025_85602_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/dbb82046497c/41598_2025_85602_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/1949d6fcdc65/41598_2025_85602_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/e30c5cc2539d/41598_2025_85602_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/49bfc48aca45/41598_2025_85602_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/398ca1c44b86/41598_2025_85602_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/0fc70480a215/41598_2025_85602_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/d310217281db/41598_2025_85602_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/7e704e5ce6dc/41598_2025_85602_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/b3e0a0dcafa8/41598_2025_85602_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc09/11706951/c1e22f210246/41598_2025_85602_Fig10_HTML.jpg

相似文献

1
Strategies for enhancing deep video encoding efficiency using the Convolutional Neural Network in a hyperautomation mechanism.在超自动化机制中使用卷积神经网络提高深度视频编码效率的策略。
Sci Rep. 2025 Jan 7;15(1):1079. doi: 10.1038/s41598-025-85602-1.
2
High Efficiency Video Coding (HEVC)-Based Surgical Telementoring System Using Shallow Convolutional Neural Network.基于高效视频编码 (HEVC) 的浅层卷积神经网络手术远程指导系统。
J Digit Imaging. 2019 Dec;32(6):1027-1043. doi: 10.1007/s10278-019-00206-2.
3
Deep Learning Post-Filtering Using Multi-Head Attention and Multiresolution Feature Fusion for Image and Intra-Video Quality Enhancement.基于多头注意力和多分辨率特征融合的深度学习后滤波技术在图像和视频内质量增强中的应用。
Sensors (Basel). 2022 Feb 10;22(4):1353. doi: 10.3390/s22041353.
4
A Hardware-Friendlyand High-Efficiency H.265/HEVC Encoder for Visual Sensor Networks.面向视觉传感器网络的硬件友好型、高效率 H.265/HEVC 编码器。
Sensors (Basel). 2023 Feb 27;23(5):2625. doi: 10.3390/s23052625.
5
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
6
Convolutional Neural Network Based Synthesized View Quality Enhancement for 3D Video Coding.基于卷积神经网络的3D视频编码合成视图质量增强
IEEE Trans Image Process. 2018 Jul 20. doi: 10.1109/TIP.2018.2858022.
7
Efficient In-loop Filtering Based on Enhanced Deep Convolutional Neural Networks for HEVC.基于增强深度卷积神经网络的高效HEVC帧内滤波
IEEE Trans Image Process. 2020 Mar 27. doi: 10.1109/TIP.2020.2982534.
8
High-Efficiency 3D Depth Coding Based on Perceptual Quality of Synthesized Video.基于合成视频感知质量的高效3D深度编码
IEEE Trans Image Process. 2016 Dec;25(12):5877-5891. doi: 10.1109/TIP.2016.2615290. Epub 2016 Oct 5.
9
Visual saliency guided perceptual adaptive quantization based on HEVC intra-coding for planetary images.基于高效视频编码(HEVC)帧内编码的视觉显著性引导的行星图像感知自适应量化
PLoS One. 2022 Feb 9;17(2):e0263729. doi: 10.1371/journal.pone.0263729. eCollection 2022.
10
Fast CU Partition Algorithm for Intra Frame Coding Based on Joint Texture Classification and CNN.基于联合纹理分类与卷积神经网络的帧内编码快速CU划分算法
Sensors (Basel). 2023 Sep 15;23(18):7923. doi: 10.3390/s23187923.

引用本文的文献

1
Telesurgery: current status and strategies for latency reduction.远程手术:现状与减少延迟的策略
J Robot Surg. 2025 Apr 12;19(1):153. doi: 10.1007/s11701-025-02333-1.

本文引用的文献

1
An effective classification approach for EEG-based motor imagery tasks combined with attention mechanisms.一种结合注意力机制的基于脑电图的运动想象任务的有效分类方法。
Cogn Neurodyn. 2024 Oct;18(5):2689-2707. doi: 10.1007/s11571-024-10115-y. Epub 2024 May 3.
2
Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics.面向机器的视频编码:用于智能协作分析的紧凑视觉表示压缩
IEEE Trans Pattern Anal Mach Intell. 2024 Jul;46(7):5174-5191. doi: 10.1109/TPAMI.2024.3367293. Epub 2024 Jun 5.
3
Memristor-Based Neural Network Circuit of Associative Memory With Overshadowing and Emotion Congruent Effect.
具有遮蔽效应和情绪一致性效应的基于忆阻器的联想记忆神经网络电路
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3618-3630. doi: 10.1109/TNNLS.2023.3348553. Epub 2025 Feb 6.
4
Fully automatic segmentation of the mandible based on convolutional neural networks (CNNs).基于卷积神经网络(CNN)的下颌骨全自动分割
Orthod Craniofac Res. 2021 Dec;24 Suppl 2:100-107. doi: 10.1111/ocr.12536. Epub 2021 Dec 1.
5
Deeply Subwavelength Localization with Reverberation-Coded Aperture.利用混响编码孔径实现亚波长深度局域化。
Phys Rev Lett. 2021 Jul 23;127(4):043903. doi: 10.1103/PhysRevLett.127.043903.
6
Self-Regulatory Goal Motivational Processes in Sustained New Year Resolution Pursuit and Mental Wellbeing.自我调节目标激励过程在持续新年决心追求和心理健康中的作用。
Int J Environ Res Public Health. 2021 Mar 17;18(6):3084. doi: 10.3390/ijerph18063084.
7
A Multi-Level Feature Fusion Network for Remote Sensing Image Segmentation.基于多级特征融合网络的遥感图像分割。
Sensors (Basel). 2021 Feb 10;21(4):1267. doi: 10.3390/s21041267.
8
Convolutional neural networks in medical image understanding: a survey.医学图像理解中的卷积神经网络:一项综述。
Evol Intell. 2022;15(1):1-22. doi: 10.1007/s12065-020-00540-3. Epub 2021 Jan 3.
9
The coding capacity of SARS-CoV-2.SARS-CoV-2 的编码能力。
Nature. 2021 Jan;589(7840):125-130. doi: 10.1038/s41586-020-2739-1. Epub 2020 Sep 9.