Perception-Based H.264/AVC Video Coding for Resource-Constrained and Low-Bit-Rate Applications.

Authors

Kau Lih-Jen, Tseng Chin-Kun, Lee Ming-Xian

Affiliations

Department of Electronic Engineering, National Taipei University of Technology, Taipei 106344, Taiwan.

Tri-Service General Hospital Songshan Branch, Taipei 105309, Taiwan.

Publication information

Sensors (Basel). 2025 Jul 8;25(14):4259. doi: 10.3390/s25144259.

DOI: 10.3390/s25144259
PMID: 40732387
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12298108/
Abstract

With the rapid expansion of Internet of Things (IoT) and edge computing applications, efficient video transmission under constrained bandwidth and limited computational resources has become increasingly critical. In such environments, perception-based video coding plays a vital role in maintaining acceptable visual quality while minimizing bit rate and processing overhead. Although newer video coding standards have emerged, H.264/AVC remains the dominant compression format in many deployed systems, particularly in commercial CCTV surveillance, due to its compatibility, stability, and widespread hardware support. Motivated by these practical demands, this paper proposes a perception-based video coding algorithm specifically tailored for low-bit-rate H.264/AVC applications. By targeting regions most relevant to the human visual system, the proposed method enhances perceptual quality while optimizing resource usage, making it particularly suitable for embedded systems and bandwidth-limited communication channels. In general, regions containing human faces and those exhibiting significant motion are of primary importance for human perception and should receive higher bit allocation to preserve visual quality. To this end, macroblocks (MBs) containing human faces are detected using the Viola-Jones algorithm, which leverages AdaBoost for feature selection and a cascade of classifiers for fast and accurate detection. This approach is favored over deep learning-based models due to its low computational complexity and real-time capability, making it ideal for latency- and resource-constrained IoT and edge environments. Motion-intensive MBs are identified by comparing their motion intensity against the average motion level of preceding reference frames. Based on these criteria, a dynamic quantization parameter (QP) adjustment strategy is applied to assign finer quantization to perceptually important regions of interest (ROIs) in low-bit-rate scenarios.
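The ROI classification and QP adjustment described above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the `scale` factor, and the QP offsets (`face_delta`, `motion_delta`, `background_delta`) are hypothetical, since the abstract gives no concrete values. Face rectangles are assumed to arrive as `(x, y, w, h)` tuples from some Viola-Jones detector, and per-macroblock motion intensity is assumed to be available from motion estimation.

```python
MB_SIZE = 16             # an H.264/AVC macroblock covers 16x16 luma samples
QP_MIN, QP_MAX = 0, 51   # valid QP range for 8-bit H.264/AVC


def face_mb_mask(face_rects, frame_h, frame_w):
    """Mark every macroblock overlapped by a face rectangle (x, y, w, h)."""
    rows = -(-frame_h // MB_SIZE)  # ceiling division
    cols = -(-frame_w // MB_SIZE)
    mask = [[False] * cols for _ in range(rows)]
    for x, y, w, h in face_rects:
        r0 = max(y // MB_SIZE, 0)
        r1 = min((y + h - 1) // MB_SIZE, rows - 1)
        c0 = max(x // MB_SIZE, 0)
        c1 = min((x + w - 1) // MB_SIZE, cols - 1)
        for r in range(r0, r1 + 1):
            for c in range(c0, c1 + 1):
                mask[r][c] = True
    return mask


def motion_mb_mask(mb_motion, prev_frame_means, scale=1.0):
    """Mark macroblocks whose motion intensity exceeds the average motion
    level of the preceding reference frames (scale is a hypothetical knob)."""
    threshold = scale * sum(prev_frame_means) / len(prev_frame_means)
    return [[m > threshold for m in row] for row in mb_motion]


def adjust_qp(base_qp, face_mask, motion_mask,
              face_delta=-4, motion_delta=-2, background_delta=2):
    """Assign a finer (lower) QP to ROI macroblocks; offsets are assumptions."""
    qp_map = []
    for face_row, motion_row in zip(face_mask, motion_mask):
        qp_row = []
        for is_face, is_moving in zip(face_row, motion_row):
            if is_face:            # faces take priority over motion
                delta = face_delta
            elif is_moving:
                delta = motion_delta
            else:
                delta = background_delta
            qp_row.append(min(max(base_qp + delta, QP_MIN), QP_MAX))
        qp_map.append(qp_row)
    return qp_map


if __name__ == "__main__":
    # 64x48 frame -> 3 rows x 4 columns of macroblocks
    faces = face_mb_mask([(20, 4, 20, 20)], frame_h=48, frame_w=64)
    motion = [[0.0] * 4 for _ in range(3)]
    motion[2][3] = 5.0
    moving = motion_mb_mask(motion, prev_frame_means=[1.0, 3.0])
    for row in adjust_qp(36, faces, moving):
        print(row)
```

Raising the QP of background macroblocks is one possible way to keep the overall bit rate roughly constant while ROIs receive more bits; an actual encoder would couple this map with its rate control.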
The experimental results show that the proposed method achieves superior subjective visual quality and objective Peak Signal-to-Noise Ratio (PSNR) compared to the standard JM software and other state-of-the-art algorithms under the same bit rate constraints. Moreover, the approach introduces only a marginal increase in computational complexity, highlighting its efficiency. Overall, the proposed algorithm offers an effective balance between visual quality and computational performance, making it well suited for video transmission in bandwidth-constrained, resource-limited IoT and edge computing environments.
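For reference, the objective PSNR metric mentioned above is conventionally defined from the mean squared error between original and decoded samples. The sketch below assumes 8-bit samples (peak value 255) passed as flat sequences; the function name `psnr` and this input layout are illustrative assumptions, and the abstract does not state how the authors computed their figures.

```python
import math


def psnr(reference, decoded, peak=255):
    """Peak Signal-to-Noise Ratio in dB between two equal-length sample sequences."""
    if len(reference) != len(decoded) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - d) ** 2 for r, d in zip(reference, decoded)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals: PSNR is unbounded
    return 10.0 * math.log10(peak ** 2 / mse)
```

A higher value means the decoded frame is closer to the original; comparisons are only meaningful at matched bit rates, as in the evaluation described above.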
