• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于循环学习的自适应视频流凸包预测

Convex Hull Prediction for Adaptive Video Streaming by Recurrent Learning.

作者信息

Paul Somdyuti, Norkin Andrey, Bovik Alan C

出版信息

IEEE Trans Image Process. 2024;33:5114-5128. doi: 10.1109/TIP.2024.3455989. Epub 2024 Sep 19.

DOI:10.1109/TIP.2024.3455989
PMID:39264770
Abstract

Adaptive video streaming relies on the construction of efficient bitrate ladders to deliver the best possible visual quality to viewers under bandwidth constraints. The traditional method of content dependent bitrate ladder selection requires a video shot to be pre-encoded with multiple encoding parameters to find the optimal operating points given by the convex hull of the resulting rate-quality curves. However, this pre-encoding step is equivalent to an exhaustive search process over the space of possible encoding parameters, which causes significant overhead in terms of both computation and time expenditure. To reduce this overhead, we propose a deep learning based method of content aware convex hull prediction. We employ a recurrent convolutional network (RCN) to implicitly analyze the spatiotemporal complexity of video shots in order to predict their convex hulls. A two-step transfer learning scheme is adopted to train our proposed RCN-Hull model, which ensures sufficient content diversity to analyze scene complexity, while also making it possible to capture the scene statistics of pristine source videos. Our experimental results reveal that our proposed model yields better approximations of the optimal convex hulls, and offers competitive time savings as compared to existing approaches. On average, the pre-encoding time was reduced by 53.8% by our method, while the average Bjøntegaard delta bitrate (BD-rate) of the predicted convex hulls against ground truth was 0.26%, and the mean absolute deviation of the BD-rate distribution was 0.57%.

摘要

自适应视频流依赖于构建高效的比特率阶梯,以便在带宽受限的情况下为观众提供尽可能好的视觉质量。传统的基于内容的比特率阶梯选择方法需要对视频镜头进行多编码参数预编码,以找到由所得速率-质量曲线的凸包给出的最佳工作点。然而,这个预编码步骤相当于在可能的编码参数空间上进行穷举搜索过程,这在计算和时间消耗方面都会导致显著的开销。为了减少这种开销,我们提出了一种基于深度学习的内容感知凸包预测方法。我们采用递归卷积网络(RCN)来隐式分析视频镜头的时空复杂性,以便预测它们的凸包。采用两步迁移学习方案来训练我们提出的RCN-Hull模型,这确保了足够的内容多样性以分析场景复杂性,同时也能够捕捉原始源视频的场景统计信息。我们的实验结果表明,我们提出的模型能够更好地逼近最优凸包,并且与现有方法相比,在时间节省方面具有竞争力。平均而言,我们的方法将预编码时间减少了53.8%,而预测凸包相对于真实情况的平均Bjøntegaard 增量比特率(BD-rate)为0.26%,BD-rate分布的平均绝对偏差为0.57%。

相似文献

1
Convex Hull Prediction for Adaptive Video Streaming by Recurrent Learning.基于循环学习的自适应视频流凸包预测
IEEE Trans Image Process. 2024;33:5114-5128. doi: 10.1109/TIP.2024.3455989. Epub 2024 Sep 19.
2
High Efficiency Video Coding (HEVC)-Based Surgical Telementoring System Using Shallow Convolutional Neural Network.基于高效视频编码 (HEVC) 的浅层卷积神经网络手术远程指导系统。
J Digit Imaging. 2019 Dec;32(6):1027-1043. doi: 10.1007/s10278-019-00206-2.
3
Probabilistic prediction of material stability: integrating convex hulls into active learning.材料稳定性的概率预测:将凸包整合到主动学习中。
Mater Horiz. 2024 Oct 28;11(21):5381-5393. doi: 10.1039/d4mh00432a.
4
Neural networks for convex hull computation.用于凸包计算的神经网络。
IEEE Trans Neural Netw. 1997;8(3):601-11. doi: 10.1109/72.572099.
5
Local convex hulls for a special class of integer multicommodity flow problems.一类特殊整数多商品流问题的局部凸包
Comput Optim Appl. 2016;64(3):881-919. doi: 10.1007/s10589-016-9831-3. Epub 2016 Feb 12.
6
Speeding up VP9 Intra Encoder with Hierarchical Deep Learning Based Partition Prediction.基于分层深度学习的分区预测加速VP9帧内编码器
IEEE Trans Image Process. 2020 Jul 28;PP. doi: 10.1109/TIP.2020.3011270.
7
Building Coarse to Fine Convex Hulls With Auxiliary Vertices for Palette-Based Image Recoloring.利用辅助顶点构建从粗到细的凸包用于基于调色板的图像重着色
IEEE Trans Vis Comput Graph. 2024 Aug;30(8):5581-5595. doi: 10.1109/TVCG.2023.3296386. Epub 2024 Jul 1.
8
Real-Time Adaptation to Time-Varying Constraints for Medical Video Communications.医疗视频通信中时变约束的实时自适应。
IEEE J Biomed Health Inform. 2018 Jul;22(4):1177-1188. doi: 10.1109/JBHI.2017.2726180. Epub 2017 Jul 12.
9
Competitive equilibrium bitrate allocation for multiple video streams.多视频流的竞争均衡比特率分配。
IEEE Trans Image Process. 2010 Apr;19(4):1009-21. doi: 10.1109/TIP.2009.2038777. Epub 2009 Dec 18.
10
A multilayer self-organizing model for convex-hull computation.一种用于凸包计算的多层自组织模型。
IEEE Trans Neural Netw. 2001;12(6):1341-7. doi: 10.1109/72.963770.