• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面向多功能视频编码的目标协作三元树分割决策方法。

Object-Cooperated Ternary Tree Partitioning Decision Method for Versatile Video Coding.

机构信息

School of Computer Science and Engineering, Kyungpook National University, Daegu 41566, Korea.

Department of Computer Engineering, Dong-A University, Busan 49315, Korea.

出版信息

Sensors (Basel). 2022 Aug 23;22(17):6328. doi: 10.3390/s22176328.

DOI:10.3390/s22176328
PMID:36080784
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9460852/
Abstract

In this paper, we propose an object-cooperated decision method for efficient ternary tree (TT) partitioning that reduces the encoding complexity of versatile video coding (VVC). In most previous studies, the VVC complexity was reduced using decision schemes based on the encoding context, which do not apply object detecion models. We assume that high-level objects are important for deciding whether complex TT partitioning is required because they can provide hints on the characteristics of a video. Herein, we apply an object detection model that discovers and extracts the high-level object features-the number and ratio of objects from frames in a video sequence. Using the extracted features, we propose machine learning (ML)-based classifiers for each TT-split direction to efficiently reduce the encoding complexity of VVC and decide whether the TT-split process can be skipped in the vertical or horizontal direction. The TT-split decision of classifiers is formulated as a binary classification problem. Experimental results show that the proposed method more effectively decreases the encoding complexity of VVC than a state-of-the-art model based on ML.

摘要

在本文中,我们提出了一种用于高效三元树 (TT) 划分的目标协作决策方法,以降低通用视频编码 (VVC) 的编码复杂度。在大多数先前的研究中,使用基于编码上下文的决策方案来降低 VVC 的复杂度,这些方案不适用对象检测模型。我们假设高层对象对于决定是否需要复杂的 TT 划分很重要,因为它们可以提供有关视频特征的提示。在此,我们应用对象检测模型来发现和提取高层对象特征——从视频序列中的帧中提取对象的数量和比例。使用提取的特征,我们为每个 TT 分裂方向提出基于机器学习 (ML) 的分类器,以有效地降低 VVC 的编码复杂度,并决定是否可以跳过垂直或水平方向的 TT 分裂过程。分类器的 TT 分裂决策被表述为一个二进制分类问题。实验结果表明,与基于 ML 的最新模型相比,所提出的方法更有效地降低了 VVC 的编码复杂度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/c28d4363df76/sensors-22-06328-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/26d8abcd24db/sensors-22-06328-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/6b962bdcea9e/sensors-22-06328-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/2ea17d096264/sensors-22-06328-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/e34f7499133c/sensors-22-06328-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/d0e61584298d/sensors-22-06328-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/86cd87a16019/sensors-22-06328-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/08835cb2a805/sensors-22-06328-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/5ff2d9e79174/sensors-22-06328-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/c28d4363df76/sensors-22-06328-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/26d8abcd24db/sensors-22-06328-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/6b962bdcea9e/sensors-22-06328-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/2ea17d096264/sensors-22-06328-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/e34f7499133c/sensors-22-06328-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/d0e61584298d/sensors-22-06328-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/86cd87a16019/sensors-22-06328-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/08835cb2a805/sensors-22-06328-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/5ff2d9e79174/sensors-22-06328-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/9460852/c28d4363df76/sensors-22-06328-g009.jpg

相似文献

1
Object-Cooperated Ternary Tree Partitioning Decision Method for Versatile Video Coding.面向多功能视频编码的目标协作三元树分割决策方法。
Sensors (Basel). 2022 Aug 23;22(17):6328. doi: 10.3390/s22176328.
2
Decision tree accelerated CTU partition algorithm for intra prediction in versatile video coding.决策树加速 CTU 分区算法在通用视频编码中的帧内预测。
PLoS One. 2021 Nov 8;16(11):e0258890. doi: 10.1371/journal.pone.0258890. eCollection 2021.
3
A Fast Decision Algorithm for VVC Intra-Coding Based on Texture Feature and Machine Learning.基于纹理特征和机器学习的 VVC 帧内编码快速决策算法。
Comput Intell Neurosci. 2022 Sep 13;2022:7675749. doi: 10.1155/2022/7675749. eCollection 2022.
4
Tunable VVC Frame Partitioning based on Lightweight Machine Learning.基于轻量级机器学习的可调式可变视频编码(VVC)帧分区
IEEE Trans Image Process. 2019 Sep 6. doi: 10.1109/TIP.2019.2938670.
5
Low-Complexity Multiple Transform Selection Combining Multi-Type Tree Partition Algorithm for Versatile Video Coding.低复杂度多变换选择结合多类型树分割算法的通用视频编码。
Sensors (Basel). 2022 Jul 25;22(15):5523. doi: 10.3390/s22155523.
6
A Fast Algorithm for Intra-Frame Versatile Video Coding Based on Edge Features.一种基于边缘特征的帧内通用视频编码快速算法。
Sensors (Basel). 2023 Jul 7;23(13):6244. doi: 10.3390/s23136244.
7
A Study on Fast and Low-Complexity Algorithms for Versatile Video Coding.通用视频编码的快速低复杂度算法研究
Sensors (Basel). 2022 Nov 20;22(22):8990. doi: 10.3390/s22228990.
8
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding.VVC 帧内编码中快速分块的分区图预测。
IEEE Trans Image Process. 2023;32:2237-2251. doi: 10.1109/TIP.2023.3266165. Epub 2023 Apr 21.
9
Temporal Prediction Model-Based Fast Inter CU Partition for Versatile Video Coding.基于时域预测模型的灵活视频编码快速交叉 CU 分区。
Sensors (Basel). 2022 Oct 12;22(20):7741. doi: 10.3390/s22207741.
10
Extended Coding Unit Partitioning for Future Video Coding.面向未来视频编码的扩展编码单元划分
IEEE Trans Image Process. 2019 Nov 28. doi: 10.1109/TIP.2019.2955238.

本文引用的文献

1
DeepQTMT: A Deep Learning Approach for Fast QTMT-Based CU Partition of Intra-Mode VVC.深度QTMT:一种基于深度学习的用于帧内模式VVC的快速基于QTMT的CU划分方法。
IEEE Trans Image Process. 2021;30:5377-5390. doi: 10.1109/TIP.2021.3083447. Epub 2021 Jun 3.
2
Cascade R-CNN: High Quality Object Detection and Instance Segmentation.级联 R-CNN:高质量目标检测和实例分割。
IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1483-1498. doi: 10.1109/TPAMI.2019.2956516. Epub 2021 Apr 1.
3
Focal Loss for Dense Object Detection.用于密集目标检测的焦散损失
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):318-327. doi: 10.1109/TPAMI.2018.2858826. Epub 2018 Jul 23.
4
Mask R-CNN.Mask R-CNN。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):386-397. doi: 10.1109/TPAMI.2018.2844175. Epub 2018 Jun 5.
5
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
6
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.