Huang Chao, Peng Zongju, Xu Yong, Chen Fen, Jiang Qiuping, Zhang Yun, Jiang Gangyi, Ho Yo-Sung
IEEE Trans Image Process. 2021;30:641-656. doi: 10.1109/TIP.2020.3036766. Epub 2020 Dec 4.
High Efficiency Video Coding (HEVC) can significantly improve the compression efficiency in comparison with the preceding H.264/Advanced Video Coding (AVC) but at the cost of extremely high computational complexity. Hence, it is challenging to realize live video applications on low-delay and power-constrained devices, such as the smart mobile devices. In this article, we propose an online learning-based multi-stage complexity control method for live video coding. The proposed method consists of three stages: multi-accuracy Coding Unit (CU) decision, multi-stage complexity allocation, and Coding Tree Unit (CTU) level complexity control. Consequently, the encoding complexity can be accurately controlled to correspond with the computing capability of the video-capable device by replacing the traditional brute-force search with the proposed algorithm, which properly determines the optimal CU size. Specifically, the multi-accuracy CU decision model is obtained by an online learning approach to accommodate the different characteristics of input videos. In addition, multi-stage complexity allocation is implemented to reasonably allocate the complexity budgets to each coding level. In order to achieve a good trade-off between complexity control and rate distortion (RD) performance, the CTU-level complexity control is proposed to select the optimal accuracy of the CU decision model. The experimental results show that the proposed algorithm can accurately control the coding complexity from 100% to 40%. Furthermore, the proposed algorithm outperforms the state-of-the-art algorithms in terms of both accuracy of complexity control and RD performance.
与先前的H.264/高级视频编码(AVC)相比,高效视频编码(HEVC)可以显著提高压缩效率,但代价是计算复杂度极高。因此,在低延迟和功率受限的设备(如智能移动设备)上实现实时视频应用具有挑战性。在本文中,我们提出了一种基于在线学习的实时视频编码多阶段复杂度控制方法。所提出的方法包括三个阶段:多精度编码单元(CU)决策、多阶段复杂度分配和编码树单元(CTU)级复杂度控制。因此,通过用所提出的算法代替传统的蛮力搜索,可以准确控制编码复杂度以与具有视频处理能力的设备的计算能力相对应,该算法能正确确定最佳CU大小。具体而言,通过在线学习方法获得多精度CU决策模型,以适应输入视频的不同特征。此外,实施多阶段复杂度分配以将复杂度预算合理分配到每个编码级别。为了在复杂度控制和率失真(RD)性能之间取得良好的平衡,提出了CTU级复杂度控制以选择CU决策模型的最佳精度。实验结果表明,所提出的算法可以将编码复杂度从100%精确控制到40%。此外,所提出的算法在复杂度控制精度和RD性能方面均优于现有算法。