Paper ID | COM-1.10 | ||
Paper Title | VIDEO MULTIMETHOD ASSESSMENT FUSION BASED RATE-DISTORTION OPTIMIZATION FOR VERSATILE VIDEO CODING | ||
Authors | Han Zhang, Shanghai Jiao Tong University, China; Jizheng Xu, Bytedance Inc., United States; Li Song, Shanghai Jiao Tong University, China | ||
Session | COM-1: Image and Video Coding | ||
Location | Area H | ||
Session Time: | Tuesday, 21 September, 08:00 - 09:30 | ||
Presentation Time: | Tuesday, 21 September, 08:00 - 09:30 | ||
Presentation | Poster | ||
Topic | Image and Video Communications: Lossy coding of images & video | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | The emerging visual quality assessment metric VMAF that fuses several elementary metrics by SVM regression has shown a higher correlation with human perception. In this paper, we introduce VMAF into the traditional video coding task as the distortion metric, which needs to be optimized to explore the potential of perceptual quality improvement. Specifically, we propose a multi-granularity VMAF based rate-distortion optimization framework. A frame level visual quality adaption is first conducted by taking the quantization characteristics into account at the coarse-grained adjustment step. Within each frame, a CTU level Lagrangian multiplier and corresponding quantization parameter adaption are carried out based on the content of each CTU at the fine-grained adjustment step. The proposed method has been incorporated into the latest video coding standard – VVC. Experimental results show compared with the conventional rate-distortion optimization for SSE, the proposed method achieves an average 3.30% BD-rate reduction in VMAF. |