IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
ST-3-IP: VCA: Video Complexity Analyzer
Wed, 25 May, 14:00 - 17:00 China Time (UTC +8)
Wed, 25 May, 06:00 - 09:00 UTC
Location: Peony Junior Ballroom 4511
In-Person
Show & Tell
Presented by: Hadi Amirpour (Alpen-Adria-Universität Klagenfurt), Vignesh V Menon (Alpen-Adria-Universität Klagenfurt), Mohammad Ghanbari (School of Computer Science and Electronic Engineering, University of Essex, Colchester, UK), and Christian Timmerer (Alpen-Adria-Universität Klagenfurt)

We have released Video Complexity Analyzer (VCA) version 1.0 open-source software on Feb 14, 2022 as a valentine’s day gift to video coding enthusiasts across the globe. The primary objective of VCA is to become the best spatial and temporal complexity predictor for every video (segment) which aids in predicting encoding parameters for applications like scene-cut detection and online per-title encoding. VCA leverages x86 SIMD and multi-threading optimizations for effective performance. While VCA is primarily designed as a video complexity analyzer library, a command-line executable is provided to facilitate testing and development. VCA is available as an open-source library, published under the GPLv3 license.

According to the Bitmovin Video Developer Report 2021, live streaming at scale has the highest scope for innovation in video streaming services. Currently, there are no open-source implementations available which can predict video complexity for live streaming applications. To this light, we plan to demo the functions of VCA software, and show accuracy of the complexities analyzed by VCA using the heatmaps, and show-case the speed of video complexity analysis. VCA can achieve an analysis speed of about 370fps compared to the 5fps speed of the reference SITI implementation. Hence, we show that it can be used for live streaming applications.

We expect VCA to be impactful in many leading video encoding solutions in the coming years. Video complexity analysis is also proved to be an important step in applications like rate-distortion modeling and modeling QoE evaluation metrics. Fast video complexity analysis can be used in online per-title encoding schemes which determine optimized resolution, bitrate-ladder, framerate, and other relevant encoding parameters for live streaming applications.

In the demo, the attendees can watch the spatial and temporal complexity heatmap of the test video, to understand the accuracy of the features predicted. Finally, we show the shot-transitions of the test video detected by VCA as an use-case of the analyzed complexity features.