Paper ID | SS-IVC-DL.7 | ||
Paper Title | Machine-Learning Based Secondary Transform for Improved Image Compression in JPEG2000 | ||
Authors | Xinyue Li, Aous Naman, David Taubman, University of New South Wales, Australia | ||
Session | SS-IVC-DL: Special Session: Optimized Image and Video Coding Using Deep Learning | ||
Location | Area B | ||
Session Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation | Poster | ||
Topic | Special Sessions: Optimized image and video coding schemes using deep learning | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | This paper proposes a convolutional neural network (CNN) based secondary transform for not only improved coding efficiency in the JPEG2000 image compression format, but also to produce more appealing approximation sub-bands at different resolutions. The CNN in this work exploits information in detail sub-bands to predict some of the aliasing information in the corresponding approximation or low-pass sub-band; this reduction in aliasing, although not perfect, improves the compressibility of the "cleaned'' approximation sub-band. This process is repeated in subsequent wavelet decomposition levels to further improve coding efficiency. Experimental results show that, at high bit rates, the proposed network outperforms conventional JPEG2000 compression framework by up to 1.2 dB, especially for images with strong geometric flow. |