Technical Program

Paper Detail

Session:Machine Learning for Image and Video Coding
Location:Lecture Room
Session Time:Monday, June 25, 10:20 - 12:40
Presentation Time:Monday, June 25, 11:00 - 11:20
Presentation: Special Session Lecture
Paper Title: Deep Convolutional AutoEncoder-based Lossy Image Compression
Authors: Zhengxue Cheng; Waseda University, Japan 
 Heming Sun; Waseda University, Japan 
 Masaru Takeuchi; Waseda University, Japan 
 Jiro Katto; Waseda University, Japan 
Abstract: Image compression has been investigated as a fundamental research topic for many decades. Recently, deep learning has achieved great success in many computer vision tasks, and is gradually being used in image compression. In this paper, we present a lossy image compression architecture, which utilizes the advantages of convolutional autoencoder (CAE) to achieve a high coding efficiency. First, we design a novel CAE architecture to replace the conventional transforms and train this CAE using a rate-distortion loss function. Second, to generate a more energy-compact representation, we utilize the principal components analysis (PCA) to rotate the feature maps produced by the CAE, and then apply the quantization and entropy coder to generate the codes. Experimental results demonstrate that our method outperforms traditional image coding algorithms, by achieving a 13.7% BD-rate decrement on the Kodak database images compared to JPEG2000. Besides, our method maintains a moderate complexity similar to JPEG2000.