Technical Program

Paper Detail

Session:Machine Learning for Image and Video Coding
Location:Lecture Room
Session Time:Monday, June 25, 10:20 - 12:40
Presentation Time:Monday, June 25, 10:20 - 10:40
Presentation: Special Session Lecture
Paper Title: RESTRICTED BOLTZMANN MACHINE IMAGE COMPRESSION
Authors: Markus Küchhold; Technische Universität Berlin, Germany 
 Maik Simon; Technische Universität Berlin, Germany 
 Thomas Sikora; Technische Universität Berlin, Germany 
Abstract: We propose a novel lossy block-based image compression approach. Our approach builds on non-linear autoencoders that can, when properly trained, explore non-linear statistical dependencies in the image blocks for redundancy reduction. In contrast the DCT employed in JPEG is inherently restricted to exploration of linear dependencies using a second-order statistics framework. The coder is based on pre-trained class-specific Restricted Boltzmann Machines (RBM). These machines are statistical variants of neural network autoencoders that directly map pixel values in image blocks into coded bits. Decoders can be implemented with low computational complexity in a codebook design. Experimental results show that our RBM-codec outperforms JPEG at high compression rates, both in terms of PSNR, SSIM and subjective results.