Paper ID | TEC-5.7 | ||
Paper Title | TWO-PHASE MULTIMODAL IMAGE FUSION USING CONVOLUTIONAL NEURAL NETWORKS | ||
Authors | Kushal Kusram, Shane Transue, Min-Hyung Choi, University of Colorado Denver, United States | ||
Session | TEC-5: Image and Video Processing 1 | ||
Location | Area G | ||
Session Time: | Monday, 20 September, 13:30 - 15:00 | ||
Presentation Time: | Monday, 20 September, 13:30 - 15:00 | ||
Presentation | Poster | ||
Topic | Image and Video Processing: Formation and reconstruction | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | The fusion of multiple imaging modalities presents an important contribution to machine vision but remains an ongoing challenge due to the limitations in traditional calibration methods that perform a single, global alignment. For depth and thermal imaging devices, sensor and lens intrinsics (FOV, resolution, etc.) may vary considerably, making per-pixel fusion accuracy difficult. In this paper, we present AccuFusion, a two-phase non-linear registration method to fuse multimodal images at a per-pixel level to obtain an efficient and accurate image registration. The two phases: the Coarse Fusion Network (CFN) and Refining Fusion Network (RFN), are designed to learn a robust image-space fusion that provides a non-linear mapping for accurate alignment. By employing the refinement process, we obtain per-pixel displacements to minimize local alignment errors and observe an increase of 18% in average accuracy over global registration. |