SE3: Sequential Semantic Segmentation of Large Images with Minimized Memory

Cheng, Guo; Zheng, Jiang Yu

Files

Cheng2022SE3-AAM.pdf (1.25 MB)

Date

2022-08

Authors

Cheng, Guo

Zheng, Jiang Yu

Language

American English

Department

Computer and Information Science, School of Science

Found At

IEEE

Abstract

Semantic segmentation produces pixel-wise perception at the cost of GPU computation and expensive memory, which makes trained models hard to deploy on small devices at test time. Assuming that adequate hardware is available for training CNN backbones, this work converts them to a linear architecture that enables inference on edge devices. While keeping the same accuracy as patch-mode testing, we segment images using a scanning line with minimal memory. By exploring the periods with which a pyramid network shifts over the image, we perform such sequential semantic segmentation (SE3) with a circular memory to avoid redundant computation and preserve the same receptive field as patches for spatial dependency. In experiments on large drone images and panoramas, we examine this approach in terms of accuracy, parameter memory, and testing speed. Benchmark evaluations demonstrate that, with only one-line computation in linear time, our SE3 network consumes a small fraction of the memory while maintaining accuracy equivalent to patch-based image segmentation. For semantic segmentation of high-resolution images, particularly data streamed from sensors, this method is significant for real-time applications of CNN-based networks on lightweight edge devices.
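The circular-memory idea in the abstract can be illustrated with a toy sketch. This is not the authors' SE3 network: it assumes a single k x k filter with no padding in place of a CNN pyramid, and the names conv_rows, full_image_filter, and line_scan_filter are hypothetical. It only shows that streaming one image line at a time through a ring buffer of k rows can reproduce whole-image filtering while keeping k rows in memory.

```python
from typing import Iterable, Iterator, List

Row = List[int]
Kernel = List[List[int]]


def conv_rows(rows: List[Row], kernel: Kernel) -> Row:
    """Correlate k stacked rows with a k x k kernel (no padding) -> one output row."""
    k = len(kernel)
    width = len(rows[0])
    return [
        sum(kernel[i][j] * rows[i][x + j] for i in range(k) for j in range(k))
        for x in range(width - k + 1)
    ]


def full_image_filter(image: List[Row], kernel: Kernel) -> List[Row]:
    """Reference result: the whole image is held in memory at once."""
    k = len(kernel)
    return [conv_rows(image[y:y + k], kernel) for y in range(len(image) - k + 1)]


def line_scan_filter(lines: Iterable[Row], kernel: Kernel) -> Iterator[Row]:
    """Streaming result: only k rows are resident, stored in a circular buffer."""
    k = len(kernel)
    ring: List[Row] = [[] for _ in range(k)]
    for n, line in enumerate(lines):
        ring[n % k] = line  # overwrite the oldest row in place
        if n >= k - 1:
            # Read the k most recent rows back in top-to-bottom order.
            window = [ring[(n - k + 1 + i) % k] for i in range(k)]
            yield conv_rows(window, kernel)


# Small synthetic image (8 rows x 6 columns) and a Sobel-like kernel.
image = [[(3 * y + 5 * x) % 7 for x in range(6)] for y in range(8)]
kernel = [[1, 0, -1], [2, 0, -2], [1, 0, -1]]

streamed = list(line_scan_filter(iter(image), kernel))
assert streamed == full_image_filter(image, kernel)
```

In this sketch each incoming line is written once and each output line is computed once, so the work grows linearly with the number of lines and the resident memory depends on the kernel height rather than the image height.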

Keywords

semantic segmentation, deep learning, memory, high-resolution image, accuracy lossless, edge computing

Cite As

Cheng, G., & Zheng, J. Y. (2022). SE3: Sequential Semantic Segmentation of Large Images with Minimized Memory. 2022 26th International Conference on Pattern Recognition (ICPR), 3443–3449. https://doi.org/10.1109/ICPR56361.2022.9956578

Journal

2022 26th International Conference on Pattern Recognition (ICPR)

Rights

Publisher Policy

Source

Author

Type

Article

Permanent Link

https://hdl.handle.net/1805/38304

DOI

https://doi.org/10.1109/ICPR56361.2022.9956578

Version

Author's manuscript

Collections

Open Access Policy Articles
Department of Computer and Information Science Works