SE3: Sequential Semantic Segmentation of Large Images with Minimized Memory

dc.contributor.authorCheng, Guo
dc.contributor.authorZheng, Jiang Yu
dc.contributor.departmentComputer and Information Science, School of Science
dc.date.accessioned2024-02-05T19:14:02Z
dc.date.available2024-02-05T19:14:02Z
dc.date.issued2022-08
dc.description.abstractSemantic segmentation results in pixel-wise perception accompanied with GPU computation and expensive memory, which makes trained models hard to apply to small devices in testing. Assuming the availability of hardware in training CNN backbones, this work converts them to a linear architecture enabling the inference on edge devices. Keeping the same accuracy as patch-mode testing, we segment images using a scanning line with the minimum memory. Exploring periods of pyramid network shifting on image, we perform such sequential semantic segmentation (SE3) with a circular memory to avoid redundant computation and preserve the same receptive field as patches for spatial dependency. In the experiments on large drone images and panoramas, we examine this approach in terms of accuracy, parameter memory, and testing speed. Benchmark evaluations demonstrate that, with only one-line computation in linear time, our designed SE3 network consumes a small fraction of memory to maintain an equivalent accuracy as the image segmentation in patches. Considering semantic segmentation for high-resolution images, particularly for data streamed from sensors, this method is significant to the real-time applications of CNN based networks on light-weighted edge devices.
dc.eprint.versionAuthor's manuscript
dc.identifier.citationCheng, G., & Zheng, J. Y. (2022). SE3: Sequential Semantic Segmentation of Large Images with Minimized Memory. 2022 26th International Conference on Pattern Recognition (ICPR), 3443–3449. https://doi.org/10.1109/ICPR56361.2022.9956578
dc.identifier.urihttps://hdl.handle.net/1805/38304
dc.language.isoen_US
dc.publisherIEEE
dc.relation.isversionof10.1109/ICPR56361.2022.9956578
dc.relation.journal2022 26th International Conference on Pattern Recognition (ICPR)
dc.rightsPublisher Policy
dc.sourceAuthor
dc.subjectsemantic segmentation
dc.subjectdeep learning
dc.subjectmemory
dc.subjecthigh-resolution image
dc.subjectaccuracy lossless
dc.subjectedge computating
dc.titleSE3: Sequential Semantic Segmentation of Large Images with Minimized Memory
dc.typeConference proceedings
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Cheng2022SE3-AAM.pdf
Size:
1.25 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: