CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation
via Global and Local Refinement
- URL: http://arxiv.org/abs/2005.02551v1
- Date: Wed, 6 May 2020 01:38:03 GMT
- Title: CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation
via Global and Local Refinement
- Authors: Ho Kei Cheng (HKUST), Jihoon Chung (HKUST), Yu-Wing Tai (Tencent),
Chi-Keung Tang (HKUST)
- Abstract summary: State-of-the-art semantic segmentation methods were almost exclusively trained on images within a fixed resolution range.
We propose a novel approach to address the high-resolution segmentation problem without using any high-resolution training data.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: State-of-the-art semantic segmentation methods were almost exclusively
trained on images within a fixed resolution range. These segmentations are
inaccurate for very high-resolution images since using bicubic upsampling of
low-resolution segmentation does not adequately capture high-resolution details
along object boundaries. In this paper, we propose a novel approach to address
the high-resolution segmentation problem without using any high-resolution
training data. The key insight is our CascadePSP network which refines and
corrects local boundaries whenever possible. Although our network is trained
with low-resolution segmentation data, our method is applicable to any
resolution even for very high-resolution images larger than 4K. We present
quantitative and qualitative studies on different datasets to show that
CascadePSP can reveal pixel-accurate segmentation boundaries using our novel
refinement module without any finetuning. Thus, our method can be regarded as
class-agnostic. Finally, we demonstrate the application of our model to scene
parsing in multi-class segmentation.
Related papers
- On the Effect of Image Resolution on Semantic Segmentation [27.115235051091663]
We show that a model capable of directly producing high-resolution segmentations can match the performance of more complex systems.
Our approach leverages a bottom-up information propagation technique across various scales.
We have rigorously tested our method using leading-edge semantic segmentation datasets.
arXiv Detail & Related papers (2024-02-08T04:21:30Z) - Memory-Constrained Semantic Segmentation for Ultra-High Resolution UAV
Imagery [35.96063342025938]
This paper explores the intricate problem of achieving efficient and effective segmentation of ultra-high resolution UAV imagery.
We propose a GPU memory-efficient and effective framework for local inference without accessing the context beyond local patches.
We present an efficient memory-based interaction scheme to correct potential semantic bias of the underlying high-resolution information.
arXiv Detail & Related papers (2023-10-07T07:44:59Z) - A Robust Morphological Approach for Semantic Segmentation of Very High
Resolution Images [2.2230089845369085]
We develop a robust pipeline that seamlessly extends any existing semantic segmentation algorithm to high resolution images.
Our method does not require the ground truth annotations of the high resolution images.
We show that the semantic segmentation results obtained by our method beat the existing state-of-the-art algorithms on high-resolution images.
arXiv Detail & Related papers (2022-08-02T05:25:35Z) - Learning Enriched Features for Fast Image Restoration and Enhancement [166.17296369600774]
This paper presents a holistic goal of maintaining spatially-precise high-resolution representations through the entire network.
We learn an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
Our approach achieves state-of-the-art results for a variety of image processing tasks, including defocus deblurring, image denoising, super-resolution, and image enhancement.
arXiv Detail & Related papers (2022-04-19T17:59:45Z) - Any-resolution Training for High-resolution Image Synthesis [55.19874755679901]
Generative models operate at fixed resolution, even though natural images come in a variety of sizes.
We argue that every pixel matters and create datasets with variable-size images, collected at their native resolutions.
We introduce continuous-scale training, a process that samples patches at random scales to train a new generator with variable output resolutions.
arXiv Detail & Related papers (2022-04-14T17:59:31Z) - High Quality Segmentation for Ultra High-resolution Images [72.97958314291648]
We propose the Continuous Refinement Model for the ultra high-resolution segmentation refinement task.
Our proposed method is fast and effective on image segmentation refinement.
arXiv Detail & Related papers (2021-11-29T11:53:06Z) - Learning to Downsample for Segmentation of Ultra-High Resolution Images [6.432524678252553]
We show that learning the spatially varying downsampling strategy jointly with segmentation offers advantages in segmenting large images with limited computational budget.
Our method adapts the sampling density over different locations so that more samples are collected from the small important regions and less from the others.
We show on two public and one local high-resolution datasets that our method consistently learns sampling locations preserving more information and boosting segmentation accuracy over baseline methods.
arXiv Detail & Related papers (2021-09-22T23:04:59Z) - BoundarySqueeze: Image Segmentation as Boundary Squeezing [104.43159799559464]
We propose a novel method for fine-grained high-quality image segmentation of both objects and scenes.
Inspired by dilation and erosion from morphological image processing techniques, we treat the pixel level segmentation problems as squeezing object boundary.
Our method yields large gains on COCO, Cityscapes, for both instance and semantic segmentation and outperforms previous state-of-the-art PointRend in both accuracy and speed under the same setting.
arXiv Detail & Related papers (2021-05-25T04:58:51Z) - Superpixel Segmentation Based on Spatially Constrained Subspace
Clustering [57.76302397774641]
We consider each representative region with independent semantic information as a subspace, and formulate superpixel segmentation as a subspace clustering problem.
We show that a simple integration of superpixel segmentation with the conventional subspace clustering does not effectively work due to the spatial correlation of the pixels.
We propose a novel convex locality-constrained subspace clustering model that is able to constrain the spatial adjacent pixels with similar attributes to be clustered into a superpixel.
arXiv Detail & Related papers (2020-12-11T06:18:36Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.