Attention Toward Neighbors: A Context Aware Framework for High
Resolution Image Segmentation
- URL: http://arxiv.org/abs/2106.12902v1
- Date: Thu, 24 Jun 2021 10:58:09 GMT
- Title: Attention Toward Neighbors: A Context Aware Framework for High
Resolution Image Segmentation
- Authors: Fahim Faisal Niloy, M. Ashraful Amin, Amin Ahsan Ali, AKM Mahbubur
Rahman
- Abstract summary: We propose a novel framework to segment a particular patch by incorporating contextual information from its neighboring patches.
This allows the segmentation network to see the target patch with a wider field of view without the need of larger feature maps.
- Score: 2.9210447295585724
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: High-resolution image segmentation remains challenging and error-prone due to
the enormous size of intermediate feature maps. Conventional methods avoid this
problem by using patch based approaches where each patch is segmented
independently. However, independent patch segmentation induces errors,
particularly at the patch boundary due to the lack of contextual information in
very high-resolution images where the patch size is much smaller compared to
the full image. To overcome these limitations, in this paper, we propose a
novel framework to segment a particular patch by incorporating contextual
information from its neighboring patches. This allows the segmentation network
to see the target patch with a wider field of view without the need of larger
feature maps. Comparative analysis from a number of experiments shows that our
proposed framework is able to segment high resolution images with significantly
improved mean Intersection over Union and overall accuracy.
Related papers
- Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation [14.419786126791369]
Weakly supervised semantic segmentation (WSSS) with image-level labels aims to achieve segmentation tasks without dense annotations.
Co-occurrence problem is widely present and leads to false activation of objects in WSSS.
We devise a 'Separate and Conquer' scheme SeCo to tackle this issue from dimensions of image space and feature space.
arXiv Detail & Related papers (2024-02-28T16:43:27Z) - DBAT: Dynamic Backward Attention Transformer for Material Segmentation
with Cross-Resolution Patches [8.812837829361923]
We propose the Dynamic Backward Attention Transformer (DBAT) to aggregate cross-resolution features.
Experiments show that our DBAT achieves an accuracy of 86.85%, which is the best performance among state-of-the-art real-time models.
We further align features to semantic labels, performing network dissection, to infer that the proposed model can extract material-related features better than other methods.
arXiv Detail & Related papers (2023-05-06T03:47:20Z) - High-Quality Entity Segmentation [110.55724145851725]
CropFormer is designed to tackle the intractability of instance-level segmentation on high-resolution images.
It improves mask prediction by fusing high-res image crops that provide more fine-grained image details and the full image.
With CropFormer, we achieve a significant AP gain of $1.9$ on the challenging entity segmentation task.
arXiv Detail & Related papers (2022-11-10T18:58:22Z) - Image-Specific Information Suppression and Implicit Local Alignment for
Text-based Person Search [61.24539128142504]
Text-based person search (TBPS) is a challenging task that aims to search pedestrian images with the same identity from an image gallery given a query text.
Most existing methods rely on explicitly generated local parts to model fine-grained correspondence between modalities.
We propose an efficient joint Multi-level Alignment Network (MANet) for TBPS, which can learn aligned image/text feature representations between modalities at multiple levels.
arXiv Detail & Related papers (2022-08-30T16:14:18Z) - HIPA: Hierarchical Patch Transformer for Single Image Super Resolution [62.7081074931892]
This paper presents HIPA, a novel Transformer architecture that progressively recovers the high resolution image using a hierarchical patch partition.
We build a cascaded model that processes an input image in multiple stages, where we start with tokens with small patch sizes and gradually merge to the full resolution.
Such a hierarchical patch mechanism not only explicitly enables feature aggregation at multiple resolutions but also adaptively learns patch-aware features for different image regions.
arXiv Detail & Related papers (2022-03-19T05:09:34Z) - Context-aware Padding for Semantic Segmentation [82.37483350347559]
We propose a context-aware (CA) padding approach to extend the image.
Using context-aware padding, the ResNet-based segmentation model achieves higher mean Intersection-Over-Union than the traditional zero padding.
arXiv Detail & Related papers (2021-09-16T10:33:21Z) - From Contexts to Locality: Ultra-high Resolution Image Segmentation via
Locality-aware Contextual Correlation [43.70432772819461]
We innovate the widely used high-resolution image segmentation pipeline.
An ultra-high resolution image is partitioned into regular patches for local segmentation and then the local results are merged into a high-resolution semantic mask.
arXiv Detail & Related papers (2021-09-06T16:26:05Z) - Look Closer to Segment Better: Boundary Patch Refinement for Instance
Segmentation [51.59290734837372]
We propose a conceptually simple yet effective post-processing refinement framework to improve the boundary quality.
The proposed BPR framework yields significant improvements over the Mask R-CNN baseline on Cityscapes benchmark.
By applying the BPR framework to the PolyTransform + SegFix baseline, we reached 1st place on the Cityscapes leaderboard.
arXiv Detail & Related papers (2021-04-12T07:10:48Z) - Improving Semantic Segmentation via Decoupled Body and Edge Supervision [89.57847958016981]
Existing semantic segmentation approaches either aim to improve the object's inner consistency by modeling the global context, or refine objects detail along their boundaries by multi-scale feature fusion.
In this paper, a new paradigm for semantic segmentation is proposed.
Our insight is that appealing performance of semantic segmentation requires textitexplicitly modeling the object textitbody and textitedge, which correspond to the high and low frequency of the image.
We show that the proposed framework with various baselines or backbone networks leads to better object inner consistency and object boundaries.
arXiv Detail & Related papers (2020-07-20T12:11:22Z) - CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation
via Global and Local Refinement [0.0]
State-of-the-art semantic segmentation methods were almost exclusively trained on images within a fixed resolution range.
We propose a novel approach to address the high-resolution segmentation problem without using any high-resolution training data.
arXiv Detail & Related papers (2020-05-06T01:38:03Z) - Multi-Scale Superpatch Matching using Dual Superpixel Descriptors [0.6875312133832078]
Over-segmentation into superpixels is a very effective dimensionality reduction strategy, enabling fast dense image processing.
The inherent irregularity of the image decomposition compared to standard hierarchical multi-resolution schemes is a problem.
We introduce the dual superpatch, a novel superpixel neighborhood descriptor.
arXiv Detail & Related papers (2020-03-09T22:04:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.