FAIRS -- Soft Focus Generator and Attention for Robust Object
Segmentation from Extreme Points
- URL: http://arxiv.org/abs/2004.02038v1
- Date: Sat, 4 Apr 2020 22:25:47 GMT
- Title: FAIRS -- Soft Focus Generator and Attention for Robust Object
Segmentation from Extreme Points
- Authors: Ahmed H. Shahin, Prateek Munjal, Ling Shao, Shadab Khan
- Abstract summary: We present a new approach to generate object segmentation from user inputs in the form of extreme points and corrective clicks.
We demonstrate our method's ability to generate high-quality training data as well as its scalability in incorporating extreme points, guiding clicks, and corrective clicks in a principled manner.
- Score: 70.65563691392987
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semantic segmentation from user inputs has been actively studied to
facilitate interactive segmentation for data annotation and other applications.
Recent studies have shown that extreme points can be effectively used to encode
user inputs. A heat map generated from the extreme points can be appended to
the RGB image and input to the model for training. In this study, we present
FAIRS -- a new approach to generate object segmentation from user inputs in the
form of extreme points and corrective clicks. We propose a novel approach for
effectively encoding the user input from extreme points and corrective clicks,
in a novel and scalable manner that allows the network to work with a variable
number of clicks, including corrective clicks for output refinement. We also
integrate a dual attention module with our approach to increase the efficacy of
the model in preferentially attending to the objects. We demonstrate that these
additions help achieve significant improvements over state-of-the-art in dense
object segmentation from user inputs, on multiple large-scale datasets. Through
experiments, we demonstrate our method's ability to generate high-quality
training data as well as its scalability in incorporating extreme points,
guiding clicks, and corrective clicks in a principled manner.
Related papers
- Interactive segmentation in aerial images: a new benchmark and an open
access web-based tool [2.729446374377189]
In recent years, interactive semantic segmentation proposed in computer vision has achieved an ideal state of human-computer interaction segmentation.
This study aims to bridge the gap between interactive segmentation and remote sensing analysis by conducting benchmark study on various interactive segmentation models.
arXiv Detail & Related papers (2023-08-25T04:49:49Z) - CFR-ICL: Cascade-Forward Refinement with Iterative Click Loss for
Interactive Image Segmentation [2.482735440750151]
We propose a click-based and mask-guided interactive image segmentation framework containing three novel components.
The proposed framework offers a unified inference framework to generate segmentation results in a coarse-to-fine manner.
Our model reduces by 33.2%, and 15.5% the number of clicks required to surpass an IoU of 0.95 in the previous state-of-the-art approach.
arXiv Detail & Related papers (2023-03-09T23:20:35Z) - Cascaded Sparse Feature Propagation Network for Interactive Segmentation [18.584007891618096]
We propose a cascade sparse feature propagation network that learns a click-augmented feature representation for propagating user-provided information to unlabeled regions.
We validate the effectiveness of our method through comprehensive experiments on various benchmarks, and the results demonstrate the superior performance of our approach.
arXiv Detail & Related papers (2022-03-10T03:47:24Z) - SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object
Detection [78.90102636266276]
We propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA)
Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm to help retain more important foreground points during down-sampling.
In practice, SASA shows to be effective in identifying valuable points related to foreground objects and improving feature learning for point-based 3D detection.
arXiv Detail & Related papers (2022-01-06T08:54:47Z) - Interactive segmentation using U-Net with weight map and dynamic user
interactions [0.0]
We propose a novel interactive segmentation framework, where user clicks are dynamically adapted in size based on the current segmentation mask.
The clicked regions form a weight map and are fed to a deep neural network as a novel weighted loss function.
Applying dynamic user click sizes increases the overall accuracy by 5.60% and 10.39% respectively by utilizing only a single user interaction.
arXiv Detail & Related papers (2021-11-18T15:08:11Z) - Adversarial Feature Augmentation and Normalization for Visual
Recognition [109.6834687220478]
Recent advances in computer vision take advantage of adversarial data augmentation to ameliorate the generalization ability of classification models.
Here, we present an effective and efficient alternative that advocates adversarial augmentation on intermediate feature embeddings.
We validate the proposed approach across diverse visual recognition tasks with representative backbone networks.
arXiv Detail & Related papers (2021-03-22T20:36:34Z) - EDN: Salient Object Detection via Extremely-Downsampled Network [66.38046176176017]
We introduce an Extremely-Downsampled Network (EDN), which employs an extreme downsampling technique to effectively learn a global view of the whole image.
Experiments demonstrate that EDN achieves sArt performance with real-time speed.
arXiv Detail & Related papers (2020-12-24T04:23:48Z) - Point-Set Anchors for Object Detection, Instance Segmentation and Pose
Estimation [85.96410825961966]
We argue that the image features extracted at a central point contain limited information for predicting distant keypoints or bounding box boundaries.
To facilitate inference, we propose to instead perform regression from a set of points placed at more advantageous positions.
We apply this proposed framework, called Point-Set Anchors, to object detection, instance segmentation, and human pose estimation.
arXiv Detail & Related papers (2020-07-06T15:59:56Z) - Mining Implicit Entity Preference from User-Item Interaction Data for
Knowledge Graph Completion via Adversarial Learning [82.46332224556257]
We propose a novel adversarial learning approach by leveraging user interaction data for the Knowledge Graph Completion task.
Our generator is isolated from user interaction data, and serves to improve the performance of the discriminator.
To discover implicit entity preference of users, we design an elaborate collaborative learning algorithms based on graph neural networks.
arXiv Detail & Related papers (2020-03-28T05:47:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.