Deep Level Set for Box-supervised Instance Segmentation in Aerial Images
- URL: http://arxiv.org/abs/2112.03451v1
- Date: Tue, 7 Dec 2021 02:27:58 GMT
- Title: Deep Level Set for Box-supervised Instance Segmentation in Aerial Images
- Authors: Wentong Li, Yijie Chen, Wenyu Liu, Jianke Zhu
- Abstract summary: We propose a novel aerial instance segmentation approach, which drives the network to learn a series of level set functions for the aerial objects.
The experimental results demonstrate that the proposed approach outperforms the state-of-the-art box-supervised instance segmentation methods.
- Score: 27.659592291045414
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Box-supervised instance segmentation has recently attracted lots of research
efforts while little attention is received in aerial image domain. In contrast
to the general object collections, aerial objects have large intra-class
variances and inter-class similarity with complex background. Moreover, there
are many tiny objects in the high-resolution satellite images. This makes the
recent pairwise affinity modeling method inevitably to involve the noisy
supervision with the inferior results. To tackle these problems, we propose a
novel aerial instance segmentation approach, which drives the network to learn
a series of level set functions for the aerial objects with only box
annotations in an end-to-end fashion. Instead of learning the pairwise
affinity, the level set method with the carefully designed energy functions
treats the object segmentation as curve evolution, which is able to accurately
recover the object's boundaries and prevent the interference from the
indistinguishable background and similar objects. The experimental results
demonstrate that the proposed approach outperforms the state-of-the-art
box-supervised instance segmentation methods. The source code is available at
https://github.com/LiWentomng/boxlevelset.
Related papers
- Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Box-supervised Instance Segmentation with Level Set Evolution [41.19797478617953]
We propose a box-supervised instance segmentation approach, which integrates the classical level set model with deep neural network delicately.
A simple mask supervised SOLOv2 model is adapted to predict the instance-aware mask map as the level set for each instance.
The experimental results on four challenging benchmarks demonstrate the leading performance of our proposed approach.
arXiv Detail & Related papers (2022-07-19T03:59:44Z) - Discovering Objects that Can Move [55.743225595012966]
We study the problem of object discovery -- separating objects from the background without manual labels.
Existing approaches utilize appearance cues, such as color, texture, and location, to group pixels into object-like regions.
We choose to focus on dynamic objects -- entities that can move independently in the world.
arXiv Detail & Related papers (2022-03-18T21:13:56Z) - Robust Region Feature Synthesizer for Zero-Shot Object Detection [87.79902339984142]
We build a novel zero-shot object detection framework that contains an Intra-class Semantic Diverging component and an Inter-class Structure Preserving component.
It is the first study to carry out zero-shot object detection in remote sensing imagery.
arXiv Detail & Related papers (2022-01-01T03:09:15Z) - Multi-patch Feature Pyramid Network for Weakly Supervised Object
Detection in Optical Remote Sensing Images [39.25541709228373]
We propose a new architecture for object detection with a multiple patch feature pyramid network (MPFP-Net)
MPFP-Net is different from the current models that during training only pursue the most discriminative patches.
We introduce an effective method to regularize the residual values and make the fusion transition layers strictly norm-preserving.
arXiv Detail & Related papers (2021-08-18T09:25:39Z) - Instance Localization for Self-supervised Detection Pretraining [68.24102560821623]
We propose a new self-supervised pretext task, called instance localization.
We show that integration of bounding boxes into pretraining promotes better task alignment and architecture alignment for transfer learning.
Experimental results demonstrate that our approach yields state-of-the-art transfer learning results for object detection.
arXiv Detail & Related papers (2021-02-16T17:58:57Z) - Robust Instance Segmentation through Reasoning about Multi-Object
Occlusion [9.536947328412198]
We propose a deep network for multi-object instance segmentation that is robust to occlusion.
Our work builds on Compositional Networks, which learn a generative model of neural feature activations to locate occluders.
In particular, we obtain feed-forward predictions of the object classes and their instance and occluder segmentations.
arXiv Detail & Related papers (2020-12-03T17:41:55Z) - Multiple instance learning on deep features for weakly supervised object
detection with extreme domain shifts [1.9336815376402716]
Weakly supervised object detection (WSOD) using only image-level annotations has attracted a growing attention over the past few years.
We show that a simple multiple instance approach applied on pre-trained deep features yields excellent performances on non-photographic datasets.
arXiv Detail & Related papers (2020-08-03T20:36:01Z) - Multi-scale Interactive Network for Salient Object Detection [91.43066633305662]
We propose the aggregate interaction modules to integrate the features from adjacent levels.
To obtain more efficient multi-scale features, the self-interaction modules are embedded in each decoder unit.
Experimental results on five benchmark datasets demonstrate that the proposed method without any post-processing performs favorably against 23 state-of-the-art approaches.
arXiv Detail & Related papers (2020-07-17T15:41:37Z) - Object-Centric Image Generation from Layouts [93.10217725729468]
We develop a layout-to-image-generation method to generate complex scenes with multiple objects.
Our method learns representations of the spatial relationships between objects in the scene, which lead to our model's improved layout-fidelity.
We introduce SceneFID, an object-centric adaptation of the popular Fr'echet Inception Distance metric, that is better suited for multi-object images.
arXiv Detail & Related papers (2020-03-16T21:40:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.