IDA: Improved Data Augmentation Applied to Salient Object Detection
- URL: http://arxiv.org/abs/2009.08845v1
- Date: Fri, 18 Sep 2020 14:03:27 GMT
- Title: IDA: Improved Data Augmentation Applied to Salient Object Detection
- Authors: Daniel V. Ruiz and Bruno A. Krinski and Eduardo Todt
- Abstract summary: We present an Improved Data Augmentation (IDA) technique focused on Salient Object Detection (SOD)
Our method combines image inpainting, affine transformations, and the linear combination of different generated background images with salient objects extracted from labeled data.
We show that our method improves the segmentation quality when used for training state-of-the-art neural networks on several famous datasets of the SOD field.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we present an Improved Data Augmentation (IDA) technique
focused on Salient Object Detection (SOD). Standard data augmentation
techniques proposed in the literature, such as image cropping, rotation,
flipping, and resizing, only generate variations of the existing examples,
providing a limited generalization. Our method combines image inpainting,
affine transformations, and the linear combination of different generated
background images with salient objects extracted from labeled data. Our
proposed technique enables more precise control of the object's position and
size while preserving background information. The background choice is based on
an inter-image optimization, while object size follows a uniform random
distribution within a specified interval, and the object position is
intra-image optimal. We show that our method improves the segmentation quality
when used for training state-of-the-art neural networks on several famous
datasets of the SOD field. Combining our method with others surpasses
traditional techniques such as horizontal-flip in 0.52% for F-measure and 1.19%
for Precision. We also provide an evaluation in 7 different SOD datasets, with
9 distinct evaluation metrics and an average ranking of the evaluated methods.
Related papers
- PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation [15.342060815068347]
PixelMan is an inversion-free and training-free method for achieving consistent object editing via Pixel Manipulation and generation.
We show that in as few as 16 inference steps, PixelMan outperforms a range of state-of-the-art training-based and training-free methods.
arXiv Detail & Related papers (2024-12-18T19:24:15Z) - RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images [13.051302134031808]
We introduce a novel method for calculating the 6DoF pose of an object using a single RGB-D image.
Unlike existing methods that either directly predict objects' poses or rely on sparse keypoints for pose recovery, our approach addresses this challenging task using dense correspondence.
arXiv Detail & Related papers (2024-05-14T10:10:45Z) - Diff-DOPE: Differentiable Deep Object Pose Estimation [29.703385848843414]
We introduce Diff-DOPE, a 6-DoF pose refiner that takes as input an image, a 3D textured model of an object, and an initial pose of the object.
The method uses differentiable rendering to update the object pose to minimize the visual error between the image and the projection of the model.
We show that this simple, yet effective, idea is able to achieve state-of-the-art results on pose estimation datasets.
arXiv Detail & Related papers (2023-09-30T18:52:57Z) - Randomize to Generalize: Domain Randomization for Runway FOD Detection [1.4249472316161877]
Tiny Object Detection is challenging due to small size, low resolution, occlusion, background clutter, lighting conditions and small object-to-image ratio.
We propose a novel two-stage methodology Synthetic Image Augmentation (SRIA) to enhance generalization capabilities of models encountering 2D datasets.
We report that detection accuracy improved from an initial 41% to 92% for OOD test set.
arXiv Detail & Related papers (2023-09-23T05:02:31Z) - Unsupervised Domain Adaptive Fundus Image Segmentation with Few Labeled
Source Data [17.106866501665916]
unsupervised domain adaptation (UDA) methods enhance the models' generalization ability on the unlabeled target fundus datasets.
UDA methods always require sufficient labeled data from the source domain, bringing auxiliary data acquisition and annotation costs.
We propose a Searching-based Multi-style Invariant Mechanism to diversify the source data style and increase the data amount.
Our method has outperformed several state-of-the-art UDA segmentation methods under the UDA fundus segmentation with few labeled source data.
arXiv Detail & Related papers (2022-10-10T00:30:48Z) - Feature transforms for image data augmentation [74.12025519234153]
In image classification, many augmentation approaches utilize simple image manipulation algorithms.
In this work, we build ensembles on the data level by adding images generated by combining fourteen augmentation approaches.
Pretrained ResNet50 networks are finetuned on training sets that include images derived from each augmentation method.
arXiv Detail & Related papers (2022-01-24T14:12:29Z) - Weakly Supervised Change Detection Using Guided Anisotropic Difusion [97.43170678509478]
We propose original ideas that help us to leverage such datasets in the context of change detection.
First, we propose the guided anisotropic diffusion (GAD) algorithm, which improves semantic segmentation results.
We then show its potential in two weakly-supervised learning strategies tailored for change detection.
arXiv Detail & Related papers (2021-12-31T10:03:47Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose
Estimation with Decoupled Rotation Mechanism [49.89268018642999]
We propose a fast shape-based network (FS-Net) with efficient category-level feature extraction for 6D pose estimation.
The proposed method achieves state-of-the-art performance in both category- and instance-level 6D object pose estimation.
arXiv Detail & Related papers (2021-03-12T03:07:24Z) - Scale Normalized Image Pyramids with AutoFocus for Object Detection [75.71320993452372]
A scale normalized image pyramid (SNIP) is generated that, like human vision, only attends to objects within a fixed size range at different scales.
We propose an efficient spatial sub-sampling scheme which only operates on fixed-size sub-regions likely to contain objects.
The resulting algorithm is referred to as AutoFocus and results in a 2.5-5 times speed-up during inference when used with SNIP.
arXiv Detail & Related papers (2021-02-10T18:57:53Z) - FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning [64.32306537419498]
We propose a novel learned feature-based refinement and augmentation method that produces a varied set of complex transformations.
These transformations also use information from both within-class and across-class representations that we extract through clustering.
We demonstrate that our method is comparable to current state of art for smaller datasets while being able to scale up to larger datasets.
arXiv Detail & Related papers (2020-07-16T17:55:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.