Saliency for free: Saliency prediction as a side-effect of object
recognition
- URL: http://arxiv.org/abs/2107.09628v1
- Date: Tue, 20 Jul 2021 17:17:28 GMT
- Title: Saliency for free: Saliency prediction as a side-effect of object
recognition
- Authors: Carola Figueroa-Flores, David Berga, Joost van der Weijer and Bogdan
Raducanu
- Abstract summary: We show that saliency maps can be generated as a side-effect of training an object recognition deep neural network.
Such a network does not require any ground-truth saliency maps for training.
Extensive experiments carried out on both real and synthetic saliency datasets demonstrate that our approach is able to generate accurate saliency maps.
- Score: 4.609056834401648
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Saliency is the perceptual capacity of our visual system to focus our
attention (i.e. gaze) on relevant objects. Neural networks for saliency
estimation require ground truth saliency maps for training which are usually
achieved via eyetracking experiments. In the current paper, we demonstrate that
saliency maps can be generated as a side-effect of training an object
recognition deep neural network that is endowed with a saliency branch. Such a
network does not require any ground-truth saliency maps for training.Extensive
experiments carried out on both real and synthetic saliency datasets
demonstrate that our approach is able to generate accurate saliency maps,
achieving competitive results on both synthetic and real datasets when compared
to methods that do require ground truth data.
Related papers
- Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems [80.62854148838359]
Eye image segmentation is a critical step in eye tracking that has great influence over the final gaze estimate.
We use dimensionality-reduction techniques to measure the overlap between the target eye images and synthetic training data.
Our methods result in robust, improved performance when tackling the discrepancy between simulation and real-world data samples.
arXiv Detail & Related papers (2024-03-23T22:32:06Z) - Exploring the Effectiveness of Dataset Synthesis: An application of
Apple Detection in Orchards [68.95806641664713]
We explore the usability of Stable Diffusion 2.1-base for generating synthetic datasets of apple trees for object detection.
We train a YOLOv5m object detection model to predict apples in a real-world apple detection dataset.
Results demonstrate that the model trained on generated data is slightly underperforming compared to a baseline model trained on real-world images.
arXiv Detail & Related papers (2023-06-20T09:46:01Z) - Neural Maximum A Posteriori Estimation on Unpaired Data for Motion
Deblurring [87.97330195531029]
We propose a Neural Maximum A Posteriori (NeurMAP) estimation framework for training neural networks to recover blind motion information and sharp content from unpaired data.
The proposed NeurMAP is an approach to existing deblurring neural networks, and is the first framework that enables training image deblurring networks on unpaired datasets.
arXiv Detail & Related papers (2022-04-26T08:09:47Z) - Self-supervised Audiovisual Representation Learning for Remote Sensing Data [96.23611272637943]
We propose a self-supervised approach for pre-training deep neural networks in remote sensing.
By exploiting the correspondence between geo-tagged audio recordings and remote sensing, this is done in a completely label-free manner.
We show that our approach outperforms existing pre-training strategies for remote sensing imagery.
arXiv Detail & Related papers (2021-08-02T07:50:50Z) - Predicting Depth from Semantic Segmentation using Game Engine Dataset [0.0]
This thesis investigates the relation of perception of objects and depth estimation convolutional neural networks.
We developed new network structures based on a simple depth estimation network that only used a single image at its input.
Results show that our novel structures can improve the performance of depth estimation by 52% of relative error of distance.
arXiv Detail & Related papers (2021-06-12T10:15:40Z) - Learning Topology from Synthetic Data for Unsupervised Depth Completion [66.26787962258346]
We present a method for inferring dense depth maps from images and sparse depth measurements.
We learn the association of sparse point clouds with dense natural shapes, and using the image as evidence to validate the predicted depth map.
arXiv Detail & Related papers (2021-06-06T00:21:12Z) - Topological Uncertainty: Monitoring trained neural networks through
persistence of activation graphs [0.9786690381850356]
In industrial applications, data coming from an open-world setting might widely differ from the benchmark datasets on which a network was trained.
We develop a method to monitor trained neural networks based on the topological properties of their activation graphs.
arXiv Detail & Related papers (2021-05-07T14:16:03Z) - Unsupervised Metric Relocalization Using Transform Consistency Loss [66.19479868638925]
Training networks to perform metric relocalization traditionally requires accurate image correspondences.
We propose a self-supervised solution, which exploits a key insight: localizing a query image within a map should yield the same absolute pose, regardless of the reference image used for registration.
We evaluate our framework on synthetic and real-world data, showing our approach outperforms other supervised methods when a limited amount of ground-truth information is available.
arXiv Detail & Related papers (2020-11-01T19:24:27Z) - Hallucinating Saliency Maps for Fine-Grained Image Classification for
Limited Data Domains [27.91871214060683]
We propose an approach which does not require explicit saliency maps to improve image classification.
We show that our approach obtains similar results as the case when the saliency maps are provided explicitely.
In addition, we show that our saliency estimation method, which is trained without any saliency groundtruth data, obtains competitive results on real image saliency benchmark (Toronto)
arXiv Detail & Related papers (2020-07-24T15:08:55Z) - Learning Topometric Semantic Maps from Occupancy Grids [2.5234065536725963]
We propose a new approach for deriving such instance-based semantic maps purely from occupancy grids.
We employ a combination of deep learning techniques to detect, segment and extract door hypotheses from a random-sized map.
We evaluate our approach on several publicly available real-world data sets.
arXiv Detail & Related papers (2020-01-10T22:06:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.