ODSmoothGrad: Generating Saliency Maps for Object Detectors
- URL: http://arxiv.org/abs/2304.07609v1
- Date: Sat, 15 Apr 2023 18:21:56 GMT
- Title: ODSmoothGrad: Generating Saliency Maps for Object Detectors
- Authors: Chul Gwon and Steven C. Howell
- Abstract summary: We present ODSmoothGrad, a tool for generating saliency maps for the classification and the bounding box parameters in object detectors.
Given the noisiness of saliency maps, we also apply the SmoothGrad algorithm to visually enhance the pixels of interest.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Techniques for generating saliency maps continue to be used for
explainability of deep learning models, with efforts primarily applied to the
image classification task. Such techniques, however, can also be applied to
object detectors, not only with the classification scores, but also for the
bounding box parameters, which are regressed values for which the relevant
pixels contributing to these parameters can be identified. In this paper, we
present ODSmoothGrad, a tool for generating saliency maps for the
classification and the bounding box parameters in object detectors. Given the
noisiness of saliency maps, we also apply the SmoothGrad algorithm to visually
enhance the pixels of interest. We demonstrate these capabilities on one-stage
and two-stage object detectors, with comparisons using classifier-based
techniques.
Related papers
- Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised
Semantic Segmentation [79.05949524349005]
We propose AuxSegNet+, a weakly supervised auxiliary learning framework to explore the rich information from saliency maps.
We also propose a cross-task affinity learning mechanism to learn pixel-level affinities from the saliency and segmentation feature maps.
arXiv Detail & Related papers (2024-03-02T10:03:21Z) - Improving the matching of deformable objects by learning to detect
keypoints [6.4587163310833855]
We propose a novel learned keypoint detection method to increase the number of correct matches for the task of non-rigid image correspondence.
We train an end-to-end convolutional neural network (CNN) to find keypoint locations that are more appropriate to the considered descriptor.
Experiments demonstrate that our method enhances the Mean Matching Accuracy of numerous descriptors when used in conjunction with our detection method.
We also apply our method on the complex real-world task object retrieval where our detector performs on par with the finest keypoint detectors currently available for this task.
arXiv Detail & Related papers (2023-09-01T13:02:19Z) - CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle
Components [77.33782775860028]
We introduce CarPatch, a novel synthetic benchmark of vehicles.
In addition to a set of images annotated with their intrinsic and extrinsic camera parameters, the corresponding depth maps and semantic segmentation masks have been generated for each view.
Global and part-based metrics have been defined and used to evaluate, compare, and better characterize some state-of-the-art techniques.
arXiv Detail & Related papers (2023-07-24T11:59:07Z) - G-CAME: Gaussian-Class Activation Mapping Explainer for Object Detectors [0.0]
G-CAME generates a saliency map as the explanation for object detection models.
We evaluated our method with YOLOX on the MS-COCO 2017 dataset and guided to apply G-CAME into the two-stage Faster-RCNN model.
arXiv Detail & Related papers (2023-06-06T04:30:18Z) - Object Delineation in Satellite Images [4.264192013842097]
This gem delivers a simple and light-weight algorithm for delineating the pixels that are marked by ML algorithms to extract geospatial objects from satellite images.
The proposed algorithm is exact and users can further apply simplification and approximation based on the application needs.
arXiv Detail & Related papers (2022-12-14T04:19:45Z) - TopoDetect: Framework for Topological Features Detection in Graph
Embeddings [1.005130974691351]
TopoDetect is a Python package that allows the user to investigate if important topological features are preserved in the embeddings of graph representation models.
The framework enables the visualization of the embeddings according to the distribution of the topological features among the nodes.
arXiv Detail & Related papers (2021-10-08T14:54:53Z) - Topological Semantic Mapping by Consolidation of Deep Visual Features [0.0]
This work introduces a topological semantic mapping method that uses deep visual features extracted by a CNN, the GoogLeNet, from 2D images captured in multiple views of the environment as the robot operates.
The experiments, performed using a real-world indoor dataset, showed that the method is able to consolidate the visual features of regions and use them to recognize objects and place categories as semantic properties.
arXiv Detail & Related papers (2021-06-24T01:10:03Z) - Ensembling object detectors for image and video data analysis [98.26061123111647]
We propose a method for ensembling the outputs of multiple object detectors for improving detection performance and precision of bounding boxes on image data.
We extend it to video data by proposing a two-stage tracking-based scheme for detection refinement.
arXiv Detail & Related papers (2021-02-09T12:38:16Z) - Synthesizing the Unseen for Zero-shot Object Detection [72.38031440014463]
We propose to synthesize visual features for unseen classes, so that the model learns both seen and unseen objects in the visual domain.
We use a novel generative model that uses class-semantics to not only generate the features but also to discriminatively separate them.
arXiv Detail & Related papers (2020-10-19T12:36:11Z) - Black-box Explanation of Object Detectors via Saliency Maps [66.745167677293]
We propose D-RISE, a method for generating visual explanations for the predictions of object detectors.
We show that D-RISE can be easily applied to different object detectors including one-stage detectors such as YOLOv3 and two-stage detectors such as Faster-RCNN.
arXiv Detail & Related papers (2020-06-05T02:13:35Z) - Refined Plane Segmentation for Cuboid-Shaped Objects by Leveraging Edge
Detection [63.942632088208505]
We propose a post-processing algorithm to align the segmented plane masks with edges detected in the image.
This allows us to increase the accuracy of state-of-the-art approaches, while limiting ourselves to cuboid-shaped objects.
arXiv Detail & Related papers (2020-03-28T18:51:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.