TALISMAN: Targeted Active Learning for Object Detection with Rare
Classes and Slices using Submodular Mutual Information
- URL: http://arxiv.org/abs/2112.00166v1
- Date: Tue, 30 Nov 2021 23:17:53 GMT
- Title: TALISMAN: Targeted Active Learning for Object Detection with Rare
Classes and Slices using Submodular Mutual Information
- Authors: Suraj Kothawade, Saikat Ghosh, Sumit Shekhar, Yu Xiang, Rishabh Iyer
- Abstract summary: We propose a novel framework for Targeted Active Learning or object detectIon with rare slices.
Our method uses the submodular mutual information functions instantiated using features of the region of interest.
We evaluate our framework on the standard PASCAL VOC07+12 and BDD100K, a real-world self-driving dataset.
- Score: 16.34454526943999
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep neural networks based object detectors have shown great success in a
variety of domains like autonomous vehicles, biomedical imaging, etc. It is
known that their success depends on a large amount of data from the domain of
interest. While deep models often perform well in terms of overall accuracy,
they often struggle in performance on rare yet critical data slices. For
example, data slices like "motorcycle at night" or "bicycle at night" are often
rare but very critical slices for self-driving applications and false negatives
on such rare slices could result in ill-fated failures and accidents. Active
learning (AL) is a well-known paradigm to incrementally and adaptively build
training datasets with a human in the loop. However, current AL based
acquisition functions are not well-equipped to tackle real-world datasets with
rare slices, since they are based on uncertainty scores or global descriptors
of the image. We propose TALISMAN, a novel framework for Targeted Active
Learning or object detectIon with rare slices using Submodular MutuAl
iNformation. Our method uses the submodular mutual information functions
instantiated using features of the region of interest (RoI) to efficiently
target and acquire data points with rare slices. We evaluate our framework on
the standard PASCAL VOC07+12 and BDD100K, a real-world self-driving dataset. We
observe that TALISMAN outperforms other methods by in terms of average
precision on rare slices, and in terms of mAP.
Related papers
- STREAMLINE: Streaming Active Learning for Realistic Multi-Distributional
Settings [2.580765958706854]
STREAMLINE is a novel streaming active learning framework that mitigates scenario-driven slice imbalance in working labeled data.
We evaluate STREAMLINE on real-world streaming scenarios for image classification and object detection tasks.
arXiv Detail & Related papers (2023-05-18T02:01:45Z) - Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for
Autonomous Driving [91.39625612027386]
We propose a novel task, called generalized few-shot 3D object detection, where we have a large amount of training data for common (base) objects, but only a few data for rare (novel) classes.
Specifically, we analyze in-depth differences between images and point clouds, and then present a practical principle for the few-shot setting in the 3D LiDAR dataset.
To solve this task, we propose an incremental fine-tuning method to extend existing 3D detection models to recognize both common and rare objects.
arXiv Detail & Related papers (2023-02-08T07:11:36Z) - Mitigating Representation Bias in Action Recognition: Algorithms and
Benchmarks [76.35271072704384]
Deep learning models perform poorly when applied to videos with rare scenes or objects.
We tackle this problem from two different angles: algorithm and dataset.
We show that the debiased representation can generalize better when transferred to other datasets and tasks.
arXiv Detail & Related papers (2022-09-20T00:30:35Z) - Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based
Action Recognition [88.34182299496074]
Action labels are only available on a source dataset, but unavailable on a target dataset in the training stage.
We utilize a self-supervision scheme to reduce the domain shift between two skeleton-based action datasets.
By segmenting and permuting temporal segments or human body parts, we design two self-supervised learning classification tasks.
arXiv Detail & Related papers (2022-07-17T07:05:39Z) - Active Data Discovery: Mining Unknown Data using Submodular Information
Measures [1.7491858164568674]
We provide an active data discovery framework which can mine unknown data slices and classes efficiently.
We show significant accuracy and labeling efficiency gains with our approach compared to existing state-of-the-art active learning approaches.
arXiv Detail & Related papers (2022-06-17T05:52:18Z) - CARLA-GeAR: a Dataset Generator for a Systematic Evaluation of
Adversarial Robustness of Vision Models [61.68061613161187]
This paper presents CARLA-GeAR, a tool for the automatic generation of synthetic datasets for evaluating the robustness of neural models against physical adversarial patches.
The tool is built on the CARLA simulator, using its Python API, and allows the generation of datasets for several vision tasks in the context of autonomous driving.
The paper presents an experimental study to evaluate the performance of some defense methods against such attacks, showing how the datasets generated with CARLA-GeAR might be used in future work as a benchmark for adversarial defense in the real world.
arXiv Detail & Related papers (2022-06-09T09:17:38Z) - Diverse Complexity Measures for Dataset Curation in Self-driving [80.55417232642124]
We propose a new data selection method that exploits a diverse set of criteria that quantize interestingness of traffic scenes.
Our experiments show that the proposed curation pipeline is able to select datasets that lead to better generalization and higher performance.
arXiv Detail & Related papers (2021-01-16T23:45:02Z) - Stance Detection Benchmark: How Robust Is Your Stance Detection? [65.91772010586605]
Stance Detection (StD) aims to detect an author's stance towards a certain topic or claim.
We introduce a StD benchmark that learns from ten StD datasets of various domains in a multi-dataset learning setting.
Within this benchmark setup, we are able to present new state-of-the-art results on five of the datasets.
arXiv Detail & Related papers (2020-01-06T13:37:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.