Classification Committee for Active Deep Object Detection
- URL: http://arxiv.org/abs/2308.08476v2
- Date: Tue, 22 Aug 2023 07:52:58 GMT
- Title: Classification Committee for Active Deep Object Detection
- Authors: Lei Zhao, Bo Li, Xingxing Wei
- Abstract summary: We propose a classification committee for active deep object detection method.
The committee selects the most informative images according to their uncertainty values.
We show that our method outperforms the state-of-the-art active learning methods.
- Score: 24.74839931613233
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In object detection, the cost of labeling is much high because it needs not
only to confirm the categories of multiple objects in an image but also to
accurately determine the bounding boxes of each object. Thus, integrating
active learning into object detection will raise pretty positive significance.
In this paper, we propose a classification committee for active deep object
detection method by introducing a discrepancy mechanism of multiple classifiers
for samples' selection when training object detectors. The model contains a
main detector and a classification committee. The main detector denotes the
target object detector trained from a labeled pool composed of the selected
informative images. The role of the classification committee is to select the
most informative images according to their uncertainty values from the view of
classification, which is expected to focus more on the discrepancy and
representative of instances. Specifically, they compute the uncertainty for a
specified instance within the image by measuring its discrepancy output by the
committee pre-trained via the proposed Maximum Classifiers Discrepancy Group
Loss (MCDGL). The most informative images are finally determined by selecting
the ones with many high-uncertainty instances. Besides, to mitigate the impact
of interference instances, we design a Focus on Positive Instances Loss (FPIL)
to make the committee the ability to automatically focus on the representative
instances as well as precisely encode their discrepancies for the same
instance. Experiments are conducted on Pascal VOC and COCO datasets versus some
popular object detectors. And results show that our method outperforms the
state-of-the-art active learning methods, which verifies the effectiveness of
the proposed method.
Related papers
- Few-Shot Object Detection with Sparse Context Transformers [37.106378859592965]
Few-shot detection is a major task in pattern recognition which seeks to localize objects using models trained with few labeled data.
We propose a novel sparse context transformer (SCT) that effectively leverages object knowledge in the source domain, and automatically learns a sparse context from only few training images in the target domain.
We evaluate the proposed method on two challenging few-shot object detection benchmarks, and empirical results show that the proposed method obtains competitive performance compared to the related state-of-the-art.
arXiv Detail & Related papers (2024-02-14T17:10:01Z) - Active Learning for Object Detection with Non-Redundant Informative
Sampling [1.7602289331729377]
Our strategy integrates uncertainty and diversity-based selection principles into a joint selection objective.
Our proposed NORIS algorithm quantifies the impact of training with a sample on the informativeness of other similar samples.
Our strategy achieves a 20% and 30% reduction in labeling costs compared to random selection for PASCAL-VOC and KITTI.
arXiv Detail & Related papers (2023-07-17T11:55:20Z) - The Overlooked Classifier in Human-Object Interaction Recognition [82.20671129356037]
We encode the semantic correlation among classes into the classification head by initializing the weights with language embeddings of HOIs.
We propose a new loss named LSE-Sign to enhance multi-label learning on a long-tailed dataset.
Our simple yet effective method enables detection-free HOI classification, outperforming the state-of-the-arts that require object detection and human pose by a clear margin.
arXiv Detail & Related papers (2022-03-10T23:35:00Z) - Robust Region Feature Synthesizer for Zero-Shot Object Detection [87.79902339984142]
We build a novel zero-shot object detection framework that contains an Intra-class Semantic Diverging component and an Inter-class Structure Preserving component.
It is the first study to carry out zero-shot object detection in remote sensing imagery.
arXiv Detail & Related papers (2022-01-01T03:09:15Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - Learning to Detect Instance-level Salient Objects Using Complementary
Image Labels [55.049347205603304]
We present the first weakly-supervised approach to the salient instance detection problem.
We propose a novel weakly-supervised network with three branches: a Saliency Detection Branch leveraging class consistency information to locate candidate objects; a Boundary Detection Branch exploiting class discrepancy information to delineate object boundaries; and a Centroid Detection Branch using subitizing information to detect salient instance centroids.
arXiv Detail & Related papers (2021-11-19T10:15:22Z) - Instance Localization for Self-supervised Detection Pretraining [68.24102560821623]
We propose a new self-supervised pretext task, called instance localization.
We show that integration of bounding boxes into pretraining promotes better task alignment and architecture alignment for transfer learning.
Experimental results demonstrate that our approach yields state-of-the-art transfer learning results for object detection.
arXiv Detail & Related papers (2021-02-16T17:58:57Z) - Class-agnostic Object Detection [16.97782147401037]
We propose class-agnostic object detection as a new problem that focuses on detecting objects irrespective of their object-classes.
Specifically, the goal is to predict bounding boxes for all objects in an image but not their object-classes.
We propose training and evaluation protocols for benchmarking class-agnostic detectors to advance future research in this domain.
arXiv Detail & Related papers (2020-11-28T19:22:38Z) - A Self-Training Approach for Point-Supervised Object Detection and
Counting in Crowds [54.73161039445703]
We propose a novel self-training approach that enables a typical object detector trained only with point-level annotations.
During training, we utilize the available point annotations to supervise the estimation of the center points of objects.
Experimental results show that our approach significantly outperforms state-of-the-art point-supervised methods under both detection and counting tasks.
arXiv Detail & Related papers (2020-07-25T02:14:42Z) - Are there any 'object detectors' in the hidden layers of CNNs trained to
identify objects or scenes? [5.718442081858377]
We compare various measures on a large set of units in AlexNet.
We find that the different measures provide different estimates of object selectivity.
We fail to find any units that are even remotely as selective as the 'grandmother cell' units reported in recurrent neural networks.
arXiv Detail & Related papers (2020-07-02T12:33:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.