Class-agnostic Object Detection
- URL: http://arxiv.org/abs/2011.14204v1
- Date: Sat, 28 Nov 2020 19:22:38 GMT
- Title: Class-agnostic Object Detection
- Authors: Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Premkumar Natarajan
- Abstract summary: We propose class-agnostic object detection as a new problem that focuses on detecting objects irrespective of their object-classes.
Specifically, the goal is to predict bounding boxes for all objects in an image but not their object-classes.
We propose training and evaluation protocols for benchmarking class-agnostic detectors to advance future research in this domain.
- Score: 16.97782147401037
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection models perform well at localizing and classifying objects
that they are shown during training. However, due to the difficulty and cost
associated with creating and annotating detection datasets, trained models
detect a limited number of object types with unknown objects treated as
background content. This hinders the adoption of conventional detectors in
real-world applications like large-scale object matching, visual grounding,
visual relation prediction, obstacle detection (where it is more important to
determine the presence and location of objects than to find specific types),
etc. We propose class-agnostic object detection as a new problem that focuses
on detecting objects irrespective of their object-classes. Specifically, the
goal is to predict bounding boxes for all objects in an image but not their
object-classes. The predicted boxes can then be consumed by another system to
perform application-specific classification, retrieval, etc. We propose
training and evaluation protocols for benchmarking class-agnostic detectors to
advance future research in this domain. Finally, we propose (1) baseline
methods and (2) a new adversarial learning framework for class-agnostic
detection that forces the model to exclude class-specific information from
features used for predictions. Experimental results show that adversarial
learning improves class-agnostic detection efficacy.
Related papers
- Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation [58.37525311718006]
We put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD)
We propose CastDet, a CLIP-activated student-teacher detection framework that serves as the first OVAD detector specifically designed for the challenging aerial scenario.
Our framework integrates a robust localization teacher along with several box selection strategies to generate high-quality proposals for novel objects.
arXiv Detail & Related papers (2024-11-04T12:59:13Z) - Open World DETR: Transformer based Open World Object Detection [60.64535309016623]
We propose a two-stage training approach named Open World DETR for open world object detection based on Deformable DETR.
We fine-tune the class-specific components of the model with a multi-view self-labeling strategy and a consistency constraint.
Our proposed method outperforms other state-of-the-art open world object detection methods by a large margin.
arXiv Detail & Related papers (2022-12-06T13:39:30Z) - Towards Open-Set Object Detection and Discovery [38.81806249664884]
We present a new task, namely Open-Set Object Detection and Discovery (OSODD)
We propose a two-stage method that first uses an open-set object detector to predict both known and unknown objects.
Then, we study the representation of predicted objects in an unsupervised manner and discover new categories from the set of unknown objects.
arXiv Detail & Related papers (2022-04-12T08:07:01Z) - Few-Shot Object Detection: A Survey [4.266990593059534]
Few-shot object detection aims to learn from few object instances of new categories in the target domain.
We categorize approaches according to their training scheme and architectural layout.
We introduce commonly used datasets and their evaluation protocols and analyze reported benchmark results.
arXiv Detail & Related papers (2021-12-22T07:08:53Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - A Survey of Self-Supervised and Few-Shot Object Detection [19.647681501581225]
Self-supervised methods aim at learning representations from unlabeled data which transfer well to downstream tasks such as object detection.
Few-shot object detection is about training a model on novel (unseen) object classes with little data.
In this survey, we review and characterize the most recent approaches on few-shot and self-supervised object detection.
arXiv Detail & Related papers (2021-10-27T18:55:47Z) - Unsupervised Domain Adaption of Object Detectors: A Survey [87.08473838767235]
Recent advances in deep learning have led to the development of accurate and efficient models for various computer vision applications.
Learning highly accurate models relies on the availability of datasets with a large number of annotated images.
Due to this, model performance drops drastically when evaluated on label-scarce datasets having visually distinct images.
arXiv Detail & Related papers (2021-05-27T23:34:06Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - Synthesizing the Unseen for Zero-shot Object Detection [72.38031440014463]
We propose to synthesize visual features for unseen classes, so that the model learns both seen and unseen objects in the visual domain.
We use a novel generative model that uses class-semantics to not only generate the features but also to discriminatively separate them.
arXiv Detail & Related papers (2020-10-19T12:36:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.