Objects in Semantic Topology
- URL: http://arxiv.org/abs/2110.02687v1
- Date: Wed, 6 Oct 2021 12:15:30 GMT
- Title: Objects in Semantic Topology
- Authors: Shuo Yang, Peize Sun, Yi Jiang, Xiaobo Xia, Ruiheng Zhang, Zehuan
Yuan, Changhu Wang, Ping Luo, Min Xu
- Abstract summary: A qualified open-world object detector can not only identify objects of known categories, but also discover unknown objects.
We provide a unified perspective: Semantic Topology.
Experiments demonstrate that semantic topology, either randomly-generated or derived from a well-trained language model, could outperform the current state-of-the-art open-world object detectors.
- Score: 36.297624587122506
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A more realistic object detection paradigm, Open-World Object Detection, has
arisen increasing research interests in the community recently. A qualified
open-world object detector can not only identify objects of known categories,
but also discover unknown objects, and incrementally learn to categorize them
when their annotations progressively arrive. Previous works rely on independent
modules to recognize unknown categories and perform incremental learning,
respectively. In this paper, we provide a unified perspective: Semantic
Topology. During the life-long learning of an open-world object detector, all
object instances from the same category are assigned to their corresponding
pre-defined node in the semantic topology, including the `unknown' category.
This constraint builds up discriminative feature representations and consistent
relationships among objects, thus enabling the detector to distinguish unknown
objects out of the known categories, as well as making learned features of
known objects undistorted when learning new categories incrementally. Extensive
experiments demonstrate that semantic topology, either randomly-generated or
derived from a well-trained language model, could outperform the current
state-of-the-art open-world object detectors by a large margin, e.g., the
absolute open-set error is reduced from 7832 to 2546, exhibiting the inherent
superiority of semantic topology on open-world object detection.
Related papers
- O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out [3.637162892228131]
Current object detection methods rely on approximate supervision with pseudo-labels corresponding to candidate locations of objects.
We find that geometric cues improve unknown recall.
By identifying similarities between classes within a superclass, we can identify unknown classes through an odd-one-out scoring mechanism.
arXiv Detail & Related papers (2024-10-10T01:08:04Z) - Open-World Object Detection with Instance Representation Learning [1.8749305679160366]
We propose a method to train an object detector that can both detect novel objects and extract semantically rich features in open-world conditions.
Our method learns a robust and generalizable feature space, outperforming other OWOD-based feature extraction methods.
arXiv Detail & Related papers (2024-09-24T13:13:34Z) - Unseen Object Reasoning with Shared Appearance Cues [1.9610132419137964]
This paper introduces an innovative approach to open world recognition (OWR)
We leverage knowledge acquired from known objects to address the recognition of previously unseen objects.
arXiv Detail & Related papers (2024-06-21T18:04:13Z) - Generative Region-Language Pretraining for Open-Ended Object Detection [55.42484781608621]
We propose a framework named GenerateU, which can detect dense objects and generate their names in a free-form way.
Our framework achieves comparable results to the open-vocabulary object detection method GLIP.
arXiv Detail & Related papers (2024-03-15T10:52:39Z) - Open World DETR: Transformer based Open World Object Detection [60.64535309016623]
We propose a two-stage training approach named Open World DETR for open world object detection based on Deformable DETR.
We fine-tune the class-specific components of the model with a multi-view self-labeling strategy and a consistency constraint.
Our proposed method outperforms other state-of-the-art open world object detection methods by a large margin.
arXiv Detail & Related papers (2022-12-06T13:39:30Z) - Towards Open-Set Object Detection and Discovery [38.81806249664884]
We present a new task, namely Open-Set Object Detection and Discovery (OSODD)
We propose a two-stage method that first uses an open-set object detector to predict both known and unknown objects.
Then, we study the representation of predicted objects in an unsupervised manner and discover new categories from the set of unknown objects.
arXiv Detail & Related papers (2022-04-12T08:07:01Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - Learning Open-World Object Proposals without Learning to Classify [110.30191531975804]
We propose a classification-free Object Localization Network (OLN) which estimates the objectness of each region purely by how well the location and shape of a region overlaps with any ground-truth object.
This simple strategy learns generalizable objectness and outperforms existing proposals on cross-category generalization.
arXiv Detail & Related papers (2021-08-15T14:36:02Z) - Towards Open World Object Detection [68.79678648726416]
ORE: Open World Object Detector is based on contrastive clustering and energy based unknown identification.
We find that identifying and characterizing unknown instances helps to reduce confusion in an incremental object detection setting.
arXiv Detail & Related papers (2021-03-03T18:58:18Z) - Synthesizing the Unseen for Zero-shot Object Detection [72.38031440014463]
We propose to synthesize visual features for unseen classes, so that the model learns both seen and unseen objects in the visual domain.
We use a novel generative model that uses class-semantics to not only generate the features but also to discriminatively separate them.
arXiv Detail & Related papers (2020-10-19T12:36:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.