Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic
Distance Enhances Open World Object Detection
- URL: http://arxiv.org/abs/2306.14291v4
- Date: Thu, 15 Feb 2024 15:55:05 GMT
- Title: Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic
Distance Enhances Open World Object Detection
- Authors: Thang Doan, Xin Li, Sima Behpour, Wenbin He, Liang Gou, Liu Ren
- Abstract summary: Open World Object Detection is a challenging and realistic task.
It involves detecting both known and unknown objects.
We propose Hyp-OW, a method that learns and models hierarchical representation of known items.
- Score: 23.005760505169803
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Open World Object Detection (OWOD) is a challenging and realistic task that
extends beyond the scope of standard Object Detection task. It involves
detecting both known and unknown objects while integrating learned knowledge
for future tasks. However, the level of "unknownness" varies significantly
depending on the context. For example, a tree is typically considered part of
the background in a self-driving scene, but it may be significant in a
household context. We argue that this contextual information should already be
embedded within the known classes. In other words, there should be a semantic
or latent structure relationship between the known and unknown items to be
discovered. Motivated by this observation, we propose Hyp-OW, a method that
learns and models hierarchical representation of known items through a
SuperClass Regularizer. Leveraging this representation allows us to effectively
detect unknown objects using a similarity distance-based relabeling module.
Extensive experiments on benchmark datasets demonstrate the effectiveness of
Hyp-OW, achieving improvement in both known and unknown detection (up to 6
percent). These findings are particularly pronounced in our newly designed
benchmark, where a strong hierarchical structure exists between known and
unknown objects. Our code can be found at
https://github.com/boschresearch/Hyp-OW
Related papers
- TARO: Toward Semantically Rich Open-World Object Detection [9.59690330728612]
TARO is a novel detection framework that identifies unknown objects and classifies them into coarse parent categories within a semantic hierarchy.<n>We show TARO can categorize up to 29.9% of unknowns into meaningful coarse classes, significantly reduce confusion between unknown and known classes, and achieve competitive performance in both unknown recall and known mAP.
arXiv Detail & Related papers (2025-10-10T09:15:26Z) - O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out [3.637162892228131]
Current object detection methods rely on approximate supervision with pseudo-labels corresponding to candidate locations of objects.
We find that geometric cues improve unknown recall.
By identifying similarities between classes within a superclass, we can identify unknown classes through an odd-one-out scoring mechanism.
arXiv Detail & Related papers (2024-10-10T01:08:04Z) - OW-Rep: Open World Object Detection with Instance Representation Learning [1.8749305679160366]
Open World Object Detection (OWOD) addresses realistic scenarios where unseen object classes emerge.
We extend the OWOD framework to jointly detect unknown objects and learn semantically rich instance embeddings.
arXiv Detail & Related papers (2024-09-24T13:13:34Z) - Generative Region-Language Pretraining for Open-Ended Object Detection [55.42484781608621]
We propose a framework named GenerateU, which can detect dense objects and generate their names in a free-form way.
Our framework achieves comparable results to the open-vocabulary object detection method GLIP.
arXiv Detail & Related papers (2024-03-15T10:52:39Z) - Semi-supervised Open-World Object Detection [74.95267079505145]
We introduce a more realistic formulation, named semi-supervised open-world detection (SS-OWOD)
We demonstrate that the performance of the state-of-the-art OWOD detector dramatically deteriorates in the proposed SS-OWOD setting.
Our experiments on 4 datasets including MS COCO, PASCAL, Objects365 and DOTA demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2024-02-25T07:12:51Z) - Unsupervised Recognition of Unknown Objects for Open-World Object
Detection [28.787586991713535]
Open-World Object Detection (OWOD) extends object detection problem to a realistic and dynamic scenario.
Current OWOD models, such as ORE and OW-DETR, focus on pseudo-labeling regions with high objectness scores as unknowns.
This paper proposes a novel approach that learns an unsupervised discriminative model to recognize true unknown objects.
arXiv Detail & Related papers (2023-08-31T08:17:29Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Detecting the open-world objects with the help of the Brain [20.00772846521719]
Open World Object Detection (OWOD) is a novel computer vision task with a considerable challenge.
OWOD algorithms are expected to detect unseen/unknown objects and incrementally learn them.
We propose leveraging the VL as the Brain'' of the open-world detector by simply generating unknown labels.
arXiv Detail & Related papers (2023-03-21T06:44:02Z) - Open World DETR: Transformer based Open World Object Detection [60.64535309016623]
We propose a two-stage training approach named Open World DETR for open world object detection based on Deformable DETR.
We fine-tune the class-specific components of the model with a multi-view self-labeling strategy and a consistency constraint.
Our proposed method outperforms other state-of-the-art open world object detection methods by a large margin.
arXiv Detail & Related papers (2022-12-06T13:39:30Z) - More Practical Scenario of Open-set Object Detection: Open at Category
Level and Closed at Super-category Level [23.98839374194848]
Open-set object detection (OSOD) has recently attracted considerable attention.
We first point out that the scenario of OSOD considered in recent studies, which considers an unlimited variety of unknown objects, has a fundamental issue.
This issue leads to difficulty with the evaluation of methods' performance on unknown object detection.
arXiv Detail & Related papers (2022-07-20T09:28:51Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - OW-DETR: Open-world Detection Transformer [90.56239673123804]
We introduce a novel end-to-end transformer-based framework, OW-DETR, for open-world object detection.
OW-DETR comprises three dedicated components namely, attention-driven pseudo-labeling, novelty classification and objectness scoring.
Our model outperforms the recently introduced OWOD approach, ORE, with absolute gains ranging from 1.8% to 3.3% in terms of unknown recall.
arXiv Detail & Related papers (2021-12-02T18:58:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.