Robust Object Detection via Instance-Level Temporal Cycle Confusion
- URL: http://arxiv.org/abs/2104.08381v1
- Date: Fri, 16 Apr 2021 21:35:08 GMT
- Title: Robust Object Detection via Instance-Level Temporal Cycle Confusion
- Authors: Xin Wang, Thomas E. Huang, Benlin Liu, Fisher Yu, Xiaolong Wang,
Joseph E. Gonzalez, Trevor Darrell
- Abstract summary: We study the effectiveness of auxiliary self-supervised tasks to improve the out-of-distribution generalization of object detectors.
Inspired by the principle of maximum entropy, we introduce a novel self-supervised task, instance-level temporal cycle confusion (CycConf)
For each object, the task is to find the most different object proposals in the adjacent frame in a video and then cycle back to itself for self-supervision.
- Score: 89.1027433760578
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Building reliable object detectors that are robust to domain shifts, such as
various changes in context, viewpoint, and object appearances, is critical for
real-world applications. In this work, we study the effectiveness of auxiliary
self-supervised tasks to improve the out-of-distribution generalization of
object detectors. Inspired by the principle of maximum entropy, we introduce a
novel self-supervised task, instance-level temporal cycle confusion (CycConf),
which operates on the region features of the object detectors. For each object,
the task is to find the most different object proposals in the adjacent frame
in a video and then cycle back to itself for self-supervision. CycConf
encourages the object detector to explore invariant structures across instances
under various motions, which leads to improved model robustness in unseen
domains at test time. We observe consistent out-of-domain performance
improvements when training object detectors in tandem with self-supervised
tasks on large-scale video datasets (BDD100K and Waymo open data). The joint
training framework also establishes a new state-of-the-art on standard
unsupervised domain adaptative detection benchmarks (Cityscapes, Foggy
Cityscapes, and Sim10K). The project page is available at
https://xinw.ai/cyc-conf.
Related papers
- Tracking Transforming Objects: A Benchmark [2.53045657890708]
This study collects a novel dedicated dataset for Tracking Transforming Objects, called DTTO, which contains 100 sequences, amounting to approximately 9.3K frames.
We provide carefully hand-annotated bounding boxes for each frame within these sequences, making DTTO the pioneering benchmark dedicated to tracking transforming objects.
We thoroughly evaluate 20 state-of-the-art trackers on the benchmark, aiming to comprehend the performance of existing methods and provide a comparison for future research on DTTO.
arXiv Detail & Related papers (2024-04-28T11:24:32Z) - Detecting Every Object from Events [24.58024539462497]
We propose Detecting Every Object in Events (DEOE), an approach tailored for achieving high-speed, class-agnostic open-world object detection in event-based vision.
Our code is available at https://github.com/Hatins/DEOE.
arXiv Detail & Related papers (2024-04-08T08:20:53Z) - GOOD: Towards Domain Generalized Orientated Object Detection [39.76969237020444]
Oriented object detection has been rapidly developed in the past few years, but most of these methods assume the training and testing images are under the same statistical distribution.
We propose the task of domain generalized oriented object detection, which intends to explore the generalization of oriented object detectors on arbitrary unseen target domains.
arXiv Detail & Related papers (2024-02-20T07:12:22Z) - Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector [72.05791402494727]
This paper studies the challenging cross-domain few-shot object detection (CD-FSOD)
It aims to develop an accurate object detector for novel domains with minimal labeled examples.
arXiv Detail & Related papers (2024-02-05T15:25:32Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Towards Building Self-Aware Object Detectors via Reliable Uncertainty
Quantification and Calibration [17.461451218469062]
In this work, we introduce the Self-Aware Object Detection (SAOD) task.
The SAOD task respects and adheres to the challenges that object detectors face in safety-critical environments such as autonomous driving.
We extensively use our framework, which introduces novel metrics and large scale test datasets, to test numerous object detectors.
arXiv Detail & Related papers (2023-07-03T11:16:39Z) - MOST: Multiple Object localization with Self-supervised Transformers for
object discovery [97.47075050779085]
We present Multiple Object localization with Self-supervised Transformers (MOST)
MOST uses features of transformers trained using self-supervised learning to localize multiple objects in real world images.
We show MOST can be used for self-supervised pre-training of object detectors, and yields consistent improvements on fully, semi-supervised object detection and unsupervised region proposal generation.
arXiv Detail & Related papers (2023-04-11T17:57:27Z) - Robust Change Detection Based on Neural Descriptor Fields [53.111397800478294]
We develop an object-level online change detection approach that is robust to partially overlapping observations and noisy localization results.
By associating objects via shape code similarity and comparing local object-neighbor spatial layout, our proposed approach demonstrates robustness to low observation overlap and localization noises.
arXiv Detail & Related papers (2022-08-01T17:45:36Z) - Interactron: Embodied Adaptive Object Detection [18.644357684104662]
We propose Interactron, a method for adaptive object detection in an interactive setting.
Our idea is to continue training during inference and adapt the model at test time without any explicit supervision via interacting with the environment.
arXiv Detail & Related papers (2022-02-01T18:56:14Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.