Omni-DETR: Omni-Supervised Object Detection with Transformers
- URL: http://arxiv.org/abs/2203.16089v1
- Date: Wed, 30 Mar 2022 06:36:09 GMT
- Title: Omni-DETR: Omni-Supervised Object Detection with Transformers
- Authors: Pei Wang, Zhaowei Cai, Hao Yang, Gurumurthy Swaminathan, Nuno
Vasconcelos, Bernt Schiele, Stefano Soatto
- Abstract summary: We consider the problem of omni-supervised object detection, which can use unlabeled, fully labeled and weakly labeled annotations.
Under this unified architecture, different types of weak labels can be leveraged to generate accurate pseudo labels.
We have found that weak annotations can help to improve detection performance and a mixture of them can achieve a better trade-off between annotation cost and accuracy.
- Score: 165.4190908259015
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the problem of omni-supervised object detection, which can use
unlabeled, fully labeled and weakly labeled annotations, such as image tags,
counts, points, etc., for object detection. This is enabled by a unified
architecture, Omni-DETR, based on the recent progress on student-teacher
framework and end-to-end transformer based object detection. Under this unified
architecture, different types of weak labels can be leveraged to generate
accurate pseudo labels, by a bipartite matching based filtering mechanism, for
the model to learn. In the experiments, Omni-DETR has achieved state-of-the-art
results on multiple datasets and settings. And we have found that weak
annotations can help to improve detection performance and a mixture of them can
achieve a better trade-off between annotation cost and accuracy than the
standard complete annotation. These findings could encourage larger object
detection datasets with mixture annotations. The code is available at
https://github.com/amazon-research/omni-detr.
Related papers
- Bayesian Detector Combination for Object Detection with Crowdsourced Annotations [49.43709660948812]
Acquiring fine-grained object detection annotations in unconstrained images is time-consuming, expensive, and prone to noise.
We propose a novel Bayesian Detector Combination (BDC) framework to more effectively train object detectors with noisy crowdsourced annotations.
BDC is model-agnostic, requires no prior knowledge of the annotators' skill level, and seamlessly integrates with existing object detection models.
arXiv Detail & Related papers (2024-07-10T18:00:54Z) - Object-Centric Multiple Object Tracking [124.30650395969126]
This paper proposes a video object-centric model for multiple-object tracking pipelines.
It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module.
Benefited from object-centric learning, we only require sparse detection labels for object localization and feature binding.
arXiv Detail & Related papers (2023-09-01T03:34:12Z) - ComplETR: Reducing the cost of annotations for object detection in dense
scenes with vision transformers [73.29057814695459]
ComplETR is designed to explicitly complete missing annotations in partially annotated dense scene datasets.
This reduces the need to annotate every object instance in the scene thereby reducing annotation cost.
We show performance improvement for several popular detectors such as Faster R-CNN, Cascade R-CNN, CenterNet2, and Deformable DETR.
arXiv Detail & Related papers (2022-09-13T00:11:16Z) - Robust Object Detection With Inaccurate Bounding Boxes [27.664730859319707]
Learning accurate object detectors often requires large-scale training data with precise object bounding boxes.
In this work, we aim to address the challenge of learning robust object detectors with inaccurate bounding boxes.
By treating an object as a bag of instances, we introduce an Object-Aware Multiple Instance Learning approach.
arXiv Detail & Related papers (2022-07-20T06:57:30Z) - Cross-Supervised Object Detection [42.783400918552765]
We show how to build better object detectors from weakly labeled images of new categories by leveraging knowledge learned from fully labeled base categories.
We propose a unified framework that combines a detection head trained from instance-level annotations and a recognition head learned from image-level annotations.
arXiv Detail & Related papers (2020-06-26T15:33:48Z) - Weakly-Supervised Salient Object Detection via Scribble Annotations [54.40518383782725]
We propose a weakly-supervised salient object detection model to learn saliency from scribble labels.
We present a new metric, termed saliency structure measure, to measure the structure alignment of the predicted saliency maps.
Our method not only outperforms existing weakly-supervised/unsupervised methods, but also is on par with several fully-supervised state-of-the-art models.
arXiv Detail & Related papers (2020-03-17T12:59:50Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.