Object Discovery via Contrastive Learning for Weakly Supervised Object
Detection
- URL: http://arxiv.org/abs/2208.07576v1
- Date: Tue, 16 Aug 2022 07:36:12 GMT
- Title: Object Discovery via Contrastive Learning for Weakly Supervised Object
Detection
- Authors: Jinhwan Seo, Wonho Bae, Danica J. Sutherland, Junhyug Noh, Daijin Kim
- Abstract summary: Weakly Supervised Object Detection is a task that detects objects in an image using a model trained only on image-level annotations.
We propose a novel multiple instance labeling method called object discovery.
We achieve new state-of-the-art results on MS-COCO 2014 and 2017 as well as PASCAL VOC 2012, and competitive results on PASCAL VOC 2007.
- Score: 12.822548027334328
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly Supervised Object Detection (WSOD) is a task that detects objects in
an image using a model trained only on image-level annotations. Current
state-of-the-art models benefit from self-supervised instance-level
supervision, but since weak supervision does not include count or location
information, the most common ``argmax'' labeling method often ignores many
instances of objects. To alleviate this issue, we propose a novel multiple
instance labeling method called object discovery. We further introduce a new
contrastive loss under weak supervision where no instance-level information is
available for sampling, called weakly supervised contrastive loss (WSCL). WSCL
aims to construct a credible similarity threshold for object discovery by
leveraging consistent features for embedding vectors in the same class. As a
result, we achieve new state-of-the-art results on MS-COCO 2014 and 2017 as
well as PASCAL VOC 2012, and competitive results on PASCAL VOC 2007.
Related papers
- Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation [58.37525311718006]
We put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD)
We propose CastDet, a CLIP-activated student-teacher detection framework that serves as the first OVAD detector specifically designed for the challenging aerial scenario.
Our framework integrates a robust localization teacher along with several box selection strategies to generate high-quality proposals for novel objects.
arXiv Detail & Related papers (2024-11-04T12:59:13Z) - Object-Centric Multiple Object Tracking [124.30650395969126]
This paper proposes a video object-centric model for multiple-object tracking pipelines.
It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module.
Benefited from object-centric learning, we only require sparse detection labels for object localization and feature binding.
arXiv Detail & Related papers (2023-09-01T03:34:12Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Learning to Detect Instance-level Salient Objects Using Complementary
Image Labels [55.049347205603304]
We present the first weakly-supervised approach to the salient instance detection problem.
We propose a novel weakly-supervised network with three branches: a Saliency Detection Branch leveraging class consistency information to locate candidate objects; a Boundary Detection Branch exploiting class discrepancy information to delineate object boundaries; and a Centroid Detection Branch using subitizing information to detect salient instance centroids.
arXiv Detail & Related papers (2021-11-19T10:15:22Z) - Synthesizing the Unseen for Zero-shot Object Detection [72.38031440014463]
We propose to synthesize visual features for unseen classes, so that the model learns both seen and unseen objects in the visual domain.
We use a novel generative model that uses class-semantics to not only generate the features but also to discriminatively separate them.
arXiv Detail & Related papers (2020-10-19T12:36:11Z) - Cross-Supervised Object Detection [42.783400918552765]
We show how to build better object detectors from weakly labeled images of new categories by leveraging knowledge learned from fully labeled base categories.
We propose a unified framework that combines a detection head trained from instance-level annotations and a recognition head learned from image-level annotations.
arXiv Detail & Related papers (2020-06-26T15:33:48Z) - UniT: Unified Knowledge Transfer for Any-shot Object Detection and
Segmentation [52.487469544343305]
Methods for object detection and segmentation rely on large scale instance-level annotations for training.
We propose an intuitive and unified semi-supervised model that is applicable to a range of supervision.
arXiv Detail & Related papers (2020-06-12T22:45:47Z) - Distilling Knowledge from Refinement in Multiple Instance Detection
Networks [0.0]
Weakly supervised object detection (WSOD) aims to tackle the object detection problem using only labeled image categories as supervision.
We present an adaptive supervision aggregation function that dynamically changes the aggregation criteria for selecting boxes related to one of the ground-truth classes, background, or even ignored during the generation of each refinement module supervision.
arXiv Detail & Related papers (2020-04-23T02:49:40Z) - Object Instance Mining for Weakly Supervised Object Detection [24.021995037282394]
This paper introduces an end-to-end object instance mining (OIM) framework for weakly supervised object detection.
OIM attempts to detect all possible object instances existing in each image by introducing information propagation on the spatial and appearance graphs.
During the iterative learning process, the less discriminative object instances from the same class can be gradually detected and utilized for training.
arXiv Detail & Related papers (2020-02-04T02:11:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.