Object Instance Mining for Weakly Supervised Object Detection
- URL: http://arxiv.org/abs/2002.01087v1
- Date: Tue, 4 Feb 2020 02:11:39 GMT
- Title: Object Instance Mining for Weakly Supervised Object Detection
- Authors: Chenhao Lin, Siwen Wang, Dongqi Xu, Yu Lu, Wayne Zhang
- Abstract summary: This paper introduces an end-to-end object instance mining (OIM) framework for weakly supervised object detection.
OIM attempts to detect all possible object instances existing in each image by introducing information propagation on the spatial and appearance graphs.
During the iterative learning process, the less discriminative object instances from the same class can be gradually detected and utilized for training.
- Score: 24.021995037282394
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly supervised object detection (WSOD) using only image-level annotations
has attracted growing attention over the past few years. Existing approaches
using multiple instance learning easily fall into local optima, because such
mechanism tends to learn from the most discriminative object in an image for
each category. Therefore, these methods suffer from missing object instances
which degrade the performance of WSOD. To address this problem, this paper
introduces an end-to-end object instance mining (OIM) framework for weakly
supervised object detection. OIM attempts to detect all possible object
instances existing in each image by introducing information propagation on the
spatial and appearance graphs, without any additional annotations. During the
iterative learning process, the less discriminative object instances from the
same class can be gradually detected and utilized for training. In addition, we
design an object instance reweighted loss to learn larger portion of each
object instance to further improve the performance. The experimental results on
two publicly available databases, VOC 2007 and 2012, demonstrate the efficacy
of proposed approach.
Related papers
- Improving Object Detection via Local-global Contrastive Learning [27.660633883387753]
We present a novel image-to-image translation method that specifically targets cross-domain object detection.
We learn to represent objects by contrasting local-global information.
This affords investigation of an under-explored challenge: obtaining performant detection, under domain shifts.
arXiv Detail & Related papers (2024-10-07T14:18:32Z) - Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images [11.217630579076237]
Few-shot object detection (FSOD) has garnered significant research attention in the field of remote sensing.
We propose a novel FSOD method for remote sensing images called Few-shot Oriented object detection with Memorable Contrastive learning (FOMC)
Specifically, we employ oriented bounding boxes instead of traditional horizontal bounding boxes to learn a better feature representation for arbitrary-oriented aerial objects.
arXiv Detail & Related papers (2024-03-20T08:15:18Z) - Object-Centric Multiple Object Tracking [124.30650395969126]
This paper proposes a video object-centric model for multiple-object tracking pipelines.
It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module.
Benefited from object-centric learning, we only require sparse detection labels for object localization and feature binding.
arXiv Detail & Related papers (2023-09-01T03:34:12Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Image Segmentation-based Unsupervised Multiple Objects Discovery [1.7674345486888503]
Unsupervised object discovery aims to localize objects in images.
We propose a fully unsupervised, bottom-up approach, for multiple objects discovery.
We provide state-of-the-art results for both unsupervised class-agnostic object detection and unsupervised image segmentation.
arXiv Detail & Related papers (2022-12-20T09:48:24Z) - Object Discovery via Contrastive Learning for Weakly Supervised Object
Detection [12.822548027334328]
Weakly Supervised Object Detection is a task that detects objects in an image using a model trained only on image-level annotations.
We propose a novel multiple instance labeling method called object discovery.
We achieve new state-of-the-art results on MS-COCO 2014 and 2017 as well as PASCAL VOC 2012, and competitive results on PASCAL VOC 2007.
arXiv Detail & Related papers (2022-08-16T07:36:12Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - Discovery-and-Selection: Towards Optimal Multiple Instance Learning for
Weakly Supervised Object Detection [86.86602297364826]
We propose a discoveryand-selection approach fused with multiple instance learning (DS-MIL)
Our proposed DS-MIL approach can consistently improve the baselines, reporting state-of-the-art performance.
arXiv Detail & Related papers (2021-10-18T07:06:57Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - A Few-Shot Sequential Approach for Object Counting [63.82757025821265]
We introduce a class attention mechanism that sequentially attends to objects in the image and extracts their relevant features.
The proposed technique is trained on point-level annotations and uses a novel loss function that disentangles class-dependent and class-agnostic aspects of the model.
We present our results on a variety of object-counting/detection datasets, including FSOD and MS COCO.
arXiv Detail & Related papers (2020-07-03T18:23:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.