Pairwise Similarity Knowledge Transfer for Weakly Supervised Object
Localization
- URL: http://arxiv.org/abs/2003.08375v2
- Date: Sun, 19 Jul 2020 14:09:18 GMT
- Title: Pairwise Similarity Knowledge Transfer for Weakly Supervised Object
Localization
- Authors: Amir Rahimi, Amirreza Shaban, Thalaiyasingam Ajanthan, Richard
Hartley, Byron Boots
- Abstract summary: We study the problem of learning localization model on target classes with weakly supervised image labels.
In this work, we argue that learning only an objectness function is a weak form of knowledge transfer.
Experiments on the COCO and ILSVRC 2013 detection datasets show that the performance of the localization model improves significantly with the inclusion of pairwise similarity function.
- Score: 53.99850033746663
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly Supervised Object Localization (WSOL) methods only require image level
labels as opposed to expensive bounding box annotations required by fully
supervised algorithms. We study the problem of learning localization model on
target classes with weakly supervised image labels, helped by a fully annotated
source dataset. Typically, a WSOL model is first trained to predict class
generic objectness scores on an off-the-shelf fully supervised source dataset
and then it is progressively adapted to learn the objects in the weakly
supervised target dataset. In this work, we argue that learning only an
objectness function is a weak form of knowledge transfer and propose to learn a
classwise pairwise similarity function that directly compares two input
proposals as well. The combined localization model and the estimated object
annotations are jointly learned in an alternating optimization paradigm as is
typically done in standard WSOL methods. In contrast to the existing work that
learns pairwise similarities, our approach optimizes a unified objective with
convergence guarantee and it is computationally efficient for large-scale
applications. Experiments on the COCO and ILSVRC 2013 detection datasets show
that the performance of the localization model improves significantly with the
inclusion of pairwise similarity function. For instance, in the ILSVRC dataset,
the Correct Localization (CorLoc) performance improves from 72.8% to 78.2%
which is a new state-of-the-art for WSOL task in the context of knowledge
transfer.
Related papers
- Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Transferability Metrics for Object Detection [0.0]
Transfer learning aims to make the most of existing pre-trained models to achieve better performance on a new task in limited data scenarios.
We extend transferability metrics to object detection using ROI-Align and TLogME.
We show that TLogME provides a robust correlation with transfer performance and outperforms other transferability metrics on local and global level features.
arXiv Detail & Related papers (2023-06-27T08:49:31Z) - Deep Active Ensemble Sampling For Image Classification [8.31483061185317]
Active learning frameworks aim to reduce the cost of data annotation by actively requesting the labeling for the most informative data points.
Some proposed approaches include uncertainty-based techniques, geometric methods, implicit combination of uncertainty-based and geometric approaches.
We present an innovative integration of recent progress in both uncertainty-based and geometric frameworks to enable an efficient exploration/exploitation trade-off in sample selection strategy.
Our framework provides two advantages: (1) accurate posterior estimation, and (2) tune-able trade-off between computational overhead and higher accuracy.
arXiv Detail & Related papers (2022-10-11T20:20:20Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - Dense Contrastive Learning for Self-Supervised Visual Pre-Training [102.15325936477362]
We present dense contrastive learning, which implements self-supervised learning by optimizing a pairwise contrastive (dis)similarity loss at the pixel level between two views of input images.
Compared to the baseline method MoCo-v2, our method introduces negligible computation overhead (only 1% slower)
arXiv Detail & Related papers (2020-11-18T08:42:32Z) - Interpolation-based semi-supervised learning for object detection [44.37685664440632]
We propose an Interpolation-based Semi-supervised learning method for object detection.
The proposed losses dramatically improve the performance of semi-supervised learning as well as supervised learning.
arXiv Detail & Related papers (2020-06-03T10:53:44Z) - iFAN: Image-Instance Full Alignment Networks for Adaptive Object
Detection [48.83883375118966]
iFAN aims to precisely align feature distributions on both image and instance levels.
It outperforms state-of-the-art methods with a boost of 10%+ AP over the source-only baseline.
arXiv Detail & Related papers (2020-03-09T13:27:06Z) - Improving Few-shot Learning by Spatially-aware Matching and
CrossTransformer [116.46533207849619]
We study the impact of scale and location mismatch in the few-shot learning scenario.
We propose a novel Spatially-aware Matching scheme to effectively perform matching across multiple scales and locations.
arXiv Detail & Related papers (2020-01-06T14:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.