Decoupling Makes Weakly Supervised Local Feature Better
- URL: http://arxiv.org/abs/2201.02861v1
- Date: Sat, 8 Jan 2022 16:51:02 GMT
- Title: Decoupling Makes Weakly Supervised Local Feature Better
- Authors: Kunhong Li, LongguangWang, Li Liu, Qing Ran, Kai Xu, Yulan Guo
- Abstract summary: We propose a decoupled describe-then-detect pipeline tailored for weakly supervised local feature learning.
Within our pipeline, the detection step is decoupled from the description step and postponed until discriminative and robust descriptors are learned.
In addition, we introduce a line-to-window search strategy to explicitly use the camera pose information for better descriptor learning.
- Score: 39.17900986173409
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly supervised learning can help local feature methods to overcome the
obstacle of acquiring a large-scale dataset with densely labeled
correspondences. However, since weak supervision cannot distinguish the losses
caused by the detection and description steps, directly conducting weakly
supervised learning within a joint describe-then-detect pipeline suffers
limited performance. In this paper, we propose a decoupled describe-then-detect
pipeline tailored for weakly supervised local feature learning. Within our
pipeline, the detection step is decoupled from the description step and
postponed until discriminative and robust descriptors are learned. In addition,
we introduce a line-to-window search strategy to explicitly use the camera pose
information for better descriptor learning. Extensive experiments show that our
method, namely PoSFeat (Camera Pose Supervised Feature), outperforms previous
fully and weakly supervised methods and achieves state-of-the-art performance
on a wide range of downstream tasks.
Related papers
- Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection [57.646582245834324]
We propose a simple yet effective deepfake detector called LSDA.
It is based on a idea: representations with a wider variety of forgeries should be able to learn a more generalizable decision boundary.
We show that our proposed method is surprisingly effective and transcends state-of-the-art detectors across several widely used benchmarks.
arXiv Detail & Related papers (2023-11-19T09:41:10Z) - Towards End-to-End Unsupervised Saliency Detection with Self-Supervised
Top-Down Context [25.85453873366275]
We propose a self-supervised end-to-end salient object detection framework via top-down context.
We exploit the self-localization from the deepest feature to construct the location maps which are then leveraged to learn the most instructive segmentation guidance.
Our method achieves leading performance among the recent end-to-end methods and most of the multi-stage solutions.
arXiv Detail & Related papers (2023-10-14T08:43:22Z) - Box-based Refinement for Weakly Supervised and Unsupervised Localization
Tasks [57.70351255180495]
We train the detectors on top of the network output instead of the image data and apply suitable loss backpropagation.
Our findings reveal a significant improvement in phrase grounding for the what is where by looking'' task.
arXiv Detail & Related papers (2023-09-07T17:36:02Z) - OCTAve: 2D en face Optical Coherence Tomography Angiography Vessel
Segmentation in Weakly-Supervised Learning with Locality Augmentation [14.322349196837209]
We propose the application of the scribble-base weakly-supervised learning method to automate the pixel-level annotation.
The proposed method, called OCTAve, combines the weakly-supervised learning using scribble-annotated ground truth augmented with an adversarial and a novel self-supervised deep supervision.
arXiv Detail & Related papers (2022-07-25T14:40:56Z) - ReDFeat: Recoupling Detection and Description for Multimodal Feature
Learning [51.07496081296863]
We recouple independent constraints of detection and description of multimodal feature learning with a mutual weighting strategy.
We propose a detector that possesses a large receptive field and is equipped with learnable non-maximum suppression layers.
We build a benchmark that contains cross visible, infrared, near-infrared and synthetic aperture radar image pairs for evaluating the performance of features in feature matching and image registration tasks.
arXiv Detail & Related papers (2022-05-16T04:24:22Z) - WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection [75.80075054706079]
We propose a weakly- and semi-supervised object detection framework (WSSOD)
An agent detector is first trained on a joint dataset and then used to predict pseudo bounding boxes on weakly-annotated images.
The proposed framework demonstrates remarkable performance on PASCAL-VOC and MSCOCO benchmark, achieving a high performance comparable to those obtained in fully-supervised settings.
arXiv Detail & Related papers (2021-05-21T11:58:50Z) - DeepfakeUCL: Deepfake Detection via Unsupervised Contrastive Learning [20.94569893388119]
We design a novel deepfake detection method via unsupervised contrastive learning.
We show that our method enables comparable detection performance to state-of-the-art supervised techniques.
arXiv Detail & Related papers (2021-04-23T09:48:10Z) - Disambiguation of weak supervision with exponential convergence rates [88.99819200562784]
In supervised learning, data are annotated with incomplete yet discriminative information.
In this paper, we focus on partial labelling, an instance of weak supervision where, from a given input, we are given a set of potential targets.
We propose an empirical disambiguation algorithm to recover full supervision from weak supervision.
arXiv Detail & Related papers (2021-02-04T18:14:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.