Deep Semantic Matching with Foreground Detection and Cycle-Consistency
- URL: http://arxiv.org/abs/2004.00144v1
- Date: Tue, 31 Mar 2020 22:38:09 GMT
- Title: Deep Semantic Matching with Foreground Detection and Cycle-Consistency
- Authors: Yun-Chun Chen, Po-Hsiang Huang, Li-Yu Yu, Jia-Bin Huang, Ming-Hsuan
Yang, Yen-Yu Lin
- Abstract summary: We address weakly supervised semantic matching based on a deep network.
We explicitly estimate the foreground regions to suppress the effect of background clutter.
We develop cycle-consistent losses to enforce the predicted transformations across multiple images to be geometrically plausible and consistent.
- Score: 103.22976097225457
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Establishing dense semantic correspondences between object instances remains
a challenging problem due to background clutter, significant scale and pose
differences, and large intra-class variations. In this paper, we address weakly
supervised semantic matching based on a deep network where only image pairs
without manual keypoint correspondence annotations are provided. To facilitate
network training with this weaker form of supervision, we 1) explicitly
estimate the foreground regions to suppress the effect of background clutter
and 2) develop cycle-consistent losses to enforce the predicted transformations
across multiple images to be geometrically plausible and consistent. We train
the proposed model using the PF-PASCAL dataset and evaluate the performance on
the PF-PASCAL, PF-WILLOW, and TSS datasets. Extensive experimental results show
that the proposed approach performs favorably against the state-of-the-art
methods.
Related papers
- Semantic-guided Adversarial Diffusion Model for Self-supervised Shadow Removal [5.083330121710086]
GAN-based training often faces issues such as mode collapse and unstable optimization.
We propose a semantic-guided adversarial diffusion framework for self-supervised shadow removal.
We conduct experiments on multiple public datasets, and the experimental results demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T09:14:38Z) - Progressively Dual Prior Guided Few-shot Semantic Segmentation [57.37506990980975]
Few-shot semantic segmentation task aims at performing segmentation in query images with a few annotated support samples.
We propose a progressively dual prior guided few-shot semantic segmentation network.
arXiv Detail & Related papers (2022-11-20T16:19:47Z) - Boosting Few-shot Fine-grained Recognition with Background Suppression
and Foreground Alignment [53.401889855278704]
Few-shot fine-grained recognition (FS-FGR) aims to recognize novel fine-grained categories with the help of limited available samples.
We propose a two-stage background suppression and foreground alignment framework, which is composed of a background activation suppression (BAS) module, a foreground object alignment (FOA) module, and a local to local (L2L) similarity metric.
Experiments conducted on multiple popular fine-grained benchmarks demonstrate that our method outperforms the existing state-of-the-art by a large margin.
arXiv Detail & Related papers (2022-10-04T07:54:40Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - Learning Contrastive Representation for Semantic Correspondence [150.29135856909477]
We propose a multi-level contrastive learning approach for semantic matching.
We show that image-level contrastive learning is a key component to encourage the convolutional features to find correspondence between similar objects.
arXiv Detail & Related papers (2021-09-22T18:34:14Z) - SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from
Monocular images [94.36401543589523]
We introduce the concept of semantic objectness to exploit the geometric relationship of these two tasks.
We then propose a Semantic Object and Depth Estimation Network (SOSD-Net) based on the objectness assumption.
To the best of our knowledge, SOSD-Net is the first network that exploits the geometry constraint for simultaneous monocular depth estimation and semantic segmentation.
arXiv Detail & Related papers (2021-01-19T02:41:03Z) - D-LEMA: Deep Learning Ensembles from Multiple Annotations -- Application
to Skin Lesion Segmentation [14.266037264648533]
Leveraging a collection of annotators' opinions for an image is an interesting way of estimating a gold standard.
We propose an approach to handle annotators' disagreements when training a deep model.
arXiv Detail & Related papers (2020-12-14T01:51:22Z) - Contextual Interference Reduction by Selective Fine-Tuning of Neural
Networks [1.0152838128195465]
We study the role of the context on interfering with a disentangled foreground target object representation.
We work on a framework that benefits from the bottom-up and top-down processing paradigms.
arXiv Detail & Related papers (2020-11-21T20:11:12Z) - ProAlignNet : Unsupervised Learning for Progressively Aligning Noisy
Contours [12.791313859673187]
"ProAlignNet" accounts for large scale misalignments and complex transformations between the contour shapes.
It learns by training with a novel loss function which is derived an upperbound of a proximity-sensitive and local shape-dependent similarity metric.
In two real-world applications, the proposed models consistently perform superior to state-of-the-art methods.
arXiv Detail & Related papers (2020-05-23T14:56:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.