Semi-supervised Deep Large-baseline Homography Estimation with
Progressive Equivalence Constraint
- URL: http://arxiv.org/abs/2212.02763v1
- Date: Tue, 6 Dec 2022 05:28:05 GMT
- Title: Semi-supervised Deep Large-baseline Homography Estimation with
Progressive Equivalence Constraint
- Authors: Hai Jiang, Haipeng Li, Yuhang Lu, Songchen Han, and Shuaicheng Liu
- Abstract summary: Homography estimation is erroneous in the case of large-baseline due to the low image overlay and limited receptive field.
We propose a progressive estimation strategy by converting large-baseline homography into multiple intermediate ones.
Our method achieves state-of-the-art performance in large-baseline scenes while keeping competitive performance in small-baseline scenes.
- Score: 25.022907946911033
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Homography estimation is erroneous in the case of large-baseline due to the
low image overlay and limited receptive field. To address it, we propose a
progressive estimation strategy by converting large-baseline homography into
multiple intermediate ones, cumulatively multiplying these intermediate items
can reconstruct the initial homography. Meanwhile, a semi-supervised homography
identity loss, which consists of two components: a supervised objective and an
unsupervised objective, is introduced. The first supervised loss is acting to
optimize intermediate homographies, while the second unsupervised one helps to
estimate a large-baseline homography without photometric losses. To validate
our method, we propose a large-scale dataset that covers regular and
challenging scenes. Experiments show that our method achieves state-of-the-art
performance in large-baseline scenes while keeping competitive performance in
small-baseline scenes. Code and dataset are available at
https://github.com/megvii-research/LBHomo.
Related papers
- Topograph: An efficient Graph-Based Framework for Strictly Topology Preserving Image Segmentation [78.54656076915565]
Topological correctness plays a critical role in many image segmentation tasks.
Most networks are trained using pixel-wise loss functions, such as Dice, neglecting topological accuracy.
We propose a novel, graph-based framework for topologically accurate image segmentation.
arXiv Detail & Related papers (2024-11-05T16:20:14Z) - Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation [15.285720572043678]
We formulate unsupervised domain adaptation for monocular depth estimation as a consistency-based semi-supervised learning problem.
We introduce a pairwise loss function that regularises predictions on the source domain while enforcing consistency across multiple augmented views.
In our experiments, we rely on the standard depth estimation benchmarks KITTI and NYUv2 to demonstrate state-of-the-art results.
arXiv Detail & Related papers (2024-05-27T23:32:06Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Unsupervised Homography Estimation with Coplanarity-Aware GAN [39.477228263736905]
Estimating homography from an image pair is a fundamental problem in image alignment.
HomoGAN is designed to guide unsupervised homography estimation to focus on the dominant plane.
Results show that our matching error is 22% lower than the previous SOTA method.
arXiv Detail & Related papers (2022-05-08T09:26:47Z) - Semi-supervised Semantic Segmentation with Directional Context-aware
Consistency [66.49995436833667]
We focus on the semi-supervised segmentation problem where only a small set of labeled data is provided with a much larger collection of totally unlabeled images.
A preferred high-level representation should capture the contextual information while not losing self-awareness.
We present the Directional Contrastive Loss (DC Loss) to accomplish the consistency in a pixel-to-pixel manner.
arXiv Detail & Related papers (2021-06-27T03:42:40Z) - Large-scale Unsupervised Semantic Segmentation [163.3568726730319]
We propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to track the research progress.
Based on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 40k high-quality semantic segmentation annotations for evaluation.
arXiv Detail & Related papers (2021-06-06T15:02:11Z) - Perceptual Loss for Robust Unsupervised Homography Estimation [1.2891210250935146]
BiHomE minimizes the distance in the feature space between the warped image from the source viewpoint and the corresponding image from the target viewpoint.
We show that biHomE achieves state-of-the-art performance on synthetic COCO dataset, which is also comparable or better compared to supervised approaches.
arXiv Detail & Related papers (2021-04-20T14:41:54Z) - Motion Basis Learning for Unsupervised Deep Homography Estimation with
Subspace Projection [27.68752841842823]
We introduce a new framework for unsupervised deep homography estimation.
We show that our approach outperforms the state-of-the-art on the homography benchmark datasets.
arXiv Detail & Related papers (2021-03-29T05:51:34Z) - Towards Unbiased COVID-19 Lesion Localisation and Segmentation via
Weakly Supervised Learning [66.36706284671291]
We propose a data-driven framework supervised by only image-level labels to support unbiased lesion localisation.
The framework can explicitly separate potential lesions from original images, with the help of a generative adversarial network and a lesion-specific decoder.
arXiv Detail & Related papers (2021-03-01T06:05:49Z) - Edge-Preserving Guided Semantic Segmentation for VIPriors Challenge [3.435043566706133]
Current state-of-the-art and deep learning-based semantic segmentation techniques are hard to train well.
We propose edge-preserving guidance to obtain the extra prior information.
Experiments demonstrate that the proposed method can achieve excellent performance under small-scale training set.
arXiv Detail & Related papers (2020-07-17T11:49:10Z) - Deep Semantic Matching with Foreground Detection and Cycle-Consistency [103.22976097225457]
We address weakly supervised semantic matching based on a deep network.
We explicitly estimate the foreground regions to suppress the effect of background clutter.
We develop cycle-consistent losses to enforce the predicted transformations across multiple images to be geometrically plausible and consistent.
arXiv Detail & Related papers (2020-03-31T22:38:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.