Image Stitching Based on Planar Region Consensus
- URL: http://arxiv.org/abs/2007.02722v1
- Date: Mon, 6 Jul 2020 13:07:20 GMT
- Title: Image Stitching Based on Planar Region Consensus
- Authors: Aocheng Li, Jie Guo, Yanwen Guo
- Abstract summary: We propose a new image stitching method which stitches images by allowing for the alignment of a set of matched dominant planar regions.
We use rich semantic information directly from RGB images to extract planar image regions with a deep Convolutional Neural Network (CNN)
Our method can deal with different situations and outperforms the state-of-the-arts on challenging scenes.
- Score: 22.303750435673752
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image stitching for two images without a global transformation between them
is notoriously difficult. In this paper, noticing the importance of planar
structure under perspective geometry, we propose a new image stitching method
which stitches images by allowing for the alignment of a set of matched
dominant planar regions. Clearly different from previous methods resorting to
plane segmentation, the key to our approach is to utilize rich semantic
information directly from RGB images to extract planar image regions with a
deep Convolutional Neural Network (CNN). We specifically design a new module to
make fully use of existing semantic segmentation networks to accommodate planar
segmentation. To train the network, a dataset for planar region segmentation is
contributed. With the planar region knowledge, a set of local transformations
can be obtained by constraining matched regions, enabling more precise
alignment in the overlapping area. We also use planar knowledge to estimate a
transformation field over the whole image. The final mosaic is obtained by a
mesh-based optimization framework which maintains high alignment accuracy and
relaxes similarity transformation at the same time. Extensive experiments with
quantitative comparisons show that our method can deal with different
situations and outperforms the state-of-the-arts on challenging scenes.
Related papers
- Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers [59.0181939916084]
Traditional 3D networks mainly focus on local geometric details and ignore the topological structure between local geometries.
We propose a novel Priors Distillation (RPD) method to extract priors from the well-trained transformers on massive images.
Experiments on the PointDA-10 and the Sim-to-Real datasets verify that the proposed method consistently achieves the state-of-the-art performance of UDA for point cloud classification.
arXiv Detail & Related papers (2024-07-26T06:29:09Z) - ReFit: A Framework for Refinement of Weakly Supervised Semantic
Segmentation using Object Border Fitting for Medical Images [4.945138408504987]
Weakly Supervised Semantic (WSSS) relying only on image-level supervision is a promising approach to deal with the need for networks.
We propose our novel ReFit framework, which deploys state-of-the-art class activation maps combined with various post-processing techniques.
By applying our method to WSSS predictions, we achieved up to 10% improvement over the current state-of-the-art WSSS methods for medical imaging.
arXiv Detail & Related papers (2023-03-14T12:46:52Z) - Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image
Matching [102.39635336450262]
We address the problem of ground-to-satellite image geo-localization by matching a query image captured at the ground level against a large-scale database with geotagged satellite images.
Our new method is able to achieve the fine-grained location of a query image, up to pixel size precision of the satellite image.
arXiv Detail & Related papers (2022-03-26T20:10:38Z) - Cross-Image Region Mining with Region Prototypical Network for Weakly
Supervised Segmentation [45.39679291105364]
We propose a region network RPNet to explore the cross-image object diversity of the training set.
Similar object parts across images are identified via region feature comparison.
Experiments show that the proposed method generates more complete and accurate pseudo object masks.
arXiv Detail & Related papers (2021-08-17T02:51:02Z) - LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution
Homography Estimation [52.63874513999119]
Cross-resolution image alignment is a key problem in multiscale giga photography.
Existing deep homography methods neglecting the explicit formulation of correspondences between them, which leads to degraded accuracy in cross-resolution challenges.
We propose a local transformer network embedded within a multiscale structure to explicitly learn correspondences between the multimodal inputs.
arXiv Detail & Related papers (2021-06-08T02:51:45Z) - Region Similarity Representation Learning [94.88055458257081]
Region Similarity Representation Learning (ReSim) is a new approach to self-supervised representation learning for localization-based tasks.
ReSim learns both regional representations for localization as well as semantic image-level representations.
We show how ReSim learns representations which significantly improve the localization and classification performance compared to a competitive MoCo-v2 baseline.
arXiv Detail & Related papers (2021-03-24T00:42:37Z) - SPAN: Spatial Pyramid Attention Network forImage Manipulation
Localization [24.78951727072683]
We present a novel framework, Spatial Pyramid Attention Network (SPAN) for detection and localization of multiple types of image manipulations.
SPAN is trained on a generic, synthetic dataset but can also be fine tuned for specific datasets.
The proposed method shows significant gains in performance on standard datasets over previous state-of-the-art methods.
arXiv Detail & Related papers (2020-09-01T21:59:35Z) - Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization [54.00111565818903]
Cross-view geo-localization is to spot images of the same geographic target from different platforms.
Existing methods usually concentrate on mining the fine-grained feature of the geographic target in the image center.
We introduce a simple and effective deep neural network, called Local Pattern Network (LPN), to take advantage of contextual information.
arXiv Detail & Related papers (2020-08-26T16:06:11Z) - Split-Merge Pooling [36.2980225204665]
Split-Merge pooling is introduced to preserve spatial information without subsampling.
We evaluate our approach for dense semantic segmentation of large image sizes taken from the Cityscapes and GTA-5 datasets.
arXiv Detail & Related papers (2020-06-13T23:20:30Z) - Neural Subdivision [58.97214948753937]
This paper introduces Neural Subdivision, a novel framework for data-driven coarseto-fine geometry modeling.
We optimize for the same set of network weights across all local mesh patches, thus providing an architecture that is not constrained to a specific input mesh, fixed genus, or category.
We demonstrate that even when trained on a single high-resolution mesh our method generates reasonable subdivisions for novel shapes.
arXiv Detail & Related papers (2020-05-04T20:03:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.