Multi-Resolution Factor Graph Based Stereo Correspondence Algorithm
- URL: http://arxiv.org/abs/2202.01309v1
- Date: Wed, 2 Feb 2022 22:27:10 GMT
- Title: Multi-Resolution Factor Graph Based Stereo Correspondence Algorithm
- Authors: Hanieh Shabanian, Madhusudhanan Balasubramanian
- Abstract summary: A dense depth-map of a scene at an arbitrary view orientation can be estimated from dense view correspondences.
We present a new multi-resolution factor graph-based stereo matching algorithm (MR-FGS)
The MR-FGS algorithm was evaluated qualitatively and quantitatively using stereo pairs in the Middlebury stereo benchmark dataset.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A dense depth-map of a scene at an arbitrary view orientation can be
estimated from dense view correspondences among multiple lower-dimensional
views of the scene. These low-dimensional view correspondences are dependent on
the geometrical relationship among the views and the scene. Determining dense
view correspondences is difficult in part due to presence of homogeneous
regions in the scene and due to presence of occluded regions and illumination
differences among the views. We present a new multi-resolution factor
graph-based stereo matching algorithm (MR-FGS) that utilizes both intra- and
inter-resolution dependencies among the views as well as among the disparity
estimates. The proposed framework allows exchange of information among multiple
resolutions of the correspondence problem and is useful for handling larger
homogeneous regions in a scene. The MR-FGS algorithm was evaluated
qualitatively and quantitatively using stereo pairs in the Middlebury stereo
benchmark dataset based on commonly used performance measures. When compared to
a recently developed factor graph model (FGS), the MR-FGS algorithm provided
more accurate disparity estimates without requiring the commonly used
post-processing procedure known as the left-right consistency check. The
multi-resolution dependency constraint within the factor-graph model
significantly improved contrast along depth boundaries in the MR-FGS generated
disparity maps.
Related papers
- Cross-view geo-localization, Image retrieval, Multiscale geometric modeling, Frequency domain enhancement [1.6686955491488273]
Cross-view geo-localization (CVGL) aims to establish spatial correspondences between images captured from significantly different viewpoints.<n>CVGL remains challenging due to severe geometric asymmetry, texture inconsistency across imaging domains, and the progressive degradation of discriminative local information.<n>This paper proposes the Spatial and Frequency Domain Enhancement Network (SFDE), which leverages complementary representations from spatial and frequency domains.
arXiv Detail & Related papers (2026-03-03T08:25:35Z) - MS-ISSM: Objective Quality Assessment of Point Clouds Using Multi-scale Implicit Structural Similarity [65.85858856481131]
unstructured and irregular nature of point clouds poses a significant challenge for objective quality assessment (PCQA)<n>We propose the Multi-scale Implicit Structural Similarity Measurement (MS-ISSM)
arXiv Detail & Related papers (2026-01-03T14:58:52Z) - Unsupervised Multi-View Visual Anomaly Detection via Progressive Homography-Guided Alignment [14.782512101141016]
Unsupervised visual anomaly detection from multi-view images presents a significant challenge.<n>ViewSense-AD (VSAD) learns viewpoint-invariant representations by explicitly modeling geometric consistency across views.<n>Anomaly detection is performed by comparing multi-level features from the diffusion model against a learned memory bank of normal prototypes.
arXiv Detail & Related papers (2025-11-24T05:01:16Z) - Graph-Based Uncertainty Modeling and Multimodal Fusion for Salient Object Detection [12.743278093269325]
We propose a dynamic uncertainty propagation and multimodal collaborative reasoning network (DUP-MCRNet)<n>DUGC is designed to propagate uncertainty between layers through a sparse graph constructed based on spatial semantic distance.<n>MCF uses learnable modality gating weights to weightedly fuse the attention maps of RGB, depth, and edge features.
arXiv Detail & Related papers (2025-08-28T04:31:48Z) - MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry [18.419627185893926]
MAC-VO is a novel learning-based stereo VO that leverages the learned metrics-aware matching uncertainty for dual purposes.
Our keypoint selector employs the learned uncertainty to filter out the low-quality features based on global inconsistency.
arXiv Detail & Related papers (2024-09-14T16:49:42Z) - Multi-Spectral Image Stitching via Spatial Graph Reasoning [52.27796682972484]
We propose a spatial graph reasoning based multi-spectral image stitching method.
We embed multi-scale complementary features from the same view position into a set of nodes.
By introducing long-range coherence along spatial and channel dimensions, the complementarity of pixel relations and channel interdependencies aids in the reconstruction of aligned multi-view features.
arXiv Detail & Related papers (2023-07-31T15:04:52Z) - Multiscale Dynamic Graph Representation for Biometric Recognition with
Occlusions [43.05765549682057]
Occlusion is a common problem with biometric recognition in the wild.
We propose a novel unified framework integrating the merits of both CNNs and graph models.
arXiv Detail & Related papers (2023-07-27T04:18:08Z) - Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth
Estimation in Dynamic Scenes [51.20150148066458]
We propose a novel method to learn to fuse the multi-view and monocular cues encoded as volumes without needing the generalizationally crafted masks.
Experiments on real-world datasets prove the significant effectiveness and ability of the proposed method.
arXiv Detail & Related papers (2023-04-18T13:55:24Z) - A Novel Factor Graph-Based Optimization Technique for Stereo
Correspondence Estimation [0.0]
We present a new factor graph-based probabilistic graphical model for disparity estimation.
The new factor graph-based method provided disparity estimates with higher accuracy when compared to the recent non-learning- and learning-based disparity estimation algorithms.
arXiv Detail & Related papers (2021-09-22T23:30:33Z) - LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution
Homography Estimation [52.63874513999119]
Cross-resolution image alignment is a key problem in multiscale giga photography.
Existing deep homography methods neglecting the explicit formulation of correspondences between them, which leads to degraded accuracy in cross-resolution challenges.
We propose a local transformer network embedded within a multiscale structure to explicitly learn correspondences between the multimodal inputs.
arXiv Detail & Related papers (2021-06-08T02:51:45Z) - Spatial-spectral Hyperspectral Image Classification via Multiple Random
Anchor Graphs Ensemble Learning [88.60285937702304]
This paper proposes a novel spatial-spectral HSI classification method via multiple random anchor graphs ensemble learning (RAGE)
Firstly, the local binary pattern is adopted to extract the more descriptive features on each selected band, which preserves local structures and subtle changes of a region.
Secondly, the adaptive neighbors assignment is introduced in the construction of anchor graph, to reduce the computational complexity.
arXiv Detail & Related papers (2021-03-25T09:31:41Z) - Semantic Change Detection with Asymmetric Siamese Networks [71.28665116793138]
Given two aerial images, semantic change detection aims to locate the land-cover variations and identify their change types with pixel-wise boundaries.
This problem is vital in many earth vision related tasks, such as precise urban planning and natural resource management.
We present an asymmetric siamese network (ASN) to locate and identify semantic changes through feature pairs obtained from modules of widely different structures.
arXiv Detail & Related papers (2020-10-12T13:26:30Z) - Multi-Margin based Decorrelation Learning for Heterogeneous Face
Recognition [90.26023388850771]
This paper presents a deep neural network approach to extract decorrelation representations in a hyperspherical space for cross-domain face images.
The proposed framework can be divided into two components: heterogeneous representation network and decorrelation representation learning.
Experimental results on two challenging heterogeneous face databases show that our approach achieves superior performance on both verification and recognition tasks.
arXiv Detail & Related papers (2020-05-25T07:01:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.