Hierarchical GraphCut Phase Unwrapping based on Invariance of Diffeomorphisms Framework
- URL: http://arxiv.org/abs/2511.18682v1
- Date: Mon, 24 Nov 2025 01:45:26 GMT
- Title: Hierarchical GraphCut Phase Unwrapping based on Invariance of Diffeomorphisms Framework
- Authors: Xiang Gao, Xinmu Wang, Zhou Zhao, Junqi Huang, Xianfeng David Gu,
- Abstract summary: A key step in structured-light scanning is phase unwrapping, which recovers continuous phase values from measurements wrapped modulo 2pi.<n>The goal is to estimate the unwrapped phase count k in the equation Phi = phi + 2pi k, where phi is the wrapped phase and Phi is the true phase.<n>Existing methods trade speed for accuracy, while accurate algorithms are too slow for real-time use.<n>This work proposes a phase unwrapping framework that reformulates GraphCut-based unwrapping as a pixel-labeling problem.
- Score: 38.53305416499726
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent years have witnessed rapid advancements in 3D scanning technologies, with applications spanning VR/AR, digital human creation, and medical imaging. Structured-light scanning with phase-shifting techniques is preferred for its use of low-intensity visible light and high accuracy, making it well suited for capturing 4D facial dynamics. A key step is phase unwrapping, which recovers continuous phase values from measurements wrapped modulo 2pi. The goal is to estimate the unwrapped phase count k in the equation Phi = phi + 2pi k, where phi is the wrapped phase and Phi is the true phase. Noise, occlusions, and complex 3D geometry make recovering the true phase challenging because phase unwrapping is ill-posed: measurements only provide modulo 2pi values, and estimating k requires assumptions about surface continuity. Existing methods trade speed for accuracy: fast approaches lack precision, while accurate algorithms are too slow for real-time use. To overcome these limitations, this work proposes a phase unwrapping framework that reformulates GraphCut-based unwrapping as a pixel-labeling problem. This framework improves the estimation of the unwrapped phase count k through the invariance property of diffeomorphisms applied in image space via conformal and optimal transport (OT) maps. An odd number of diffeomorphisms are precomputed from the input phase data, and a hierarchical GraphCut algorithm is applied in each domain. The resulting label maps are fused via majority voting to robustly estimate k at each pixel. Experimental results demonstrate a 45.5x speedup and lower L2 error in real experiments and simulations, showing potential for real-time applications.
Related papers
- NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation [88.09231548061295]
Phase-Preserving Diffusion -PD is a model-agnostic reformulation of the diffusion process.<n>-PD preserves input phase while randomizing magnitude, enabling structure-aligned generation without architectural changes.<n>-PD adds no inference-time cost and is compatible with any diffusion model for images or videos.
arXiv Detail & Related papers (2025-12-04T18:59:18Z) - Accelerating 3D Photoacoustic Computed Tomography with End-to-End Physics-Aware Neural Operators [74.65171736966131]
Photoacoustic computed tomography (PACT) combines optical contrast with ultrasonic resolution, achieving deep-tissue imaging beyond the optical diffusion limit.<n>Current implementations require dense transducer arrays and prolonged acquisition times, limiting clinical translation.<n>We introduce Pano, an end-to-end physics-aware model that directly learns the inverse acoustic mapping from sensor measurements to volumetric reconstructions.
arXiv Detail & Related papers (2025-09-11T23:12:55Z) - 3D-2D Neural Nets for Phase Retrieval in Noisy Interferometric Imaging [0.0]
We introduce a 3D-2D Phase Retrieval U-Net (PRUNe) that takes noisy and randomly phase-shifted interferograms as inputs, and outputs a single 2D phase image.
A 3D downsampling convolutional encoder captures correlations within and between frames to produce a 2D latent space, which is upsampled by a 2D decoder into a phase image.
We find PRUNe reconstructions consistently show more accurate and smooth reconstructions, with a x2.5 - 4 lower mean squared error at multiple signal-to-noise ratios for interferograms with low (
arXiv Detail & Related papers (2024-02-08T21:19:16Z) - Enhanced fringe-to-phase framework using deep learning [2.243491254050456]
We introduce SFNet, a symmetric fusion network that transforms two fringe images into an absolute phase.
To enhance output reliability, Our framework predicts refined phases by incorporating information from fringe images of a different frequency than those used as input.
arXiv Detail & Related papers (2024-02-01T19:47:34Z) - Generalization of pixel-wise phase estimation by CNN and improvement of
phase-unwrapping by MRF optimization for one-shot 3D scan [0.621405559652172]
Active stereo technique using single pattern projection, a.k.a. one-shot 3D scan, have drawn a wide attention from industry, medical purposes, etc.
One severe drawback of one-shot 3D scan is sparse reconstruction.
We propose a pixel-wise technique for one-shot scan, which is applicable to any types of static pattern if the pattern is regular and periodic.
arXiv Detail & Related papers (2023-09-26T10:45:04Z) - Pixel Adaptive Deep Unfolding Transformer for Hyperspectral Image
Reconstruction [58.32266851510948]
We propose a Pixel Adaptive Deep Unfolding Transformer (PADUT) for HSI reconstruction.
In the data module, a pixel adaptive descent step is employed to focus on pixel-level degradation.
In the prior module, we introduce the Non-local Spectral Transformer (NST) to emphasize the 3D characteristics of HSI for recovering.
arXiv Detail & Related papers (2023-08-21T16:12:31Z) - Self-supervised phase unwrapping in fringe projection profilometry [0.0]
A novel self-supervised phase unwrapping method for single-camera fringe projection profilometry is proposed.
The trained network can retrieve the absolute fringe order from one phase map of 64-period and overperform DF-TPU approaches in terms of depth accuracy.
Experimental results demonstrate the validation of the proposed method on real scenes of motion blur, isolated objects, low reflectivity, and phase discontinuity.
arXiv Detail & Related papers (2023-02-13T14:16:34Z) - Deep Iterative Phase Retrieval for Ptychography [13.49645012479288]
In order to reconstruct an object from its diffraction pattern, the inverse Fourier transform must be computed.
In this work we consider ptychography, a sub-field of diffractive imaging, where objects are reconstructed from multiple overlapping diffraction images.
We propose an augmentation of existing iterative phase retrieval algorithms with a neural network designed for refining the result of each iteration.
arXiv Detail & Related papers (2022-02-17T09:13:35Z) - Uncertainty-Aware Camera Pose Estimation from Points and Lines [101.03675842534415]
Perspective-n-Point-and-Line (Pn$PL) aims at fast, accurate and robust camera localizations with respect to a 3D model from 2D-3D feature coordinates.
arXiv Detail & Related papers (2021-07-08T15:19:36Z) - Displacement-Invariant Cost Computation for Efficient Stereo Matching [122.94051630000934]
Deep learning methods have dominated stereo matching leaderboards by yielding unprecedented disparity accuracy.
But their inference time is typically slow, on the order of seconds for a pair of 540p images.
We propose a emphdisplacement-invariant cost module to compute the matching costs without needing a 4D feature volume.
arXiv Detail & Related papers (2020-12-01T23:58:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.