Related papers: Towards Rotation-only Imaging Geometry: Rotation Estimation

Towards Rotation-only Imaging Geometry: Rotation Estimation

URL: http://arxiv.org/abs/2511.12415v1
Date: Sun, 16 Nov 2025 02:04:32 GMT
Title: Towards Rotation-only Imaging Geometry: Rotation Estimation
Authors: Xinrui Li, Qi Cai, Yuanxin Wu,
Abstract summary: Structure from Motion (SfM) is a critical task in computer vision, aiming to recover the 3D scene structure and camera motion from a sequence of 2D images.<n>Recent pose-only imaging geometry decouples 3D coordinates from camera poses and demonstrates significantly better SfM performance through pose adjustment.
Score: 11.806182001858454
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Structure from Motion (SfM) is a critical task in computer vision, aiming to recover the 3D scene structure and camera motion from a sequence of 2D images. The recent pose-only imaging geometry decouples 3D coordinates from camera poses and demonstrates significantly better SfM performance through pose adjustment. Continuing the pose-only perspective, this paper explores the critical relationship between the scene structures, rotation and translation. Notably, the translation can be expressed in terms of rotation, allowing us to condense the imaging geometry representation onto the rotation manifold. A rotation-only optimization framework based on reprojection error is proposed for both two-view and multi-view scenarios. The experiment results demonstrate superior accuracy and robustness performance over the current state-of-the-art rotation estimation methods, even comparable to multiple bundle adjustment iteration results. Hopefully, this work contributes to even more accurate, efficient and reliable 3D visual computing.

Related papers

Gaussian Alignment for Relative Camera Pose Estimation via Single-View Reconstruction [18.936573991468926]
GARPS is a training-free framework that casts this problem as the direct alignment of two independently reconstructed 3D scenes.<n>It refines an initial pose from a feed-forward two-view pose estimator by optimising a differentiable GMM alignment objective.<n>Experiments on the Real-Estate10K dataset demonstrate that GARPS outperforms both classical and state-of-the-art learning-based methods.
arXiv Detail & Related papers (2025-09-17T02:57:34Z)
Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds [34.38496869014632]
Surf3R is an end-to-end feedforward approach that reconstructs 3D surfaces from sparse views without estimating camera poses.<n>Our method employs a multi-branch and multi-view decoding architecture in which multiple reference views jointly guide the reconstruction process.
arXiv Detail & Related papers (2025-08-06T14:53:42Z)
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection [48.11373832295736]
This paper focuses on rotation symmetry, where objects remain unchanged when rotated around a central axis.<n>Traditional methods relied on hand-crafted feature matching, while recent segmentation models based on convolutional neural networks detect rotation centers but struggle with 3D geometric consistency.<n>We propose a model that directly predicts rotation centers and vertices in 3D space and projects the results back to 2D while preserving structural integrity.
arXiv Detail & Related papers (2025-03-26T05:02:16Z)
Camera Movement Estimation and Path Correction using the Combination of Modified A-SIFT and Stereo System for 3D Modelling [1.6574413179773757]
Efficient camera path generation can help resolve issues in creating accurate and efficient 3D models.<n>A modified version of the Affine Scale-Invariant Feature Transform (ASIFT) is proposed to extract more matching points with reduced computational overhead.<n>A novel two-camera-based rotation correction model is introduced to mitigate small rotational errors.<n>A stereo camera-based translation estimation and correction model is implemented to determine camera movement in 3D space.
arXiv Detail & Related papers (2025-03-22T06:37:54Z)
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views [100.45129752375658]
We present FLARE, a feed-forward model designed to infer high-quality camera poses and 3D geometry from uncalibrated sparse-view images.<n>Our solution features a cascaded learning paradigm with camera pose serving as the critical bridge, recognizing its essential role in mapping 3D structures onto 2D image planes.
arXiv Detail & Related papers (2025-02-17T18:54:05Z)
GeoGS3D: Single-view 3D Reconstruction via Geometric-aware Diffusion Model and Gaussian Splatting [81.03553265684184]
We introduce GeoGS3D, a framework for reconstructing detailed 3D objects from single-view images. We propose a novel metric, Gaussian Divergence Significance (GDS), to prune unnecessary operations during optimization. Experiments demonstrate that GeoGS3D generates images with high consistency across views and reconstructs high-quality 3D objects.
arXiv Detail & Related papers (2024-03-15T12:24:36Z)
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models [67.96827539201071]
We propose a novel test-time optimization approach for 3D scene reconstruction. Our method achieves state-of-the-art cross-dataset reconstruction on five zero-shot testing datasets.
arXiv Detail & Related papers (2023-08-10T17:55:02Z)
Towards Scalable Multi-View Reconstruction of Geometry and Materials [27.660389147094715]
We propose a novel method for joint recovery of camera pose, object geometry and spatially-varying Bidirectional Reflectance Distribution Function (svBRDF) of 3D scenes. The input are high-resolution RGBD images captured by a mobile, hand-held capture system with point lights for active illumination.
arXiv Detail & Related papers (2023-06-06T15:07:39Z)
RelPose++: Recovering 6D Poses from Sparse-view Observations [66.6922660401558]
We address the task of estimating 6D camera poses from sparse-view image sets (2-8 images) We build on the recent RelPose framework which learns a network that infers distributions over relative rotations over image pairs. Our final system results in large improvements in 6D pose prediction over prior art on both seen and unseen object categories.
arXiv Detail & Related papers (2023-05-08T17:59:58Z)
Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation [57.11299763566534]
We present a solution to recover 3D pose from multi-view images captured with spatially calibrated cameras. We exploit 3D geometry to fuse input images into a unified latent representation of pose, which is disentangled from camera view-points. Our architecture then conditions the learned representation on camera projection operators to produce accurate per-view 2d detections.
arXiv Detail & Related papers (2020-04-05T12:52:29Z)

This list is automatically generated from the titles and abstracts of the papers in this site.