Deep Projective Rotation Estimation through Relative Supervision
- URL: http://arxiv.org/abs/2211.11182v1
- Date: Mon, 21 Nov 2022 04:58:07 GMT
- Title: Deep Projective Rotation Estimation through Relative Supervision
- Authors: Brian Okorn, Chuer Pan, Martial Hebert, David Held
- Abstract summary: Deep learning has offered a way to develop image-based orientation estimators.
These estimators often require training on a large labeled dataset.
We propose a new algorithm for selfsupervised orientation estimation.
- Score: 31.05330535795121
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Orientation estimation is the core to a variety of vision and robotics tasks
such as camera and object pose estimation. Deep learning has offered a way to
develop image-based orientation estimators; however, such estimators often
require training on a large labeled dataset, which can be time-intensive to
collect. In this work, we explore whether self-supervised learning from
unlabeled data can be used to alleviate this issue. Specifically, we assume
access to estimates of the relative orientation between neighboring poses, such
that can be obtained via a local alignment method. While self-supervised
learning has been used successfully for translational object keypoints, in this
work, we show that naively applying relative supervision to the rotational
group $SO(3)$ will often fail to converge due to the non-convexity of the
rotational space. To tackle this challenge, we propose a new algorithm for
self-supervised orientation estimation which utilizes Modified Rodrigues
Parameters to stereographically project the closed manifold of $SO(3)$ to the
open manifold of $\mathbb{R}^{3}$, allowing the optimization to be done in an
open Euclidean space. We empirically validate the benefits of the proposed
algorithm for rotational averaging problem in two settings: (1) direct
optimization on rotation parameters, and (2) optimization of parameters of a
convolutional neural network that predicts object orientations from images. In
both settings, we demonstrate that our proposed algorithm is able to converge
to a consistent relative orientation frame much faster than algorithms that
purely operate in the $SO(3)$ space. Additional information can be found at
https://sites.google.com/view/deep-projective-rotation/home .
Related papers
- An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation [10.05584976985694]
3D-2D constraints provided by line features have been widely used in Visual Odometry (VO) and Structure-from-Motion (SfM) systems.
We propose a novel three-view pose solver based on rotation-translation decoupled estimation.
arXiv Detail & Related papers (2024-03-18T10:21:05Z) - Fixation-based Self-calibration for Eye Tracking in VR Headsets [0.21561701531034413]
The proposed method is based on the assumptions that the user's viewpoint can freely move.
fixations are first detected from the time-series data of uncalibrated gaze directions.
The calibration parameters are optimized by minimizing the sum of a dispersion metrics of the PoRs.
arXiv Detail & Related papers (2023-11-01T09:34:15Z) - Neural Gradient Learning and Optimization for Oriented Point Normal
Estimation [53.611206368815125]
We propose a deep learning approach to learn gradient vectors with consistent orientation from 3D point clouds for normal estimation.
We learn an angular distance field based on local plane geometry to refine the coarse gradient vectors.
Our method efficiently conducts global gradient approximation while achieving better accuracy and ability generalization of local feature description.
arXiv Detail & Related papers (2023-09-17T08:35:11Z) - Vanishing Point Estimation in Uncalibrated Images with Prior Gravity
Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame.
We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers.
We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z) - The Probabilistic Normal Epipolar Constraint for Frame-To-Frame Rotation
Optimization under Uncertain Feature Positions [53.478856119297284]
We introduce the probabilistic normal epipolar constraint (PNEC) that overcomes the limitation by accounting for anisotropic and inhomogeneous uncertainties in the feature positions.
In experiments on synthetic data, we demonstrate that the novel PNEC yields more accurate rotation estimates than the original NEC.
We integrate the proposed method into a state-of-the-art monocular rotation-only odometry system and achieve consistently improved results for the real-world KITTI dataset.
arXiv Detail & Related papers (2022-04-05T14:47:11Z) - Extreme Rotation Estimation using Dense Correlation Volumes [73.35119461422153]
We present a technique for estimating the relative 3D rotation of an RGB image pair in an extreme setting.
We observe that, even when images do not overlap, there may be rich hidden cues as to their geometric relationship.
We propose a network design that can automatically learn such implicit cues by comparing all pairs of points between the two input images.
arXiv Detail & Related papers (2021-04-28T02:00:04Z) - Self-supervised Geometric Perception [96.89966337518854]
Self-supervised geometric perception is a framework to learn a feature descriptor for correspondence matching without any ground-truth geometric model labels.
We show that SGP achieves state-of-the-art performance that is on-par or superior to the supervised oracles trained using ground-truth labels.
arXiv Detail & Related papers (2021-03-04T15:34:43Z) - Amplifying the Anterior-Posterior Difference via Data Enhancement -- A
More Robust Deep Monocular Orientation Estimation Solution [7.540176446791261]
Existing deep-learning based monocular orientation estimation algorithms face the problem of confusion between the anterior and posterior parts of the objects.
We propose a pretraining method which focuses on predicting the left/right semicircle in which the orientation of the object is located.
Experiment results show that the proposed semicircle prediction enhances the accuracy of orientation estimation, and mitigates the problem stated above.
arXiv Detail & Related papers (2020-12-21T15:36:13Z) - DiSCO: Differentiable Scan Context with Orientation [13.797651328615347]
We propose a LiDAR-based place recognition method, named Differentiable Scan Context with Orientation (DiSCO)
It simultaneously finds the scan at a similar place and estimates their relative orientation.
DiSCO is validated on three datasets with long-term outdoor conditions.
arXiv Detail & Related papers (2020-10-21T12:38:21Z) - Robust 6D Object Pose Estimation by Learning RGB-D Features [59.580366107770764]
We propose a novel discrete-continuous formulation for rotation regression to resolve this local-optimum problem.
We uniformly sample rotation anchors in SO(3), and predict a constrained deviation from each anchor to the target, as well as uncertainty scores for selecting the best prediction.
Experiments on two benchmarks: LINEMOD and YCB-Video, show that the proposed method outperforms state-of-the-art approaches.
arXiv Detail & Related papers (2020-02-29T06:24:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.