Handling Object Symmetries in CNN-based Pose Estimation
- URL: http://arxiv.org/abs/2011.13209v2
- Date: Tue, 27 Apr 2021 07:24:24 GMT
- Title: Handling Object Symmetries in CNN-based Pose Estimation
- Authors: Jesse Richter-Klug and Udo Frese
- Abstract summary: We investigate the problems that Convolutional Neural Networks (CNN)-based pose estimators have with symmetric objects.
In particular, we find that the popular min-over-symmetries approach for creating a symmetry-aware loss tends not to work well with gradient-based optimization.
We propose a representation called "closed symmetry loop" (csl) from these insights, where the angle of relevant vectors is multiplied by the symmetry order and then generalize it to 6-DOF.
- Score: 7.1577508803778045
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we investigate the problems that Convolutional Neural Networks
(CNN)-based pose estimators have with symmetric objects. We considered the
value of the CNN's output representation when continuously rotating the object
and found that it has to form a closed loop after each step of symmetry.
Otherwise, the CNN (which is itself a continuous function) has to replicate an
uncontinuous function. On a 1-DOF toy example we show that commonly used
representations do not fulfill this demand and analyze the problems caused
thereby. In particular, we find that the popular min-over-symmetries approach
for creating a symmetry-aware loss tends not to work well with gradient-based
optimization, i.e. deep learning.
We propose a representation called "closed symmetry loop" (csl) from these
insights, where the angle of relevant vectors is multiplied by the symmetry
order and then generalize it to 6-DOF. The representation extends our algorithm
from [Richter-Klug, ICVS, 2019] including a method to disambiguate symmetric
equivalents during the final pose estimation. The algorithm handles continuous
rotational symmetry (e.g. a bottle) and discrete rotational symmetry (e.g. a
4-fold symmetric box). It is evaluated on the T-LESS dataset, where it reaches
state-of-the-art for unrefining RGB-based methods.
Related papers
- Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose Estimation [5.821462441570274]
We propose SymCode, a symmetry-aware surface that encodes the object surface based on one-to-many correspondences.
We also introduce SymNet, a fast end-to-end network that directly regresses an object's 6D pose parameters without solving a problem.
arXiv Detail & Related papers (2024-05-17T05:48:56Z) - Equivariant Symmetry Breaking Sets [0.6475999521931204]
Equivariant neural networks (ENNs) have been shown to be extremely effective in applications involving underlying symmetries.
We propose a novel symmetry breaking framework that is fully equivariant and is the first which fully addresses spontaneous symmetry breaking.
arXiv Detail & Related papers (2024-02-05T02:35:11Z) - Oracle-Preserving Latent Flows [58.720142291102135]
We develop a methodology for the simultaneous discovery of multiple nontrivial continuous symmetries across an entire labelled dataset.
The symmetry transformations and the corresponding generators are modeled with fully connected neural networks trained with a specially constructed loss function.
The two new elements in this work are the use of a reduced-dimensionality latent space and the generalization to transformations invariant with respect to high-dimensional oracles.
arXiv Detail & Related papers (2023-02-02T00:13:32Z) - Deep Learning Symmetries and Their Lie Groups, Algebras, and Subalgebras
from First Principles [55.41644538483948]
We design a deep-learning algorithm for the discovery and identification of the continuous group of symmetries present in a labeled dataset.
We use fully connected neural networks to model the transformations symmetry and the corresponding generators.
Our study also opens the door for using a machine learning approach in the mathematical study of Lie groups and their properties.
arXiv Detail & Related papers (2023-01-13T16:25:25Z) - Breaking the Symmetry: Resolving Symmetry Ambiguities in Equivariant
Neural Networks [4.147346416230272]
We present OAVNN: Orientation Aware Vector Neuron Network, an extension of the Vector Neuron Network.
OAVNN is a rotation equivariant network that is robust to planar symmetric inputs.
arXiv Detail & Related papers (2022-10-29T16:28:59Z) - When Random Tensors meet Random Matrices [50.568841545067144]
This paper studies asymmetric order-$d$ spiked tensor models with Gaussian noise.
We show that the analysis of the considered model boils down to the analysis of an equivalent spiked symmetric textitblock-wise random matrix.
arXiv Detail & Related papers (2021-12-23T04:05:01Z) - Robust Extrinsic Symmetry Estimation in 3D Point Clouds [4.416484585765027]
Detecting the reflection symmetry plane of an object represented by a 3D point cloud is a fundamental problem in 3D computer vision and geometry processing.
We propose a statistical estimator-based approach for the plane of reflection symmetry that is robust to outliers and missing parts.
arXiv Detail & Related papers (2021-09-21T03:09:51Z) - Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix
Factorization [49.090785356633695]
We study the asymmetric low-rank factorization problem: [mathbfU in mathbbRm min d, mathbfU$ and mathV$.
arXiv Detail & Related papers (2021-06-27T17:25:24Z) - Detecting Symmetries with Neural Networks [0.0]
We make extensive use of the structure in the embedding layer of the neural network.
We identify whether a symmetry is present and to identify orbits of the symmetry in the input.
For this example we present a novel data representation in terms of graphs.
arXiv Detail & Related papers (2020-03-30T17:58:24Z) - Semiparametric Nonlinear Bipartite Graph Representation Learning with
Provable Guarantees [106.91654068632882]
We consider the bipartite graph and formalize its representation learning problem as a statistical estimation problem of parameters in a semiparametric exponential family distribution.
We show that the proposed objective is strongly convex in a neighborhood around the ground truth, so that a gradient descent-based method achieves linear convergence rate.
Our estimator is robust to any model misspecification within the exponential family, which is validated in extensive experiments.
arXiv Detail & Related papers (2020-03-02T16:40:36Z) - Robust 6D Object Pose Estimation by Learning RGB-D Features [59.580366107770764]
We propose a novel discrete-continuous formulation for rotation regression to resolve this local-optimum problem.
We uniformly sample rotation anchors in SO(3), and predict a constrained deviation from each anchor to the target, as well as uncertainty scores for selecting the best prediction.
Experiments on two benchmarks: LINEMOD and YCB-Video, show that the proposed method outperforms state-of-the-art approaches.
arXiv Detail & Related papers (2020-02-29T06:24:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.