S2P3: Self-Supervised Polarimetric Pose Prediction
- URL: http://arxiv.org/abs/2312.01105v1
- Date: Sat, 2 Dec 2023 10:46:40 GMT
- Title: S2P3: Self-Supervised Polarimetric Pose Prediction
- Authors: Patrick Ruhkamp, Daoyi Gao, Nassir Navab, Benjamin Busam
- Abstract summary: This paper proposes the first self-supervised 6D object pose prediction from multimodal RGB+polarimetric images.
The novel training paradigm comprises 1) a physical model to extract geometric information of polarized light, 2) a teacher-student knowledge distillation scheme and 3) a self-supervised loss formulation through differentiable constraints.
- Score: 55.43547228561919
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper proposes the first self-supervised 6D object pose prediction from
multimodal RGB+polarimetric images. The novel training paradigm comprises 1) a
physical model to extract geometric information of polarized light, 2) a
teacher-student knowledge distillation scheme and 3) a self-supervised loss
formulation through differentiable rendering and an invertible physical
constraint. Both networks leverage the physical properties of polarized light
to learn robust geometric representations by encoding shape priors and
polarization characteristics derived from our physical model. Geometric
pseudo-labels from the teacher support the student network without the need for
annotated real data. Dense appearance and geometric information of objects are
obtained through a differentiable renderer with the predicted pose for
self-supervised direct coupling. The student network additionally features our
proposed invertible formulation of the physical shape priors that enables
end-to-end self-supervised training through physical constraints of derived
polarization characteristics compared against polarimetric input images. We
specifically focus on photometrically challenging objects with texture-less or
reflective surfaces and transparent materials for which the most prominent
performance gain is reported.
Related papers
- A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems [87.30652640973317]
Recent advances in computational modelling of atomic systems represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space.
Geometric Graph Neural Networks have emerged as the preferred machine learning architecture powering applications ranging from protein structure prediction to molecular simulations and material generation.
This paper provides a comprehensive and self-contained overview of the field of Geometric GNNs for 3D atomic systems.
arXiv Detail & Related papers (2023-12-12T18:44:19Z) - Polarimetric Information for Multi-Modal 6D Pose Estimation of
Photometrically Challenging Objects with Limited Data [51.95347650131366]
6D pose estimation pipelines that rely on RGB-only or RGB-D data show limitations for photometrically challenging objects.
A supervised learning-based method utilising complementary polarisation information is proposed to overcome such limitations.
arXiv Detail & Related papers (2023-08-21T10:56:00Z) - Transparent Shape from a Single View Polarization Image [6.18278691318801]
This paper presents a learning-based method for transparent surface estimation from a single view polarization image.
Existing shape from polarization(SfP) methods have the difficulty in estimating transparent shape since the inherent transmission interference heavily reduces the reliability of physics-based prior.
arXiv Detail & Related papers (2022-04-13T12:24:32Z) - Self-Supervised Image Representation Learning with Geometric Set
Consistency [50.12720780102395]
We propose a method for self-supervised image representation learning under the guidance of 3D geometric consistency.
Specifically, we introduce 3D geometric consistency into a contrastive learning framework to enforce the feature consistency within image views.
arXiv Detail & Related papers (2022-03-29T08:57:33Z) - {\phi}-SfT: Shape-from-Template with a Physics-Based Deformation Model [69.27632025495512]
Shape-from-Template (SfT) methods estimate 3D surface deformations from a single monocular RGB camera.
This paper proposes a new SfT approach explaining 2D observations through physical simulations.
arXiv Detail & Related papers (2022-03-22T17:59:57Z) - Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel
Space [43.654464513994164]
We present a method for learning causal relationships in high-dimensional data (images, videos)
Our method does not require the knowledge or supervision of any ground truth positions or other object or scene properties.
We introduce a new challenging and carefully designed counterfactual benchmark for predictions in pixel space.
arXiv Detail & Related papers (2022-02-01T12:18:30Z) - Polarimetric Pose Prediction [42.47531308682873]
Colour-band separated wavelength and intensity are arguably the most commonly used ones for monocular 6D object pose estimation.
This paper explores how complementary polarisation information, i.e. the orientation of light wave oscillations, can influence the accuracy of pose predictions.
arXiv Detail & Related papers (2021-12-07T16:38:10Z) - SIDER: Single-Image Neural Optimization for Facial Geometric Detail
Recovery [54.64663713249079]
SIDER is a novel photometric optimization method that recovers detailed facial geometry from a single image in an unsupervised manner.
In contrast to prior work, SIDER does not rely on any dataset priors and does not require additional supervision from multiple views, lighting changes or ground truth 3D shape.
arXiv Detail & Related papers (2021-08-11T22:34:53Z) - 3D Human Shape Reconstruction from a Polarization Image [34.240256720930155]
This paper tackles the problem of estimating 3D body shape of clothed humans from single polarized 2D images.
A dedicated two-stage deep learning approach, SfP, is proposed.
arXiv Detail & Related papers (2020-07-17T22:36:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.