SAMP: Shape and Motion Priors for 4D Vehicle Reconstruction
- URL: http://arxiv.org/abs/2005.00922v1
- Date: Sat, 2 May 2020 21:23:54 GMT
- Title: SAMP: Shape and Motion Priors for 4D Vehicle Reconstruction
- Authors: Francis Engelmann, J\"org St\"uckler, Bastian Leibe
- Abstract summary: We represent shapes by 3D signed distance functions and embed them in a low-dimensional manifold.
We employ a motion model to regularize the trajectory to plausible object motions.
We show state-of-the-art results in terms of shape reconstruction and pose estimation accuracy.
- Score: 25.408575404593176
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Inferring the pose and shape of vehicles in 3D from a movable platform still
remains a challenging task due to the projective sensing principle of cameras,
difficult surface properties e.g. reflections or transparency, and illumination
changes between images. In this paper, we propose to use 3D shape and motion
priors to regularize the estimation of the trajectory and the shape of vehicles
in sequences of stereo images. We represent shapes by 3D signed distance
functions and embed them in a low-dimensional manifold. Our optimization method
allows for imposing a common shape across all image observations along an
object track. We employ a motion model to regularize the trajectory to
plausible object motions. We evaluate our method on the KITTI dataset and show
state-of-the-art results in terms of shape reconstruction and pose estimation
accuracy.
Related papers
- Towards 3D Vision with Low-Cost Single-Photon Cameras [24.711165102559438]
We present a method for reconstructing 3D shape of arbitrary Lambertian objects based on measurements by miniature, energy-efficient, low-cost single-photon cameras.
Our work draws a connection between image-based modeling and active range scanning and is a step towards 3D vision with single-photon cameras.
arXiv Detail & Related papers (2024-03-26T15:40:05Z) - Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images [82.32776379815712]
We study the problem of shape generation in 3D mesh representation from a small number of color images with or without camera poses.
We adopt to further improve the shape quality by leveraging cross-view information with a graph convolution network.
Our model is robust to the quality of the initial mesh and the error of camera pose, and can be combined with a differentiable function for test-time optimization.
arXiv Detail & Related papers (2022-04-21T03:42:31Z) - Learning Canonical 3D Object Representation for Fine-Grained Recognition [77.33501114409036]
We propose a novel framework for fine-grained object recognition that learns to recover object variation in 3D space from a single image.
We represent an object as a composition of 3D shape and its appearance, while eliminating the effect of camera viewpoint.
By incorporating 3D shape and appearance jointly in a deep representation, our method learns the discriminative representation of the object.
arXiv Detail & Related papers (2021-08-10T12:19:34Z) - Object Wake-up: 3-D Object Reconstruction, Animation, and in-situ
Rendering from a Single Image [58.69732754597448]
Given a picture of a chair, could we extract the 3-D shape of the chair, animate its plausible articulations and motions, and render in-situ in its original image space?
We devise an automated approach to extract and manipulate articulated objects in single images.
arXiv Detail & Related papers (2021-08-05T16:20:12Z) - Pose Estimation and 3D Reconstruction of Vehicles from Stereo-Images
Using a Subcategory-Aware Shape Prior [0.0]
3D reconstruction of objects or a computer vision is a prerequisite for many applications such as mobile robotics autonomous driving.
The goal of this paper is to show how 3D object reconstruction can profit from prior shape observations.
arXiv Detail & Related papers (2021-07-22T19:47:49Z) - Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving
Objects [115.71874459429381]
We address the novel task of jointly reconstructing the 3D shape, texture, and motion of an object from a single motion-blurred image.
While previous approaches address the deblurring problem only in the 2D image domain, our proposed rigorous modeling of all object properties in the 3D domain enables the correct description of arbitrary object motion.
arXiv Detail & Related papers (2021-06-16T13:18:08Z) - From Points to Multi-Object 3D Reconstruction [71.17445805257196]
We propose a method to detect and reconstruct multiple 3D objects from a single RGB image.
A keypoint detector localizes objects as center points and directly predicts all object properties, including 9-DoF bounding boxes and 3D shapes.
The presented approach performs lightweight reconstruction in a single-stage, it is real-time capable, fully differentiable and end-to-end trainable.
arXiv Detail & Related papers (2020-12-21T18:52:21Z) - Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using
Deep Shape Priors [69.02332607843569]
PriSMONet is a novel approach for learning Multi-Object 3D scene decomposition and representations from single images.
A recurrent encoder regresses a latent representation of 3D shape, pose and texture of each object from an input RGB image.
We evaluate the accuracy of our model in inferring 3D scene layout, demonstrate its generative capabilities, assess its generalization to real images, and point out benefits of the learned representation.
arXiv Detail & Related papers (2020-10-08T14:49:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.