OMAD: Object Model with Articulated Deformations for Pose Estimation and
Retrieval
- URL: http://arxiv.org/abs/2112.07334v1
- Date: Tue, 14 Dec 2021 12:45:49 GMT
- Title: OMAD: Object Model with Articulated Deformations for Pose Estimation and
Retrieval
- Authors: Han Xue, Liu Liu, Wenqiang Xu, Haoyuan Fu, Cewu Lu
- Abstract summary: We present a category-specific representation called Object Model with Articulated Deformations (OMAD) to explicitly model the articulated objects.
With the full representation of the object shape and joint states, we can address several tasks including category-level object pose estimation and the articulated object retrieval.
- Score: 46.813224754603866
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Articulated objects are pervasive in daily life. However, due to the
intrinsic high-DoF structure, the joint states of the articulated objects are
hard to be estimated. To model articulated objects, two kinds of shape
deformations namely the geometric and the pose deformation should be
considered. In this work, we present a novel category-specific parametric
representation called Object Model with Articulated Deformations (OMAD) to
explicitly model the articulated objects. In OMAD, a category is associated
with a linear shape function with shared shape basis and a non-linear joint
function. Both functions can be learned from a large-scale object model dataset
and fixed as category-specific priors. Then we propose an OMADNet to predict
the shape parameters and the joint states from an object's single observation.
With the full representation of the object shape and joint states, we can
address several tasks including category-level object pose estimation and the
articulated object retrieval. To evaluate these tasks, we create a synthetic
dataset based on PartNet-Mobility. Extensive experiments show that our simple
OMADNet can serve as a strong baseline for both tasks.
Related papers
- Interior Object Geometry via Fitted Frames [18.564031163436553]
We describe a representation targeted for anatomic objects which is designed to enable strong locational correspondence within object populations.
The method generates fitted frames on the boundary and in the interior of objects and produces alignment-free geometric features from them.
arXiv Detail & Related papers (2024-07-19T14:38:47Z) - DTF-Net: Category-Level Pose Estimation and Shape Reconstruction via
Deformable Template Field [29.42222066097076]
Estimating 6D poses and reconstructing 3D shapes of objects in open-world scenes from RGB-depth image pairs is challenging.
We propose the DTF-Net, a novel framework for pose estimation and shape reconstruction based on implicit neural fields of object categories.
arXiv Detail & Related papers (2023-08-04T10:35:40Z) - Category-level Shape Estimation for Densely Cluttered Objects [94.64287790278887]
We propose a category-level shape estimation method for densely cluttered objects.
Our framework partitions each object in the clutter via the multi-view visual information fusion.
Experiments in the simulated environment and real world show that our method achieves high shape estimation accuracy.
arXiv Detail & Related papers (2023-02-23T13:00:17Z) - Generative Category-Level Shape and Pose Estimation with Semantic
Primitives [27.692997522812615]
We propose a novel framework for category-level object shape and pose estimation from a single RGB-D image.
To handle the intra-category variation, we adopt a semantic primitive representation that encodes diverse shapes into a unified latent space.
We show that the proposed method achieves SOTA pose estimation performance and better generalization in the real-world dataset.
arXiv Detail & Related papers (2022-10-03T17:51:54Z) - Localization and Mapping using Instance-specific Mesh Models [12.235379548921061]
This paper focuses on building semantic maps, containing object poses and shapes, using a monocular camera.
Our contribution is an instance-specific mesh model of object shape that can be optimized online based on semantic information extracted from camera images.
arXiv Detail & Related papers (2021-03-08T00:24:23Z) - 3D Object Classification on Partial Point Clouds: A Practical
Perspective [91.81377258830703]
A point cloud is a popular shape representation adopted in 3D object classification.
This paper introduces a practical setting to classify partial point clouds of object instances under any poses.
A novel algorithm in an alignment-classification manner is proposed in this paper.
arXiv Detail & Related papers (2020-12-18T04:00:56Z) - Continuous Surface Embeddings [76.86259029442624]
We focus on the task of learning and representing dense correspondences in deformable object categories.
We propose a new, learnable image-based representation of dense correspondences.
We demonstrate that the proposed approach performs on par or better than the state-of-the-art methods for dense pose estimation for humans.
arXiv Detail & Related papers (2020-11-24T22:52:15Z) - Category Level Object Pose Estimation via Neural Analysis-by-Synthesis [64.14028598360741]
In this paper we combine a gradient-based fitting procedure with a parametric neural image synthesis module.
The image synthesis network is designed to efficiently span the pose configuration space.
We experimentally show that the method can recover orientation of objects with high accuracy from 2D images alone.
arXiv Detail & Related papers (2020-08-18T20:30:47Z) - Shape Prior Deformation for Categorical 6D Object Pose and Size
Estimation [62.618227434286]
We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image.
We propose a deep network to reconstruct the 3D object model by explicitly modeling the deformation from a pre-learned categorical shape prior.
arXiv Detail & Related papers (2020-07-16T16:45:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.