Category-Level Articulated Object Pose Estimation
- URL: http://arxiv.org/abs/1912.11913v2
- Date: Wed, 8 Apr 2020 19:46:04 GMT
- Title: Category-Level Articulated Object Pose Estimation
- Authors: Xiaolong Li, He Wang, Li Yi, Leonidas Guibas, A. Lynn Abbott, Shuran
Song
- Abstract summary: We introduce Articulation-aware Normalized Coordinate Space Hierarchy (ANCSH)
ANCSH is a canonical representation for different articulated objects in a given category.
We develop a deep network based on PointNet++ that predicts ANCSH from a single depth point cloud.
- Score: 34.57672805595464
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This project addresses the task of category-level pose estimation for
articulated objects from a single depth image. We present a novel
category-level approach that correctly accommodates object instances previously
unseen during training. We introduce Articulation-aware Normalized Coordinate
Space Hierarchy (ANCSH) - a canonical representation for different articulated
objects in a given category. As the key to achieve intra-category
generalization, the representation constructs a canonical object space as well
as a set of canonical part spaces. The canonical object space normalizes the
object orientation,scales and articulations (e.g. joint parameters and states)
while each canonical part space further normalizes its part pose and scale. We
develop a deep network based on PointNet++ that predicts ANCSH from a single
depth point cloud, including part segmentation, normalized coordinates, and
joint parameters in the canonical object space. By leveraging the canonicalized
joints, we demonstrate: 1) improved performance in part pose and scale
estimations using the induced kinematic constraints from joints; 2) high
accuracy for joint parameter estimation in camera space.
Related papers
- SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation [79.12683101131368]
Category-level object pose estimation, aiming to predict the 6D pose and 3D size of objects from known categories, typically struggles with large intra-class shape variation.
We present SecondPose, a novel approach integrating object-specific geometric features with semantic category priors from DINOv2.
arXiv Detail & Related papers (2023-11-18T17:14:07Z) - SOCS: Semantically-aware Object Coordinate Space for Category-Level 6D
Object Pose Estimation under Large Shape Variations [12.348551686086255]
Most learning-based approaches to category-level 6D pose estimation are design around normalized object coordinate space (NOCS)
We propose Semantically-aware Object Coordinate Space (SOCS) built by warping-and-aligning the objects guided by a sparse set of keypoints with semantically meaningful correspondence.
arXiv Detail & Related papers (2023-03-18T06:34:16Z) - Self-Supervised Category-Level Articulated Object Pose Estimation with
Part-Level SE(3) Equivariance [33.10167928198986]
Category-level articulated object pose estimation aims to estimate a hierarchy of articulation-aware object poses of an unseen articulated object from a known category.
We present a novel self-supervised strategy that solves this problem without any human labels.
arXiv Detail & Related papers (2023-02-28T03:02:11Z) - Generative Category-Level Shape and Pose Estimation with Semantic
Primitives [27.692997522812615]
We propose a novel framework for category-level object shape and pose estimation from a single RGB-D image.
To handle the intra-category variation, we adopt a semantic primitive representation that encodes diverse shapes into a unified latent space.
We show that the proposed method achieves SOTA pose estimation performance and better generalization in the real-world dataset.
arXiv Detail & Related papers (2022-10-03T17:51:54Z) - Pose for Everything: Towards Category-Agnostic Pose Estimation [93.07415325374761]
Category-Agnostic Pose Estimation (CAPE) aims to create a pose estimation model capable of detecting the pose of any class of object given only a few samples with keypoint definition.
A transformer-based Keypoint Interaction Module (KIM) is proposed to capture both the interactions among different keypoints and the relationship between the support and query images.
We also introduce Multi-category Pose (MP-100) dataset, which is a 2D pose dataset of 100 object categories containing over 20K instances and is well-designed for developing CAPE algorithms.
arXiv Detail & Related papers (2022-07-21T09:40:54Z) - On Hyperbolic Embeddings in 2D Object Detection [76.12912000278322]
We study whether a hyperbolic geometry better matches the underlying structure of the object classification space.
We incorporate a hyperbolic classifier in two-stage, keypoint-based, and transformer-based object detection architectures.
We observe categorical class hierarchies emerging in the structure of the classification space, resulting in lower classification errors and boosting the overall object detection performance.
arXiv Detail & Related papers (2022-03-15T16:43:40Z) - Spatial Commonsense Graph for Object Localisation in Partial Scenes [36.47035776975184]
We solve object localisation in partial scenes, a new problem of estimating the unknown position of an object given a partial 3D scan of a scene.
The proposed solution is based on a novel scene graph model, the Spatial Commonsense Graph (SCG), where objects are the nodes and edges define pairwise distances between them.
The SCG is used to estimate the unknown position of the target object in two steps: first, we feed the SCG into a novel Proximity Prediction Network, a graph neural network that uses attention to perform distance prediction between the node representing the target object and the nodes representing the observed objects in the
arXiv Detail & Related papers (2022-03-10T14:13:35Z) - Towards Self-Supervised Category-Level Object Pose and Size Estimation [121.28537953301951]
This work presents a self-supervised framework for category-level object pose and size estimation from a single depth image.
We leverage the geometric consistency residing in point clouds of the same shape for self-supervision.
arXiv Detail & Related papers (2022-03-06T06:02:30Z) - ELLIPSDF: Joint Object Pose and Shape Optimization with a Bi-level
Ellipsoid and Signed Distance Function Description [9.734266860544663]
This paper proposes an expressive yet compact model for joint object pose and shape optimization.
It infers an object-level map from multi-view RGB-D camera observations.
Our approach is evaluated on the large-scale real-world ScanNet dataset and compared against state-of-the-art methods.
arXiv Detail & Related papers (2021-08-01T03:07:31Z) - 3D Object Classification on Partial Point Clouds: A Practical
Perspective [91.81377258830703]
A point cloud is a popular shape representation adopted in 3D object classification.
This paper introduces a practical setting to classify partial point clouds of object instances under any poses.
A novel algorithm in an alignment-classification manner is proposed in this paper.
arXiv Detail & Related papers (2020-12-18T04:00:56Z) - Point-Set Anchors for Object Detection, Instance Segmentation and Pose
Estimation [85.96410825961966]
We argue that the image features extracted at a central point contain limited information for predicting distant keypoints or bounding box boundaries.
To facilitate inference, we propose to instead perform regression from a set of points placed at more advantageous positions.
We apply this proposed framework, called Point-Set Anchors, to object detection, instance segmentation, and human pose estimation.
arXiv Detail & Related papers (2020-07-06T15:59:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.