A Dynamic Keypoints Selection Network for 6DoF Pose Estimation
- URL: http://arxiv.org/abs/2110.12401v1
- Date: Sun, 24 Oct 2021 09:58:56 GMT
- Title: A Dynamic Keypoints Selection Network for 6DoF Pose Estimation
- Authors: Haowen Sun, Taiyong Wang
- Abstract summary: 6 DoF poses estimation problem aims to estimate the rotation and translation parameters between two coordinates.
We present a novel deep neural network based on dynamic keypoints selection designed for 6DoF pose estimation from a single RGBD image.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: 6 DoF poses estimation problem aims to estimate the rotation and translation
parameters between two coordinates, such as object world coordinate and camera
world coordinate. Although some advances are made with the help of deep
learning, how to full use scene information is still a problem. Prior works
tackle the problem by pixel-wise feature fusion but need to randomly selecte
numerous points from images, which can not satisfy the demands of fast
inference simultaneously and accurate pose estimation. In this work, we present
a novel deep neural network based on dynamic keypoints selection designed for
6DoF pose estimation from a single RGBD image. Our network includes three
parts, instance semantic segmentation, edge points detection and 6DoF pose
estimation. Given an RGBD image, our network is trained to predict pixel
category and the translation to edge points and center points. Then, a
least-square fitting manner is applied to estimate the 6DoF pose parameters.
Specifically, we propose a dynamic keypoints selection algorithm to choose
keypoints from the foreground feature map. It allows us to leverage geometric
and appearance information. During 6DoF pose estimation, we utilize the
instance semantic segmentation result to filter out background points and only
use foreground points to finish edge points detection and 6DoF pose estimation.
Experiments on two commonly used 6DoF estimation benchmark datasets, YCB-Video
and LineMoD, demonstrate that our method outperforms the state-of-the-art
methods and achieves significant improvements over other same category methods
time efficiency.
Related papers
- KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting [0.0]
KGpose is an end-to-end framework for 6D pose estimation of multiple objects.
Our approach combines keypoint-based method with learnable pose regression through keypoint-graph'
arXiv Detail & Related papers (2024-07-12T01:06:00Z) - Depth-based 6DoF Object Pose Estimation using Swin Transformer [1.14219428942199]
Accurately estimating the 6D pose of objects is crucial for many applications, such as robotic grasping, autonomous driving, and augmented reality.
We propose a novel framework called SwinDePose, that uses only geometric information from depth images to achieve accurate 6D pose estimation.
In experiments on the LineMod and Occlusion LineMod datasets, SwinDePose outperforms existing state-of-the-art methods for 6D object pose estimation using depth images.
arXiv Detail & Related papers (2023-03-03T18:25:07Z) - BiCo-Net: Regress Globally, Match Locally for Robust 6D Pose Estimation [32.49091033895255]
Bi-directional Correspondence Mapping Network (BiCo-Net) generates point clouds guided by a typical pose regression.
An ensemble of redundant pose predictions from locally matching and direct pose regression further refines final pose output against noisy observations.
arXiv Detail & Related papers (2022-05-07T03:37:33Z) - Coupled Iterative Refinement for 6D Multi-Object Pose Estimation [64.7198752089041]
Given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object.
Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy.
arXiv Detail & Related papers (2022-04-26T18:00:08Z) - ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose
Estimation [76.31125154523056]
We present a discrete descriptor, which can represent the object surface densely.
We also propose a coarse to fine training strategy, which enables fine-grained correspondence prediction.
arXiv Detail & Related papers (2022-03-17T16:16:24Z) - Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation [73.40404343241782]
We propose a weakly supervised 6D object pose estimation approach based on 2D keypoint detection.
Our approach achieves comparable performance with state-of-the-art fully supervised approaches.
arXiv Detail & Related papers (2022-03-07T16:23:47Z) - 6D Object Pose Estimation using Keypoints and Part Affinity Fields [24.126513851779936]
The task of 6D object pose estimation from RGB images is an important requirement for autonomous service robots to be able to interact with the real world.
We present a two-step pipeline for estimating the 6 DoF translation and orientation of known objects.
arXiv Detail & Related papers (2021-07-05T14:41:19Z) - DeepI2P: Image-to-Point Cloud Registration via Deep Classification [71.3121124994105]
DeepI2P is a novel approach for cross-modality registration between an image and a point cloud.
Our method estimates the relative rigid transformation between the coordinate frames of the camera and Lidar.
We circumvent the difficulty by converting the registration problem into a classification and inverse camera projection optimization problem.
arXiv Detail & Related papers (2021-04-08T04:27:32Z) - Back to the Feature: Learning Robust Camera Localization from Pixels to
Pose [114.89389528198738]
We introduce PixLoc, a scene-agnostic neural network that estimates an accurate 6-DoF pose from an image and a 3D model.
The system can localize in large environments given coarse pose priors but also improve the accuracy of sparse feature matching.
arXiv Detail & Related papers (2021-03-16T17:40:12Z) - Point-Set Anchors for Object Detection, Instance Segmentation and Pose
Estimation [85.96410825961966]
We argue that the image features extracted at a central point contain limited information for predicting distant keypoints or bounding box boundaries.
To facilitate inference, we propose to instead perform regression from a set of points placed at more advantageous positions.
We apply this proposed framework, called Point-Set Anchors, to object detection, instance segmentation, and human pose estimation.
arXiv Detail & Related papers (2020-07-06T15:59:56Z) - L6DNet: Light 6 DoF Network for Robust and Precise Object Pose
Estimation with Small Datasets [0.0]
We propose a novel approach to perform 6 DoF object pose estimation from a single RGB-D image.
We adopt a hybrid pipeline in two stages: data-driven and geometric.
Our approach is more robust and accurate than state-of-the-art methods.
arXiv Detail & Related papers (2020-02-03T17:41:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.