KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation
- URL: http://arxiv.org/abs/2109.10127v1
- Date: Tue, 21 Sep 2021 12:17:24 GMT
- Title: KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation
- Authors: Xingyu Liu, Shun Iwase, Kris M. Kitani
- Abstract summary: KDFNet is a novel method for 6D object pose estimation from RGB images.
We propose a continuous representation called Keypoint Distance Field (KDF) for projected 2D keypoint locations.
We use a fully convolutional neural network to regress the KDF for each keypoint.
- Score: 43.839322860501596
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present KDFNet, a novel method for 6D object pose estimation from RGB
images. To handle occlusion, many recent works have proposed to localize 2D
keypoints through pixel-wise voting and solve a Perspective-n-Point (PnP)
problem for pose estimation, which achieves leading performance. However, such
voting process is direction-based and cannot handle long and thin objects where
the direction intersections cannot be robustly found. To address this problem,
we propose a novel continuous representation called Keypoint Distance Field
(KDF) for projected 2D keypoint locations. Formulated as a 2D array, each
element of the KDF stores the 2D Euclidean distance between the corresponding
image pixel and a specified projected 2D keypoint. We use a fully convolutional
neural network to regress the KDF for each keypoint. Using this KDF encoding of
projected object keypoint locations, we propose to use a distance-based voting
scheme to localize the keypoints by calculating circle intersections in a
RANSAC fashion. We validate the design choices of our framework by extensive
ablation experiments. Our proposed method achieves state-of-the-art performance
on Occlusion LINEMOD dataset with an average ADD(-S) accuracy of 50.3% and TOD
dataset mug subset with an average ADD accuracy of 75.72%. Extensive
experiments and visualizations demonstrate that the proposed method is able to
robustly estimate the 6D pose in challenging scenarios including occlusion.
Related papers
- CheckerPose: Progressive Dense Keypoint Localization for Object Pose
Estimation with Graph Neural Network [66.24726878647543]
Estimating the 6-DoF pose of a rigid object from a single RGB image is a crucial yet challenging task.
Recent studies have shown the great potential of dense correspondence-based solutions.
We propose a novel pose estimation algorithm named CheckerPose, which improves on three main aspects.
arXiv Detail & Related papers (2023-03-29T17:30:53Z) - Coupled Iterative Refinement for 6D Multi-Object Pose Estimation [64.7198752089041]
Given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object.
Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy.
arXiv Detail & Related papers (2022-04-26T18:00:08Z) - Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation [73.40404343241782]
We propose a weakly supervised 6D object pose estimation approach based on 2D keypoint detection.
Our approach achieves comparable performance with state-of-the-art fully supervised approaches.
arXiv Detail & Related papers (2022-03-07T16:23:47Z) - A Dynamic Keypoints Selection Network for 6DoF Pose Estimation [0.0]
6 DoF poses estimation problem aims to estimate the rotation and translation parameters between two coordinates.
We present a novel deep neural network based on dynamic keypoints selection designed for 6DoF pose estimation from a single RGBD image.
arXiv Detail & Related papers (2021-10-24T09:58:56Z) - 6D Object Pose Estimation using Keypoints and Part Affinity Fields [24.126513851779936]
The task of 6D object pose estimation from RGB images is an important requirement for autonomous service robots to be able to interact with the real world.
We present a two-step pipeline for estimating the 6 DoF translation and orientation of known objects.
arXiv Detail & Related papers (2021-07-05T14:41:19Z) - Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial
Keypoint Voting [7.6997148655751895]
We propose a novel keypoint voting scheme based on intersecting spheres, that is more accurate than existing schemes and allows for a smaller set of more disperse keypoints.
The scheme forms the basis of the proposed RCVPose method for 6 DoF pose estimation of 3D objects in RGB-D data.
arXiv Detail & Related papers (2021-04-06T14:06:08Z) - 3D Point-to-Keypoint Voting Network for 6D Pose Estimation [8.801404171357916]
We propose a framework for 6D pose estimation from RGB-D data based on spatial structure characteristics of 3D keypoints.
The proposed method is verified on two benchmark datasets, LINEMOD and OCCLUSION LINEMOD.
arXiv Detail & Related papers (2020-12-22T11:43:15Z) - Learning 2D-3D Correspondences To Solve The Blind Perspective-n-Point
Problem [98.92148855291363]
This paper proposes a deep CNN model which simultaneously solves for both 6-DoF absolute camera pose 2D--3D correspondences.
Tests on both real and simulated data have shown that our method substantially outperforms existing approaches.
arXiv Detail & Related papers (2020-03-15T04:17:30Z) - Robust 6D Object Pose Estimation by Learning RGB-D Features [59.580366107770764]
We propose a novel discrete-continuous formulation for rotation regression to resolve this local-optimum problem.
We uniformly sample rotation anchors in SO(3), and predict a constrained deviation from each anchor to the target, as well as uncertainty scores for selecting the best prediction.
Experiments on two benchmarks: LINEMOD and YCB-Video, show that the proposed method outperforms state-of-the-art approaches.
arXiv Detail & Related papers (2020-02-29T06:24:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.