GDN: A Coarse-To-Fine (C2F) Representation for End-To-End 6-DoF Grasp
Detection
- URL: http://arxiv.org/abs/2010.10695v4
- Date: Wed, 11 Nov 2020 07:00:03 GMT
- Title: GDN: A Coarse-To-Fine (C2F) Representation for End-To-End 6-DoF Grasp
Detection
- Authors: Kuang-Yu Jeng, Yueh-Cheng Liu, Zhe Yu Liu, Jen-Wei Wang, Ya-Liang
Chang, Hung-Ting Su, and Winston H. Hsu
- Abstract summary: We propose an end-to-end grasp detection network, Grasp Detection Network (GDN)
Compared to previous two-stage approaches, our architecture is at least 20 times faster.
We propose a new AP-based metric which considers both rotation and transition errors.
- Score: 23.480036081293242
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We proposed an end-to-end grasp detection network, Grasp Detection Network
(GDN), cooperated with a novel coarse-to-fine (C2F) grasp representation design
to detect diverse and accurate 6-DoF grasps based on point clouds. Compared to
previous two-stage approaches which sample and evaluate multiple grasp
candidates, our architecture is at least 20 times faster. It is also 8% and 40%
more accurate in terms of the success rate in single object scenes and the
complete rate in clutter scenes, respectively. Our method shows superior
results among settings with different number of views and input points.
Moreover, we propose a new AP-based metric which considers both rotation and
transition errors, making it a more comprehensive evaluation tool for grasp
detection models.
Related papers
- PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection [26.843891792018447]
We present PETDet (Proposal Enhancement for Two-stage fine-grained object detection) to better handle the sub-tasks in two-stage FGOD methods.
An anchor-free Quality Oriented Proposal Network (QOPN) is proposed with dynamic label assignment and attention-based decomposition.
A novel Adaptive Recognition Loss (ARL) offers guidance for the R-CNN head to focus on high-quality proposals.
arXiv Detail & Related papers (2023-12-16T18:04:56Z) - DFIL: Deepfake Incremental Learning by Exploiting Domain-invariant
Forgery Clues [32.045504965382015]
Current deepfake detection models can generally recognize forgery images by training on a large dataset.
The accuracy of detection models degrades significantly on images generated by new deepfake methods due to the difference in data distribution.
We present a novel incremental learning framework that improves the generalization of deepfake detection models.
arXiv Detail & Related papers (2023-09-18T07:02:26Z) - GMSF: Global Matching Scene Flow [17.077134204089536]
We tackle the task of scene flow estimation from point clouds.
Given a source and a target point cloud, the objective is to estimate a translation from each point in the source point cloud to the target.
We propose a significantly simpler single-scale one-shot global matching to address the problem.
arXiv Detail & Related papers (2023-05-27T10:04:21Z) - Extending One-Stage Detection with Open-World Proposals [8.492340530784697]
We show that fully convolutional one-stage detection network FCOS can increase OWP performance by as much as 6% in recall on novel classes.
While two-stage methods worsen by 6% in recall on novel classes, we show that FCOS only drops 2% when jointly optimizing for OWP and classification.
arXiv Detail & Related papers (2022-01-07T02:29:09Z) - GKNet: grasp keypoint network for grasp candidates detection [15.214390498300101]
This paper presents a different approach to grasp detection by treating it as keypoint detection.
The deep network detects each grasp candidate as a pair of keypoints, convertible to the grasp representation g = x, y, w, thetaT, rather than a triplet or quartet of corner points.
arXiv Detail & Related papers (2021-06-16T00:34:55Z) - Distribution Alignment: A Unified Framework for Long-tail Visual
Recognition [52.36728157779307]
We propose a unified distribution alignment strategy for long-tail visual recognition.
We then introduce a generalized re-weight method in the two-stage learning to balance the class prior.
Our approach achieves the state-of-the-art results across all four recognition tasks with a simple and unified framework.
arXiv Detail & Related papers (2021-03-30T14:09:53Z) - City-scale Scene Change Detection using Point Clouds [71.73273007900717]
We propose a method for detecting structural changes in a city using images captured from mounted cameras over two different times.
A direct comparison of the two point clouds for change detection is not ideal due to inaccurate geo-location information.
To circumvent this problem, we propose a deep learning-based non-rigid registration on the point clouds.
Experiments show that our method is able to detect scene changes effectively, even in the presence of viewpoint and illumination differences.
arXiv Detail & Related papers (2021-03-26T08:04:13Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z) - Diverse Knowledge Distillation for End-to-End Person Search [81.4926655119318]
Person search aims to localize and identify a specific person from a gallery of images.
Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches.
We propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck.
arXiv Detail & Related papers (2020-12-21T09:04:27Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Corner Proposal Network for Anchor-free, Two-stage Object Detection [174.59360147041673]
The goal of object detection is to determine the class and location of objects in an image.
This paper proposes a novel anchor-free, two-stage framework which first extracts a number of object proposals.
We demonstrate that these two stages are effective solutions for improving recall and precision.
arXiv Detail & Related papers (2020-07-27T19:04:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.