GraspNet: A Large-Scale Clustered and Densely Annotated Dataset for
Object Grasping
- URL: http://arxiv.org/abs/1912.13470v2
- Date: Wed, 1 Jan 2020 03:49:58 GMT
- Title: GraspNet: A Large-Scale Clustered and Densely Annotated Dataset for
Object Grasping
- Authors: Hao-Shu Fang, Chenxi Wang, Minghao Gou, Cewu Lu
- Abstract summary: We contribute a large-scale grasp pose detection dataset with a unified evaluation system.
Our dataset contains 87,040 RGBD images with over 370 million grasp poses.
- Score: 49.777649953381676
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object grasping is critical for many applications, which is also a
challenging computer vision problem. However, for the clustered scene, current
researches suffer from the problems of insufficient training data and the
lacking of evaluation benchmarks. In this work, we contribute a large-scale
grasp pose detection dataset with a unified evaluation system. Our dataset
contains 87,040 RGBD images with over 370 million grasp poses. Meanwhile, our
evaluation system directly reports whether a grasping is successful or not by
analytic computation, which is able to evaluate any kind of grasp poses without
exhausted labeling pose ground-truth. We conduct extensive experiments to show
that our dataset and evaluation system can align well with real-world
experiments. Our dataset, source code and models will be made publicly
available.
Related papers
- Visual Context-Aware Person Fall Detection [52.49277799455569]
We present a segmentation pipeline to semi-automatically separate individuals and objects in images.
Background objects such as beds, chairs, or wheelchairs can challenge fall detection systems, leading to false positive alarms.
We demonstrate that object-specific contextual transformations during training effectively mitigate this challenge.
arXiv Detail & Related papers (2024-04-11T19:06:36Z) - Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval [28.810040080126324]
This paper presents a novel unsupervised deep metric learning approach, termed unsupervised collaborative metric learning with mixed-scale groups (MS-UGCML)
We show that our approach can learn embeddings for objects of varying scales, with an object level and image level mAPs improvement of up to 6.69% and 10.03%, respectively.
arXiv Detail & Related papers (2024-03-16T04:01:50Z) - SynDrone -- Multi-modal UAV Dataset for Urban Scenarios [11.338399194998933]
The scarcity of large-scale real datasets with pixel-level annotations poses a significant challenge to researchers.
We propose a multimodal synthetic dataset containing both images and 3D data taken at multiple flying heights.
The dataset will be made publicly available to support the development of novel computer vision methods targeting UAV applications.
arXiv Detail & Related papers (2023-08-21T06:22:10Z) - Towards Precision in Appearance-based Gaze Estimation in the Wild [3.4253416336476246]
We present a large gaze estimation dataset, PARKS-Gaze, with wider head pose and illumination variation.
The proposed dataset is more challenging and enable models to generalize on unseen participants better than the existing in-the-wild datasets.
arXiv Detail & Related papers (2023-02-05T10:09:35Z) - Large Scale Real-World Multi-Person Tracking [68.27438015329807]
This paper presents a new large scale multi-person tracking dataset -- textttPersonPath22.
It is over an order of magnitude larger than currently available high quality multi-object tracking datasets such as MOT17, HiEve, and MOT20.
arXiv Detail & Related papers (2022-11-03T23:03:13Z) - A Survey of Dataset Refinement for Problems in Computer Vision Datasets [11.45536223418548]
Large-scale datasets have played a crucial role in the advancement of computer vision.
They often suffer from problems such as class imbalance, noisy labels, dataset bias, or high resource costs.
Various data-centric solutions have been proposed to solve the dataset problems.
They improve the quality of datasets by re-organizing them, which we call dataset refinement.
arXiv Detail & Related papers (2022-10-21T03:58:43Z) - Large-scale Unsupervised Semantic Segmentation [163.3568726730319]
We propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to track the research progress.
Based on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 40k high-quality semantic segmentation annotations for evaluation.
arXiv Detail & Related papers (2021-06-06T15:02:11Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets [64.76453161039973]
REVISE (REvealing VIsual biaSEs) is a tool that assists in the investigation of a visual dataset.
It surfacing potential biases along three dimensions: (1) object-based, (2) person-based, and (3) geography-based.
arXiv Detail & Related papers (2020-04-16T23:54:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.