3D Object Segmentation for Shelf Bin Picking by Humanoid with Deep
Learning and Occupancy Voxel Grid Map
- URL: http://arxiv.org/abs/2001.05406v2
- Date: Thu, 16 Jan 2020 08:54:39 GMT
- Title: 3D Object Segmentation for Shelf Bin Picking by Humanoid with Deep
Learning and Occupancy Voxel Grid Map
- Authors: Kentaro Wada, Masaki Murooka, Kei Okada, Masayuki Inaba
- Abstract summary: We develop a method to segment target objects in 3D using multiple camera angles and voxel grid map.
We evaluate the method with the picking task experiment for target objects in narrow shelf bins.
- Score: 27.312696750923926
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Picking objects in a narrow space such as shelf bins is an important task for
humanoid to extract target object from environment. In those situations,
however, there are many occlusions between the camera and objects, and this
makes it difficult to segment the target object three dimensionally because of
the lack of three dimentional sensor inputs. We address this problem with
accumulating segmentation result with multiple camera angles, and generating
voxel model of the target object. Our approach consists of two components:
first is object probability prediction for input image with convolutional
networks, and second is generating voxel grid map which is designed for object
segmentation. We evaluated the method with the picking task experiment for
target objects in narrow shelf bins. Our method generates dense 3D object
segments even with occlusions, and the real robot successfuly picked target
objects from the narrow space.
Related papers
- PickScan: Object discovery and reconstruction from handheld interactions [99.99566882133179]
We develop an interaction-guided and class-agnostic method to reconstruct 3D representations of scenes.
Our main contribution is a novel approach to detecting user-object interactions and extracting the masks of manipulated objects.
Compared to Co-Fusion, the only comparable interaction-based and class-agnostic baseline, this corresponds to a reduction in chamfer distance of 73%.
arXiv Detail & Related papers (2024-11-17T23:09:08Z) - 3D Small Object Detection with Dynamic Spatial Pruning [62.72638845817799]
We propose an efficient feature pruning strategy for 3D small object detection.
We present a multi-level 3D detector named DSPDet3D which benefits from high spatial resolution.
It takes less than 2s to directly process a whole building consisting of more than 4500k points while detecting out almost all objects.
arXiv Detail & Related papers (2023-05-05T17:57:04Z) - OA-BEV: Bringing Object Awareness to Bird's-Eye-View Representation for
Multi-Camera 3D Object Detection [78.38062015443195]
OA-BEV is a network that can be plugged into the BEV-based 3D object detection framework.
Our method achieves consistent improvements over the BEV-based baselines in terms of both average precision and nuScenes detection score.
arXiv Detail & Related papers (2023-01-13T06:02:31Z) - Neural Correspondence Field for Object Pose Estimation [67.96767010122633]
We propose a method for estimating the 6DoF pose of a rigid object with an available 3D model from a single RGB image.
Unlike classical correspondence-based methods which predict 3D object coordinates at pixels of the input image, the proposed method predicts 3D object coordinates at 3D query points sampled in the camera frustum.
arXiv Detail & Related papers (2022-07-30T01:48:23Z) - Topologically Persistent Features-based Object Recognition in Cluttered
Indoor Environments [1.2691047660244335]
Recognition of occluded objects in unseen indoor environments is a challenging problem for mobile robots.
This work proposes a new slicing-based topological descriptor that captures the 3D shape of object point clouds.
It yields similarities between the descriptors of the occluded and the corresponding unoccluded objects, enabling object unity-based recognition.
arXiv Detail & Related papers (2022-05-16T07:01:16Z) - VIN: Voxel-based Implicit Network for Joint 3D Object Detection and
Segmentation for Lidars [12.343333815270402]
A unified neural network structure is presented for joint 3D object detection and point cloud segmentation.
We leverage rich supervision from both detection and segmentation labels rather than using just one of them.
arXiv Detail & Related papers (2021-07-07T02:16:20Z) - Counting from Sky: A Large-scale Dataset for Remote Sensing Object
Counting and A Benchmark Method [52.182698295053264]
We are interested in counting dense objects from remote sensing images. Compared with object counting in a natural scene, this task is challenging in the following factors: large scale variation, complex cluttered background, and orientation arbitrariness.
To address these issues, we first construct a large-scale object counting dataset with remote sensing images, which contains four important geographic objects.
We then benchmark the dataset by designing a novel neural network that can generate a density map of an input image.
arXiv Detail & Related papers (2020-08-28T03:47:49Z) - Learning Object Depth from Camera Motion and Video Object Segmentation [43.81711115175958]
This paper addresses the problem of learning to estimate the depth of segmented objects given some measurement of camera motion.
We create artificial object segmentations that are scaled for changes in distance between the camera and object, and our network learns to estimate object depth even with segmentation errors.
We demonstrate our approach across domains using a robot camera to locate objects from the YCB dataset and a vehicle camera to locate obstacles while driving.
arXiv Detail & Related papers (2020-07-11T03:50:57Z) - Instance Segmentation of Biomedical Images with an Object-aware
Embedding Learned with Local Constraints [7.151685185368064]
State-of-the-art approaches either perform semantic segmentation or refine object bounding boxes.
Both suffer from crowded objects to varying degrees, merging adjacent objects or suppressing a valid object.
In this work, we assign an embedding vector to each pixel through a deep neural network.
arXiv Detail & Related papers (2020-04-21T08:33:29Z) - Counting dense objects in remote sensing images [52.182698295053264]
Estimating number of interested objects from a given image is a challenging yet important task.
In this paper, we are interested in counting dense objects from remote sensing images.
To address these issues, we first construct a large-scale object counting dataset based on remote sensing images.
We then benchmark the dataset by designing a novel neural network which can generate density map of an input image.
arXiv Detail & Related papers (2020-02-14T09:13:54Z) - Object as Hotspots: An Anchor-Free 3D Object Detection Approach via
Firing of Hotspots [37.16690737208046]
We argue for an approach opposite to existing methods using object-level anchors.
Inspired by compositional models, we propose an object as composition of its interior non-empty voxels, termed hotspots.
Based on OHS, we propose an anchor-free detection head with a novel ground truth assignment strategy.
arXiv Detail & Related papers (2019-12-30T03:02:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.