Convolutional Occupancy Models for Dense Packing of Complex, Novel
Objects
- URL: http://arxiv.org/abs/2308.00091v1
- Date: Mon, 31 Jul 2023 19:08:16 GMT
- Title: Convolutional Occupancy Models for Dense Packing of Complex, Novel
Objects
- Authors: Nikhil Mishra, Pieter Abbeel, Xi Chen, Maximilian Sieb
- Abstract summary: We present a fully-convolutional shape completion model, F-CON, that can be easily combined with off-the-shelf planning methods for dense packing in the real world.
We also release a simulated dataset, COB-3D-v2, that can be used to train shape completion models for real-word robotics applications.
Finally, we equip a real-world pick-and-place system with F-CON, and demonstrate dense packing of complex, unseen objects in cluttered scenes.
- Score: 75.54599721349037
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Dense packing in pick-and-place systems is an important feature in many
warehouse and logistics applications. Prior work in this space has largely
focused on planning algorithms in simulation, but real-world packing
performance is often bottlenecked by the difficulty of perceiving 3D object
geometry in highly occluded, partially observed scenes. In this work, we
present a fully-convolutional shape completion model, F-CON, which can be
easily combined with off-the-shelf planning methods for dense packing in the
real world. We also release a simulated dataset, COB-3D-v2, that can be used to
train shape completion models for real-word robotics applications, and use it
to demonstrate that F-CON outperforms other state-of-the-art shape completion
methods. Finally, we equip a real-world pick-and-place system with F-CON, and
demonstrate dense packing of complex, unseen objects in cluttered scenes.
Across multiple planning methods, F-CON enables substantially better dense
packing than other shape completion methods.
Related papers
- Mixed Diffusion for 3D Indoor Scene Synthesis [55.94569112629208]
We present MiDiffusion, a novel mixed discrete-continuous diffusion model architecture.
We represent a scene layout by a 2D floor plan and a set of objects, each defined by its category, location, size, and orientation.
Our experimental results demonstrate that MiDiffusion substantially outperforms state-of-the-art autoregressive and diffusion models in floor-conditioned 3D scene synthesis.
arXiv Detail & Related papers (2024-05-31T17:54:52Z) - Zero-Shot Multi-Object Scene Completion [59.325611678171974]
We present a 3D scene completion method that recovers the complete geometry of multiple unseen objects in complex scenes from a single RGB-D image.
Our method outperforms the current state-of-the-art on both synthetic and real-world datasets.
arXiv Detail & Related papers (2024-03-21T17:59:59Z) - SC-Diff: 3D Shape Completion with Latent Diffusion Models [4.913210912019975]
This paper introduces a 3D shape completion approach using a 3D latent diffusion model optimized for completing shapes.
Our method combines image-based conditioning through cross-attention and spatial conditioning through the integration of 3D features from captured partial scans.
arXiv Detail & Related papers (2024-03-19T06:01:11Z) - Robust 3D Tracking with Quality-Aware Shape Completion [67.9748164949519]
We propose a synthetic target representation composed of dense and complete point clouds depicting the target shape precisely by shape completion for robust 3D tracking.
Specifically, we design a voxelized 3D tracking framework with shape completion, in which we propose a quality-aware shape completion mechanism to alleviate the adverse effect of noisy historical predictions.
arXiv Detail & Related papers (2023-12-17T04:50:24Z) - DiffComplete: Diffusion-based Generative 3D Shape Completion [114.43353365917015]
We introduce a new diffusion-based approach for shape completion on 3D range scans.
We strike a balance between realism, multi-modality, and high fidelity.
DiffComplete sets a new SOTA performance on two large-scale 3D shape completion benchmarks.
arXiv Detail & Related papers (2023-06-28T16:07:36Z) - Learning Physically Realizable Skills for Online Packing of General 3D
Shapes [41.27652080050046]
We study the problem of learning online packing skills for irregular 3D shapes.
The goal is to consecutively move a sequence of 3D objects with arbitrary shapes into a designated container.
We take physical realizability into account, involving physics dynamics and constraints of a placement.
arXiv Detail & Related papers (2022-12-05T08:23:39Z) - Latent Space Roadmap for Visual Action Planning of Deformable and Rigid
Object Manipulation [74.88956115580388]
Planning is performed in a low-dimensional latent state space that embeds images.
Our framework consists of two main components: a Visual Foresight Module (VFM) that generates a visual plan as a sequence of images, and an Action Proposal Network (APN) that predicts the actions between them.
arXiv Detail & Related papers (2020-03-19T18:43:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.