Brick-by-Brick: Combinatorial Construction with Deep Reinforcement
Learning
- URL: http://arxiv.org/abs/2110.15481v1
- Date: Fri, 29 Oct 2021 01:09:51 GMT
- Title: Brick-by-Brick: Combinatorial Construction with Deep Reinforcement
Learning
- Authors: Hyunsoo Chung, Jungtaek Kim, Boris Knyazev, Jinhwi Lee, Graham W.
Taylor, Jaesik Park, Minsu Cho
- Abstract summary: We introduce a novel formulation, complex construction, which requires a building agent to assemble unit primitives sequentially.
To construct a target object, we provide incomplete knowledge about the desired target (i.e., 2D images) instead of exact and explicit information to the agent.
We demonstrate that the proposed method successfully learns to construct an unseen object conditioned on a single image or multiple views of a target object.
- Score: 52.85981207514049
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Discovering a solution in a combinatorial space is prevalent in many
real-world problems but it is also challenging due to diverse complex
constraints and the vast number of possible combinations. To address such a
problem, we introduce a novel formulation, combinatorial construction, which
requires a building agent to assemble unit primitives (i.e., LEGO bricks)
sequentially -- every connection between two bricks must follow a fixed rule,
while no bricks mutually overlap. To construct a target object, we provide
incomplete knowledge about the desired target (i.e., 2D images) instead of
exact and explicit volumetric information to the agent. This problem requires a
comprehensive understanding of partial information and long-term planning to
append a brick sequentially, which leads us to employ reinforcement learning.
The approach has to consider a variable-sized action space where a large number
of invalid actions, which would cause overlap between bricks, exist. To resolve
these issues, our model, dubbed Brick-by-Brick, adopts an action validity
prediction network that efficiently filters invalid actions for an actor-critic
network. We demonstrate that the proposed method successfully learns to
construct an unseen object conditioned on a single image or multiple views of a
target object.
Related papers
- Learning to Build by Building Your Own Instructions [56.734927320020496]
We develop a new technique for the recently proposed Break-and-Make problem in LTRON.
An agent must learn to build a previously unseen LEGO assembly using a single interactive session.
We train these models using online imitation learning which allows the model to learn from its own mistakes.
arXiv Detail & Related papers (2024-10-01T22:39:58Z) - Physics-Aware Combinatorial Assembly Planning using Deep Reinforcement Learning [6.919208054874144]
Combinatorial assembly uses standardized unit primitives to build objects that satisfy user specifications.
This paper studies sequence planning for physical assembly using Lego.
In particular, we aim to ensure the planned assembly sequence is physically executable.
arXiv Detail & Related papers (2024-08-19T17:16:35Z) - Budget-Aware Sequential Brick Assembly with Efficient Constraint Satisfaction [63.672314717599285]
We tackle the problem of sequential brick assembly with LEGO bricks to create 3D structures.
In particular, the number of assemblable structures increases exponentially as the number of bricks used increases.
We propose a new method to predict the scores of the next brick position by employing a U-shaped sparse 3D convolutional neural network.
arXiv Detail & Related papers (2022-10-03T15:35:08Z) - Break and Make: Interactive Structural Understanding Using LEGO Bricks [61.01136603613139]
We build a fully interactive 3D simulator that allows learning agents to assemble, disassemble and manipulate LEGO models.
We take a first step towards solving this problem using sequence-to-sequence models.
arXiv Detail & Related papers (2022-07-27T18:33:09Z) - GANzzle: Reframing jigsaw puzzle solving as a retrieval task using a
generative mental image [15.132848477903314]
We infer a mental image from all pieces, which a given piece can then be matched against avoiding the explosion.
We learn how to reconstruct the image given a set of unordered pieces, allowing the model to learn a joint embedding space to match an encoding of each piece to the cropped layer of the generator.
In doing so our model is puzzle size agnostic, in contrast to prior deep learning methods which are single size.
arXiv Detail & Related papers (2022-07-12T16:02:00Z) - Blocks Assemble! Learning to Assemble with Large-Scale Structured
Reinforcement Learning [23.85678777628229]
Assembly of multi-part physical structures is a valuable end product for autonomous robotics.
We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits.
We find that the combination of large-scale reinforcement learning and graph-based policies is an effective recipe for training agents.
arXiv Detail & Related papers (2022-03-15T18:21:02Z) - CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and
Transfer Learning [138.40338621974954]
CausalWorld is a benchmark for causal structure and transfer learning in a robotic manipulation environment.
Tasks consist of constructing 3D shapes from a given set of blocks - inspired by how children learn to build complex structures.
arXiv Detail & Related papers (2020-10-08T23:01:13Z) - Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video
Grounding [90.12181414070496]
We propose a novel object-aware multi-branch relation network for object-aware relation discovery.
We then propose multi-branch reasoning to capture critical object relationships between the main branch and auxiliary branches.
arXiv Detail & Related papers (2020-08-16T15:39:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.