Image2Lego: Customized LEGO Set Generation from Images
- URL: http://arxiv.org/abs/2108.08477v1
- Date: Thu, 19 Aug 2021 03:42:58 GMT
- Title: Image2Lego: Customized LEGO Set Generation from Images
- Authors: Kyle Lennon, Katharina Fransen, Alexander O'Brien, Yumeng Cao, Matthew
Beveridge, Yamin Arefeen, Nikhil Singh, Iddo Drori
- Abstract summary: We implement a system that generates a LEGO brick model from 2D images.
Models are obtained by algorithmic conversion of the 3D voxelized model to bricks.
We generate step-by-step building instructions and animations for LEGO models of objects and human faces.
- Score: 50.87935634904456
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although LEGO sets have entertained generations of children and adults, the
challenge of designing customized builds matching the complexity of real-world
or imagined scenes remains too great for the average enthusiast. In order to
make this feat possible, we implement a system that generates a LEGO brick
model from 2D images. We design a novel solution to this problem that uses an
octree-structured autoencoder trained on 3D voxelized models to obtain a
feasible latent representation for model reconstruction, and a separate network
trained to predict this latent representation from 2D images. LEGO models are
obtained by algorithmic conversion of the 3D voxelized model to bricks. We
demonstrate first-of-its-kind conversion of photographs to 3D LEGO models. An
octree architecture enables the flexibility to produce multiple resolutions to
best fit a user's creative vision or design needs. In order to demonstrate the
broad applicability of our system, we generate step-by-step building
instructions and animations for LEGO models of objects and human faces.
Finally, we test these automatically generated LEGO sets by constructing
physical builds using real LEGO bricks.
Related papers
- TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly [51.29305265324916]
We propose a class-agnostic tree-transformer framework to predict the sequential assembly actions from input multi-view images.
A major challenge of the sequential brick assembly task is that the step-wise action labels are costly and tedious to obtain in practice.
We mitigate this problem by leveraging synthetic-to-real transfer learning.
arXiv Detail & Related papers (2024-07-22T14:05:27Z) - Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling [98.65190562585461]
Diffusion models excel at generating photo-realistic images but come with significant computational costs.
Current options like U-Net and Vision Transformer often rely on resource-intensive deep networks.
This study introduces LEGO bricks, which seamlessly integrate Local-feature Enrichment and Global-content Orchestration.
arXiv Detail & Related papers (2023-10-10T07:52:30Z) - A Lightweight and Transferable Design for Robust LEGO Manipulation [10.982854061044339]
This paper investigates safe and efficient robotic Lego manipulation.
An end-of-arm tool (EOAT) is designed, which reduces the problem dimension and allows large industrial robots to manipulate small Lego bricks.
Experiments demonstrate that the EOAT can reliably manipulate Lego bricks and the learning framework can effectively and safely improve the manipulation performance to a 100% success rate.
arXiv Detail & Related papers (2023-09-05T16:11:37Z) - MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices [78.20154723650333]
High-quality 3D ground-truth shapes are critical for 3D object reconstruction evaluation.
We introduce a novel multi-view RGBD dataset captured using a mobile device.
We obtain precise 3D ground-truth shape without relying on high-end 3D scanners.
arXiv Detail & Related papers (2023-03-03T14:02:50Z) - UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture [70.59984501516084]
UnrealEgo is a new large-scale naturalistic dataset for egocentric 3D human pose estimation.
It is based on an advanced concept of eyeglasses equipped with two fisheye cameras that can be used in unconstrained environments.
We propose a new benchmark method with a simple but effective idea of devising a 2D keypoint estimation module for stereo inputs to improve 3D human pose estimation.
arXiv Detail & Related papers (2022-08-02T17:59:54Z) - Break and Make: Interactive Structural Understanding Using LEGO Bricks [61.01136603613139]
We build a fully interactive 3D simulator that allows learning agents to assemble, disassemble and manipulate LEGO models.
We take a first step towards solving this problem using sequence-to-sequence models.
arXiv Detail & Related papers (2022-07-27T18:33:09Z) - Building LEGO Using Deep Generative Models of Graphs [22.926487008829668]
We advocate LEGO as a platform for developing generative models of sequential assembly.
We develop a generative model based on graph-structured neural networks that can learn from human-built structures and produce visually compelling designs.
arXiv Detail & Related papers (2020-12-21T18:24:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.