Fugu-MT 論文翻訳(概要): Break and Make: Interactive Structural Understanding Using LEGO Bricks

論文の概要: Break and Make: Interactive Structural Understanding Using LEGO Bricks

arxiv url: http://arxiv.org/abs/2207.13738v1
Date: Wed, 27 Jul 2022 18:33:09 GMT
ステータス: 翻訳完了
システム内更新日: 2022-07-29 11:57:37.921346
Title: Break and Make: Interactive Structural Understanding Using LEGO Bricks
Title（参考訳）: Break and Make: LEGO Bricksを使ったインタラクティブな構造理解
Authors: Aaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox
Abstract要約: 私たちは、LEGOモデルの組み立て、分解、操作が可能な、完全にインタラクティブな3Dシミュレータを構築しました。シーケンス・ツー・シーケンス・モデルを用いてこの問題を解決するための第一歩を踏み出す。
参考スコア（独自算出の注目度）: 61.01136603613139
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Visual understanding of geometric structures with complex spatial relationships is a fundamental component of human intelligence. As children, we learn how to reason about structure not only from observation, but also by interacting with the world around us -- by taking things apart and putting them back together again. The ability to reason about structure and compositionality allows us to not only build things, but also understand and reverse-engineer complex systems. In order to advance research in interactive reasoning for part-based geometric understanding, we propose a challenging new assembly problem using LEGO bricks that we call Break and Make. In this problem an agent is given a LEGO model and attempts to understand its structure by interactively inspecting and disassembling it. After this inspection period, the agent must then prove its understanding by rebuilding the model from scratch using low-level action primitives. In order to facilitate research on this problem we have built LTRON, a fully interactive 3D simulator that allows learning agents to assemble, disassemble and manipulate LEGO models. We pair this simulator with a new dataset of fan-made LEGO creations that have been uploaded to the internet in order to provide complex scenes containing over a thousand unique brick shapes. We take a first step towards solving this problem using sequence-to-sequence models that provide guidance for how to make progress on this challenging problem. Our simulator and data are available at github.com/aaronwalsman/ltron. Additional training code and PyTorch examples are available at github.com/aaronwalsman/ltron-torch-eccv22.
Abstract（参考訳）: 複雑な空間関係を持つ幾何学構造の視覚的理解は、人間の知性の基本的構成要素である。子ども時代は、観察だけでなく、周りの世界と対話することで、構造を理屈する方法を学んでいる。構造と構成性について推論する能力は、ものを構築するだけでなく、複雑なシステムを理解しリバースエンジニアリングすることもできます。部分的幾何学的理解のための対話的推論の研究を進めるために,私たちがBreak and Makeと呼ぶレゴブロックを用いた新しい組立問題を提案する。この問題において、エージェントはLEGOモデルを与え、対話的な検査と分解によってその構造を理解しようとする。この検査期間の後、エージェントは低レベルのアクションプリミティブを使用してモデルをスクラッチから再構築し、その理解を証明する必要がある。この問題を解決するために私たちは,LEGOモデルの組み立て,分解,操作が可能な,完全にインタラクティブな3DシミュレータLTRONを開発した。このシミュレーターと、ファンが作ったレゴ作品の新しいデータセットを組み合わせることで、1000以上のユニークなレンガの形をした複雑なシーンをインターネットにアップロードします。課題の解決に向けて第一歩を踏み出し,課題の解決方法に関するガイダンスを提供するシーケンシャル・ツー・シーケンス・モデルを用いた。シミュレータとデータはgithub.com/aaronwalsman/ltronで利用可能です。追加のトレーニングコードとpytorchサンプルはgithub.com/aaronwalsman/ltron-torch-eccv22で入手できる。

論文の概要: Break and Make: Interactive Structural Understanding Using LEGO Bricks

関連論文リスト