Fugu-MT 論文翻訳(概要): LayoutPrompter: Awaken the Design Ability of Large Language Models

論文の概要: LayoutPrompter: Awaken the Design Ability of Large Language Models

arxiv url: http://arxiv.org/abs/2311.06495v1
Date: Sat, 11 Nov 2023 07:15:14 GMT
ステータス: 翻訳完了
システム内更新日: 2023-11-14 18:21:53.777981
Title: LayoutPrompter: Awaken the Design Ability of Large Language Models
Title（参考訳）: LayoutPrompter: 大規模言語モデルの設計能力の覚醒
Authors: Jiawei Lin, Jiaqi Guo, Shizhao Sun, Zijiang James Yang, Jian-Guang Lou, Dongmei Zhang
Abstract要約: ユーザの制約を自動的に高品質なレイアウトにマッピングする条件付きグラフィックレイアウト生成が,今日,注目を集めている。本稿では,LyoutPrompterを提案する。LyoutPrompterは大規模言語モデル(LLM)を利用して,テキスト内学習によって上記の問題に対処する。 layoutPrompterは、入力出力シリアライゼーション、動的例選択、レイアウトランキングという3つの重要なコンポーネントで構成されている。
参考スコア（独自算出の注目度）: 46.6946399858818
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Conditional graphic layout generation, which automatically maps user constraints to high-quality layouts, has attracted widespread attention today. Although recent works have achieved promising performance, the lack of versatility and data efficiency hinders their practical applications. In this work, we propose LayoutPrompter, which leverages large language models (LLMs) to address the above problems through in-context learning. LayoutPrompter is made up of three key components, namely input-output serialization, dynamic exemplar selection and layout ranking. Specifically, the input-output serialization component meticulously designs the input and output formats for each layout generation task. Dynamic exemplar selection is responsible for selecting the most helpful prompting exemplars for a given input. And a layout ranker is used to pick the highest quality layout from multiple outputs of LLMs. We conduct experiments on all existing layout generation tasks using four public datasets. Despite the simplicity of our approach, experimental results show that LayoutPrompter can compete with or even outperform state-of-the-art approaches on these tasks without any model training or fine-tuning. This demonstrates the effectiveness of this versatile and training-free approach. In addition, the ablation studies show that LayoutPrompter is significantly superior to the training-based baseline in a low-data regime, further indicating the data efficiency of LayoutPrompter. Our project is available at https://github.com/microsoft/LayoutGeneration/tree/main/LayoutPrompter.
Abstract（参考訳）: ユーザの制約を高品質なレイアウトに自動マッピングする条件付きグラフィックレイアウト生成が,今日では広く注目を集めている。最近の研究は有望な性能を達成しているが、汎用性とデータ効率の欠如は実用的応用を妨げる。そこで本研究では,大規模言語モデル(LLM)を活用したLayoutPrompterを提案する。 LayoutPrompterは、入力出力シリアライゼーション、動的指数選択、レイアウトランキングという3つの重要なコンポーネントで構成されている。具体的には、入力出力シリアライゼーションコンポーネントは、各レイアウト生成タスクの入力および出力フォーマットを慎重に設計する。動的例題選択は、与えられた入力に対して最も有用な例題を選択する責任がある。 LLMの複数の出力から最高品質のレイアウトを選択するためにレイアウトローダが使用される。 4つの公開データセットを用いて既存のレイアウト生成タスクをすべて実験する。このアプローチの単純さにもかかわらず、実験結果から、LayoutPrompterはモデルトレーニングや微調整なしに、これらのタスクにおける最先端のアプローチと競合したり、性能を上回ります。これは、この多用途でトレーニングフリーなアプローチの有効性を示しています。さらに,レイアウトプロンプターは低データ状態におけるトレーニングベースベースラインよりも有意に優れており,レイアウトプロンプターのデータ効率も向上している。私たちのプロジェクトはhttps://github.com/microsoft/LayoutGeneration/tree/main/LayoutPrompterで利用可能です。

論文の概要: LayoutPrompter: Awaken the Design Ability of Large Language Models

関連論文リスト