Constrained Graphic Layout Generation via Latent Optimization
- URL: http://arxiv.org/abs/2108.00871v1
- Date: Mon, 2 Aug 2021 13:04:11 GMT
- Title: Constrained Graphic Layout Generation via Latent Optimization
- Authors: Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi
- Abstract summary: We generate graphic layouts that can flexibly incorporate design semantics, either specified implicitly or explicitly by a user.
Our approach builds on a generative layout model based on a Transformer architecture, and formulates the layout generation as a constrained optimization problem.
We show in the experiments that our approach is capable of generating realistic layouts in both constrained and unconstrained generation tasks with a single model.
- Score: 17.05026043385661
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: It is common in graphic design humans visually arrange various elements
according to their design intent and semantics. For example, a title text
almost always appears on top of other elements in a document. In this work, we
generate graphic layouts that can flexibly incorporate such design semantics,
either specified implicitly or explicitly by a user. We optimize using the
latent space of an off-the-shelf layout generation model, allowing our approach
to be complementary to and used with existing layout generation models. Our
approach builds on a generative layout model based on a Transformer
architecture, and formulates the layout generation as a constrained
optimization problem where design constraints are used for element alignment,
overlap avoidance, or any other user-specified relationship. We show in the
experiments that our approach is capable of generating realistic layouts in
both constrained and unconstrained generation tasks with a single model. The
code is available at https://github.com/ktrk115/const_layout .
Related papers
- GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts [53.568057283934714]
We propose a VLM-based framework that generates content-aware text logo layouts.
We introduce two model techniques to reduce the computation for processing multiple glyph images simultaneously.
To support instruction-tuning of out model, we construct two extensive text logo datasets, which are 5x more larger than the existing public dataset.
arXiv Detail & Related papers (2024-11-18T10:04:10Z) - CoLay: Controllable Layout Generation through Multi-conditional Latent Diffusion [21.958752304572553]
Existing models face two main challenges that limits their adoption in practice.
Most existing models focus on generating labels and coordinates, while real layouts contain a range of style properties.
We propose a novel framework, CoLay, that integrates multiple condition types and generates complex layouts with diverse style properties.
arXiv Detail & Related papers (2024-05-18T17:30:48Z) - PosterLlama: Bridging Design Ability of Langauge Model to Contents-Aware Layout Generation [6.855409699832414]
PosterLlama is a network designed for generating visually and textually coherent layouts.
Our evaluations demonstrate that PosterLlama outperforms existing methods in producing authentic and content-aware layouts.
It supports an unparalleled range of conditions, including but not limited to unconditional layout generation, element conditional layout generation, layout completion, among others, serving as a highly versatile user manipulation tool.
arXiv Detail & Related papers (2024-04-01T08:46:35Z) - Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints [53.66698106829144]
We propose a unified model to handle a broad range of layout generation tasks.
The model is based on continuous diffusion models.
Experiment results show that LACE produces high-quality layouts.
arXiv Detail & Related papers (2024-02-07T11:12:41Z) - A Parse-Then-Place Approach for Generating Graphic Layouts from Textual
Descriptions [50.469491454128246]
We use text as the guidance to create graphic layouts, i.e., Text-to-labeled, aiming to lower the design barriers.
Text-to-labeled is a challenging task, because it needs to consider the implicit, combined, and incomplete constraints from text.
We present a two-stage approach, named parse-then-place, to address this problem.
arXiv Detail & Related papers (2023-08-24T10:37:00Z) - PosterLayout: A New Benchmark and Approach for Content-aware
Visual-Textual Presentation Layout [62.12447593298437]
Content-aware visual-textual presentation layout aims at arranging spatial space on the given canvas for pre-defined elements.
We propose design sequence formation (DSF) that reorganizes elements in layouts to imitate the design processes of human designers.
A novel CNN-LSTM-based conditional generative adversarial network (GAN) is presented to generate proper layouts.
arXiv Detail & Related papers (2023-03-28T12:48:36Z) - LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer [80.61492265221817]
Graphic layout designs play an essential role in visual communication.
Yet handcrafting layout designs is skill-demanding, time-consuming, and non-scalable to batch production.
Generative models emerge to make design automation scalable but it remains non-trivial to produce designs that comply with designers' desires.
arXiv Detail & Related papers (2022-12-19T21:57:35Z) - Generative Layout Modeling using Constraint Graphs [37.78500605563527]
We propose a new generative model for layout generation.
First, we generate the layout elements as nodes in a layout graph.
Second, we compute constraints between layout elements as edges in the layout graph.
Third, we solve for the final layout using constrained optimization.
arXiv Detail & Related papers (2020-11-26T18:18:37Z) - LayoutTransformer: Layout Generation and Completion with Self-attention [105.21138914859804]
We address the problem of scene layout generation for diverse domains such as images, mobile applications, documents, and 3D objects.
We propose LayoutTransformer, a novel framework that leverages self-attention to learn contextual relationships between layout elements.
Our framework allows us to generate a new layout either from an empty set or from an initial seed set of primitives, and can easily scale to support an arbitrary of primitives per layout.
arXiv Detail & Related papers (2020-06-25T17:56:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.