PosterGen: Aesthetic-Aware Paper-to-Poster Generation via Multi-Agent LLMs
- URL: http://arxiv.org/abs/2508.17188v1
- Date: Sun, 24 Aug 2025 02:25:45 GMT
- Title: PosterGen: Aesthetic-Aware Paper-to-Poster Generation via Multi-Agent LLMs
- Authors: Zhilin Zhang, Xiang Zhang, Jiaqi Wei, Yiwei Xu, Chenyu You,
- Abstract summary: PosterGen is a multi-agent framework that mirrors the workflow of professional poster designers.<n>It produces posters that are both semantically grounded and visually appealing.<n> Experimental results show that PosterGen consistently matches in content fidelity, and significantly outperforms existing methods in visual designs.
- Score: 16.62052847270255
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multi-agent systems built upon large language models (LLMs) have demonstrated remarkable capabilities in tackling complex compositional tasks. In this work, we apply this paradigm to the paper-to-poster generation problem, a practical yet time-consuming process faced by researchers preparing for conferences. While recent approaches have attempted to automate this task, most neglect core design and aesthetic principles, resulting in posters that require substantial manual refinement. To address these design limitations, we propose PosterGen, a multi-agent framework that mirrors the workflow of professional poster designers. It consists of four collaborative specialized agents: (1) Parser and Curator agents extract content from the paper and organize storyboard; (2) Layout agent maps the content into a coherent spatial layout; (3) Stylist agents apply visual design elements such as color and typography; and (4) Renderer composes the final poster. Together, these agents produce posters that are both semantically grounded and visually appealing. To evaluate design quality, we introduce a vision-language model (VLM)-based rubric that measures layout balance, readability, and aesthetic coherence. Experimental results show that PosterGen consistently matches in content fidelity, and significantly outperforms existing methods in visual designs, generating posters that are presentation-ready with minimal human refinements.
Related papers
- PosterOmni: Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback [30.88155039139322]
Poster Omni is a generalized artistic poster creation framework.<n>It integrates the two regimes, namely local editing and global creation, within a single system.<n>It significantly enhances reference adherence, global composition quality, and aesthetic harmony.
arXiv Detail & Related papers (2026-02-12T16:16:38Z) - PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography [44.93712206658515]
PosterVerse is a full-workflow, commercial-grade poster generation method.<n>PosterVerse replicates professional design through three key stages.<n>PosterDNA is a commercial-grade, HTML-based dataset.
arXiv Detail & Related papers (2026-01-07T15:04:24Z) - Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers [11.186078920251754]
Poster generation is a crucial yet challenging task in scientific communication.<n>We introduce the first benchmark and metric suite for poster generation.<n>PosterAgent is a top-down, visual-in-the-loop multi-agent pipeline.
arXiv Detail & Related papers (2025-05-27T17:58:49Z) - P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark [27.57464219790922]
We introduce P2P, the first flexible, LLM-based multi-agent framework that generates high-quality, HTML-rendered academic posters.<n>P2P employs three specialized agents-for visual element processing, content generation, and final poster assembly-each integrated with dedicated checker modules.<n>We establish P2PEval, a comprehensive benchmark featuring 121 paper-poster pairs and a dual evaluation methodology.
arXiv Detail & Related papers (2025-05-21T09:06:05Z) - PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering [50.76106125697899]
Product posters, which integrate subject, scene, and text, are crucial promotional tools for attracting customers.<n>Main challenge lies in accurately rendering text, especially for complex writing systems like Chinese, which contains over 10,000 individual characters.<n>We develop TextRenderNet, which achieves a high text rendering accuracy of over 90%.<n>Based on TextRenderNet and SceneGenNet, we present PosterMaker, an end-to-end generation framework.
arXiv Detail & Related papers (2025-04-09T07:13:08Z) - POSTA: A Go-to Framework for Customized Artistic Poster Generation [87.16343612086959]
POSTA is a modular framework for customized artistic poster generation.<n>Background Diffusion creates a themed background based on user input.<n>Design MLLM then generates layout and typography elements that align with and complement the background style.<n>ArtText Diffusion applies additional stylization to key text elements.
arXiv Detail & Related papers (2025-03-19T05:22:38Z) - Compose Your Aesthetics: Empowering Text-to-Image Models with the Principles of Art [61.28133495240179]
We propose a novel task of aesthetics alignment which seeks to align user-specified aesthetics with the T2I generation output.<n>Inspired by how artworks provide an invaluable perspective to approach aesthetics, we codify visual aesthetics using the compositional framework artists employ.<n>We demonstrate that T2I DMs can effectively offer 10 compositional controls through user-specified PoA conditions.
arXiv Detail & Related papers (2025-03-15T06:58:09Z) - PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [51.88536367177796]
We propose a two-stage, edit-based approach inspired by human drafts for automatically generating presentations.<n>PWTAgent first analyzes references to extract slide-level functional types and content schemas, then generates editing actions based on selected reference slides.<n>PWTAgent significantly outperforms existing automatic presentation generation methods across all three dimensions.
arXiv Detail & Related papers (2025-01-07T16:53:01Z) - GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts [53.568057283934714]
We propose a Vision-Language Model (VLM)-based framework that generates content-aware text logo layouts.<n>We introduce two model techniques that reduce the computational cost for processing multiple glyph images simultaneously.<n>To support instruction tuning of our model, we construct two extensive text logo datasets that are five times larger than existing public datasets.
arXiv Detail & Related papers (2024-11-18T10:04:10Z) - MetaDesigner: Advancing Artistic Typography Through AI-Driven, User-Centric, and Multilingual WordArt Synthesis [65.78359025027457]
MetaDesigner introduces a transformative framework for artistic typography, powered by Large Language Models (LLMs)<n>Its foundation is a multi-agent system comprising the Pipeline, Glyph, and Texture agents, which collectively orchestrate the creation of customizable WordArt.
arXiv Detail & Related papers (2024-06-28T11:58:26Z) - AutoPoster: A Highly Automatic and Content-aware Design System for
Advertising Poster Generation [14.20790443380675]
This paper introduces AutoPoster, a highly automatic and content-aware system for generating advertising posters.
With only product images and titles as inputs, AutoPoster can automatically produce posters of varying sizes through four key stages.
We propose the first poster generation dataset that includes visual attribute annotations for over 76k posters.
arXiv Detail & Related papers (2023-08-02T11:58:43Z) - PosterLayout: A New Benchmark and Approach for Content-aware
Visual-Textual Presentation Layout [62.12447593298437]
Content-aware visual-textual presentation layout aims at arranging spatial space on the given canvas for pre-defined elements.
We propose design sequence formation (DSF) that reorganizes elements in layouts to imitate the design processes of human designers.
A novel CNN-LSTM-based conditional generative adversarial network (GAN) is presented to generate proper layouts.
arXiv Detail & Related papers (2023-03-28T12:48:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.