Fugu-MT 論文翻訳(概要): WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

論文の概要: WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

arxiv url: http://arxiv.org/abs/2604.20398v1
Date: Wed, 22 Apr 2026 10:04:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-23 15:36:11.0778
Title: WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning
Title（参考訳）: WebGen-R1:強化学習による大規模言語モデルの導入による機能的および美的Webサイトの生成
Authors: Juyong Jiang, Chenglin Cai, Chansung Park, Jiasi Shen, Sunghun Kim, Jianguo Li, Yue Wang,
Abstract要約: WebGen-R1はプロジェクトレベルのWebサイト生成に適したエンドツーエンドのRLフレームワークである。大規模なオープンエンド行動空間を制約する足場駆動型構造化生成パラダイムを導入する。次に,構造的保証と機能的フィードバックをシームレスに結合する,ケースケード型マルチモーダル報酬を設計する。
参考スコア（独自算出の注目度）: 19.832733425312476
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While Large Language Models (LLMs) excel at function-level code generation, project-level tasks such as generating functional and visually aesthetic multi-page websites remain highly challenging. Existing works are often limited to single-page static websites, while agentic frameworks typically rely on multi-turn execution with proprietary models, leading to substantial token costs, high latency, and brittle integration. Training a small LLM end-to-end with reinforcement learning (RL) is a promising alternative, yet it faces a critical bottleneck in designing reliable and computationally feasible rewards for website generation. Unlike single-file coding tasks that can be verified by unit tests, website generation requires evaluating inherently subjective aesthetics, cross-page interactions, and functional correctness. To this end, we propose WebGen-R1, an end-to-end RL framework tailored for project-level website generation. We first introduce a scaffold-driven structured generation paradigm that constrains the large open-ended action space and preserves architectural integrity. We then design a novel cascaded multimodal reward that seamlessly couples structural guarantees with execution-grounded functional feedback and vision-based aesthetic supervision. Extensive experiments demonstrate that our WebGen-R1 substantially transforms a 7B base model from generating nearly nonfunctional websites into producing deployable, aesthetically aligned multi-page websites. Remarkably, our WebGen-R1 not only consistently outperforms heavily scaled open-source models (up to 72B), but also rivals the state-of-the-art DeepSeek-R1 (671B) in functional success, while substantially exceeding it in valid rendering and aesthetic alignment. These results position WebGen-R1 as a viable path for scaling small open models from function-level code generation to project-level web application generation.
Abstract（参考訳）: 大きな言語モデル(LLM)は関数レベルのコード生成に優れていますが、機能的で視覚的に美的なWebサイトを生成するようなプロジェクトレベルのタスクは非常に難しいままです。既存の作業はシングルページの静的Webサイトに限定されることが多いが、エージェントフレームワークは通常、プロプライエタリなモデルによるマルチターン実行に依存しており、相当なトークンコスト、高いレイテンシ、不安定な統合につながっている。強化学習(RL)による小さなLLMエンドツーエンドのトレーニングは、有望な代替手段だが、Webサイト生成のための信頼性と計算可能な報酬を設計する上で、重要なボトルネックに直面している。単体テストで検証できる単一ファイルのコーディングタスクとは異なり、ウェブサイト生成には固有の主観的美学、ページ間の相互作用、機能的正当性の評価が必要である。この目的のために,プロジェクトレベルのWebサイト生成に適したエンドツーエンドのRLフレームワークであるWebGen-R1を提案する。まず、大規模なオープンエンドアクション空間を制約し、アーキテクチャの整合性を維持する、足場駆動型構造化生成パラダイムを導入する。次に、構造的保証をシームレスに結合し、実行時の機能的フィードバックと視覚に基づく審美的監督とをシームレスに結合する、ケースケード型マルチモーダル報酬を設計する。大規模な実験により、我々のWebGen-R1は7Bベースモデルをほぼ非機能なWebサイトから、デプロイ可能で美学的に整合したマルチページWebサイトへと実質的に変換します。注目すべきなのは、当社のWebGen-R1は、大規模なオープンソースモデル(最大72B)を一貫して上回るだけでなく、最先端のDeepSeek-R1(671B)に匹敵する機能を備えています。これらの結果は、WebGen-R1を、関数レベルのコード生成からプロジェクトレベルのWebアプリケーション生成まで、小さなオープンモデルをスケールするための実行可能なパスとして位置づけている。

論文の概要: WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

関連論文リスト