Fugu-MT 論文翻訳(概要): ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

論文の概要: ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

arxiv url: http://arxiv.org/abs/2603.27991v1
Date: Mon, 30 Mar 2026 03:29:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:45.210313
Title: ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
Title（参考訳）: ViviDoc: ヒューマンエージェントコラボレーションによるインタラクティブドキュメントの生成
Authors: Yinghao Tang, Yupeng Xie, Yingchaojie Feng, Tingfeng Lan, Jiale Lao, Yue Cheng, Wei Chen,
Abstract要約: インタラクティブなドキュメントは、ダイナミックな可視化、インタラクティブなアニメーション、探索的なインターフェイスを通じて、読者が複雑なアイデアに取り組むのに役立つ。近年のLarge Language Model (LLM) ベースのエージェントは、コンテンツ生成を自動化できるが、インタラクティブなドキュメント生成に直接適用することで、制御が難しい出力を生成することが多い。インタラクティブなドキュメント生成を体系的に扱うための最初の作業として,私たちの知る限り,ViviDocを紹介します。
参考スコア（独自算出の注目度）: 6.761074932523358
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Interactive documents help readers engage with complex ideas through dynamic visualization, interactive animations, and exploratory interfaces. However, creating such documents remains costly, as it requires both domain expertise and web development skills. Recent Large Language Model (LLM)-based agents can automate content creation, but directly applying them to interactive document generation often produces outputs that are difficult to control. To address this, we present ViviDoc, to the best of our knowledge the first work to systematically address interactive document generation. ViviDoc introduces a multi-agent pipeline (Planner, Styler, Executor, Evaluator). To make the generation process controllable, we provide three levels of human control: (1) the Document Specification (DocSpec) with SRTC Interaction Specifications (State, Render, Transition, Constraint) for structured planning, (2) a content-aware Style Palette for customizing writing and interaction styles, and (3) chat-based editing for iterative refinement. We also construct ViviBench, a benchmark of 101 topics derived from real-world interactive documents across 11 domains, along with a taxonomy of 8 interaction types and a 4-dimensional automated evaluation framework validated against human ratings (Pearson r > 0.84). Experiments show that ViviDoc achieves the highest content richness and interaction quality in both automated and human evaluation. A 12-person user study confirms that the system is easy to use, provides effective control over the generation process, and produces documents that satisfy users.
Abstract（参考訳）: インタラクティブなドキュメントは、ダイナミックな可視化、インタラクティブなアニメーション、探索的なインターフェイスを通じて、読者が複雑なアイデアに取り組むのに役立つ。しかし、ドメインの専門知識とWeb開発スキルの両方を必要とするため、そのようなドキュメントの作成には依然としてコストがかかる。近年のLarge Language Model (LLM) ベースのエージェントは、コンテンツ生成を自動化できるが、インタラクティブなドキュメント生成に直接適用することで、制御が難しい出力を生成することが多い。そこで本稿では,対話型文書生成の体系化に向けた最初の取り組みとして,ViviDocを紹介する。 ViviDocはマルチエージェントパイプライン(Planner、Styler、Executor、Evaluator)を導入している。生成プロセスを制御可能にするために,1)構造化計画のためのSRTCインタラクション仕様(状態,レンダリング,遷移,制約)付き文書仕様(DocSpec),2)書き込みスタイルやインタラクションスタイルをカスタマイズするためのコンテンツ対応スタイルパレット,3)反復改善のためのチャットベースの編集の3段階の人的制御を行う。また,11領域にわたる実世界の対話文書から抽出した101のトピックのベンチマークであるViviBenchと,8種類の対話型分類と,人間の評価に対して検証された4次元自動評価フレームワークを構築した(Pearson r > 0.84)。実験により、ViviDocは、自動評価と人的評価の両方において、最高のコンテンツ豊かさと相互作用品質を達成することが示された。 12人のユーザによる調査では、システムは使いやすく、生成プロセスの効果的な制御を提供し、ユーザを満足させる文書を生成する。

論文の概要: ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

関連論文リスト