Fugu-MT 論文翻訳(概要): DocSync: Agentic Documentation Maintenance via Critic-Guided Reflexion

論文の概要: DocSync: Agentic Documentation Maintenance via Critic-Guided Reflexion

arxiv url: http://arxiv.org/abs/2605.02163v1
Date: Mon, 04 May 2026 02:41:33 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.111465
Title: DocSync: Agentic Documentation Maintenance via Critic-Guided Reflexion
Title（参考訳）: DocSync:批判的ガイド反射によるエージェントドキュメンテーションメンテナンス
Authors: Sidhesh Badrinarayan, Adithya Parthasarathy,
Abstract要約: DocSyncは、ドキュメントのメンテナンスを構造的に基盤付けられた反復的な生成タスクとしてフレーム化するエージェントワークフローである。 LoRA適応小言語モデルを用いたDocSyncの資源制約実装を実証的に評価する。このAST対応エージェントアプローチが標準エンコーダ・デコーダのベースラインを大幅に上回ることを示す。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Software documentation frequently drifts from executable logic as codebases evolve, creating technical debt that degrades maintainability and causes downstream API misuse. While static analysis tools can detect the absence of documentation, they cannot evaluate its semantic consistency. Conversely, standard Large Language Models (LLMs) offer generative flexibility but frequently hallucinate when updating documentation without deep structural awareness of the underlying code. To address this gap, we propose DocSync, an agentic workflow that frames documentation maintenance as a structurally grounded, iterative generation task. DocSync bridges syntactic changes and natural language descriptions by fusing Abstract Syntax Tree (AST) representations and Retrieval-Augmented Generation (RAG) to provide dependency-aware context. Furthermore, to ensure factual consistency, we incorporate a critic-guided refinement loop based on the Reflexion paradigm, allowing the model to self-correct candidate updates against the source code. We empirically evaluate a resource-constrained implementation of DocSync-using a LoRA-adapted small language model - on a proxy code-to-text maintenance task. Our findings demonstrate that this AST-aware agentic approach substantially outperforms standard encoder-decoder baselines across semantic alignment, summary-line faithfulness, and automated judge preferences (e.g., achieving an automated judge score of 3.44/5.0 compared to 1.91 for CodeT5-base). Crucially, the iterative critic loop yields measurable improvements in semantic correctness without requiring scaled-up parameter counts. These results provide strong evidence that coupling structural retrieval with agentic refinement is a highly promising direction for autonomously mitigating documentation debt.
Abstract（参考訳）: コードベースが進化するにつれて、ソフトウェアドキュメンテーションはしばしば実行可能なロジックから逸脱し、保守性を低下させ、ダウンストリームAPIの誤用を引き起こす技術的負債を生み出します。静的解析ツールはドキュメントの欠如を検出することができるが、セマンティックな一貫性を評価することはできない。逆に、標準のLarge Language Models (LLMs) は、生成の柔軟性を提供するが、基盤となるコードに深い構造的意識を持たずにドキュメントを更新する際には、しばしば幻覚を与える。このギャップに対処するために、DocSyncを提案する。DocSyncは、ドキュメントのメンテナンスを構造的に基盤付けられた反復的な生成タスクとしてフレーム化するエージェントワークフローである。 DocSyncは、AST(Abstract Syntax Tree)表現とRAG(Retrieval-Augmented Generation)を融合して、構文変更と自然言語記述をブリッジして、依存性を意識したコンテキストを提供する。さらに、事実整合性を確保するために、リフレクションパラダイムに基づく批判誘導改良ループを導入し、ソースコードに対する候補更新を自己修正する。本稿では,ロラ適応小言語モデルを用いたDocSyncのリソース制約実装を,プロキシコードからテキストへのメンテナンスタスクで実証的に評価する。このAST対応エージェントアプローチは, 意味的アライメント, 要約線忠実度, 自動判断選好において, 標準エンコーダ・デコーダベースラインを大幅に上回っている(例えば, CodeT5ベースでは 1.91 に対して, 自動判断スコア 3.44/5.0 を達成している)。重要なことに、反復的批判ループは、スケールアップされたパラメータカウントを必要とせずに意味的正当性を測定可能な改善をもたらす。これらの結果は, 構造検索とエージェントリファインメントの結合が, ドキュメンテーション債務を自律的に軽減するための極めて有望な方向であることを示す。

論文の概要: DocSync: Agentic Documentation Maintenance via Critic-Guided Reflexion

関連論文リスト