Fugu-MT Paper Translation (Abstract): RepoDoc: A Knowledge Graph-Based Framework to Automatic Documentation Generation and Incremental Updates

Paper overview: RepoDoc: A Knowledge Graph-Based Framework to Automatic Documentation Generation and Incremental Updates

arxiv url: http://arxiv.org/abs/2604.26523v1
Date: Wed, 29 Apr 2026 10:43:02 GMT
Status: Translation complete
Last updated in system: 2026-04-30 15:59:36.363249
Title: RepoDoc: A Knowledge Graph-Based Framework to Automatic Documentation Generation and Incremental Updates
Title (reference translation): RepoDoc: A Knowledge Graph-Based Framework for Automatic Documentation Generation and Incremental Updates
Authors: Dong Xu, Mingwei Liu, Xiwen Wang, Jianfeng Zhong, Zibin Zheng
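Abstract summary: This paper proposes RepoDoc, a system that uses a repository knowledge graph (RepoKG) as the semantic foundation for the entire documentation lifecycle. For incremental updates, it cuts update time by 73% and token usage by 77%, and achieves 10.2% higher update recall. Evaluated on 24 repositories across 8 programming languages, RepoDoc substantially outperforms state-of-the-art alternatives.
Reference score (independently computed attention score): 41.56228301287882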
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Maintaining up-to-date, comprehensive documentation for large codebases is a persistent challenge. Recent progress in automated documentation has moved from template-based rules to large language models (LLMs), yet existing tools still process source code as flat fragments, producing isolated documents that lack semantic structure. This design also leads to excessive token consumption and slow generation, while failing to capture how code changes propagate across dependencies. We propose RepoDoc, a system that uses a repository knowledge graph (RepoKG) as the semantic foundation for the entire documentation lifecycle. Our framework consists of three stages: (1) RepoKG construction, which extracts code entities and their relationships; (2) module clustering, which groups code into functionally cohesive, hierarchical units; and (3) skillful agent-based generation, which queries the graph to create modular, cross-referenced documentation with auto-generated Mermaid diagrams. For incremental maintenance, a semantic impact propagation mechanism navigates the RepoKG bidirectionally to pinpoint all affected parts, allowing selective, targeted regeneration. Evaluated on 24 repositories across 8 programming languages, RepoDoc substantially outperforms state-of-the-art alternatives. It improves API coverage by 32.5% and completeness by 10.4%, while generating documentation 3x faster with 85% fewer tokens. For incremental updates, it cuts update time by 73% and token usage by 77%, and achieves 10.2% higher update recall, more accurately reflecting code changes in the regenerated documentation. The source code and experimental artifacts are available at https://github.com/SYSUSELab/RepoDoc.
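Abstract (reference translation): Keeping documentation for large codebases current and comprehensive is a persistent challenge. Recent progress in automated documentation has shifted from template-based rules to large language models (LLMs), but existing tools still process source code as flat fragments and produce isolated documents with no semantic structure. This design also causes excessive token consumption and slow generation, and it fails to capture how code changes propagate across dependencies. This paper proposes RepoDoc, a system that uses a repository knowledge graph (RepoKG) as the semantic foundation for the entire documentation lifecycle. The framework has three stages: (1) RepoKG construction, which extracts code entities and their relationships; (2) module clustering, which groups code into functionally cohesive, hierarchical units; and (3) skillful agent-based generation, which queries the graph to produce modular, cross-referenced documentation. For incremental maintenance, a semantic impact propagation mechanism navigates the RepoKG bidirectionally to pinpoint every affected part, enabling selective, targeted regeneration. Evaluated on 24 repositories across 8 programming languages, RepoDoc substantially outperforms state-of-the-art alternatives. It improves API coverage by 32.5% and completeness by 10.4%, generates documentation 3x faster, and uses 85% fewer tokens. For incremental updates, it cuts update time by 73% and token usage by 77%, and achieves 10.2% higher update recall, reflecting code changes more accurately in the regenerated documentation. The source code and experimental artifacts are available at https://github.com/SYSUSELab/RepoDoc.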
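
The abstract describes the incremental-update step only at a high level: start from the changed code entities and walk the RepoKG in both directions to find what needs regenerating. The sketch below is one possible reading of that idea, not RepoDoc's actual implementation. The class `ToyRepoGraph`, its methods, the entity names, and the choice of an unbounded breadth-first traversal are all assumptions made for illustration.

```python
from collections import defaultdict, deque


class ToyRepoGraph:
    """Hypothetical stand-in for a repository knowledge graph (not RepoDoc's API)."""

    def __init__(self):
        self.deps = defaultdict(set)   # entity -> entities it depends on
        self.rdeps = defaultdict(set)  # entity -> entities that depend on it

    def add_dependency(self, src, dst):
        """Record that `src` depends on (e.g. calls or imports) `dst`."""
        self.deps[src].add(dst)
        self.rdeps[dst].add(src)

    @staticmethod
    def _reach(start, edges):
        """Return every entity reachable from `start` along `edges` (BFS)."""
        seen = set(start)
        queue = deque(start)
        while queue:
            node = queue.popleft()
            for nxt in edges.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        return seen

    def affected(self, changed):
        """Union of the downstream (dependencies) and upstream (dependents) walks."""
        changed = set(changed)
        return self._reach(changed, self.deps) | self._reach(changed, self.rdeps)


# Made-up entities: two entry points share one service that uses a DB helper.
graph = ToyRepoGraph()
graph.add_dependency("api.handler", "service.process")
graph.add_dependency("cli.main", "service.process")
graph.add_dependency("service.process", "db.query")
graph.add_dependency("utils.fmt", "utils.log")  # unrelated component

impacted = graph.affected({"service.process"})
print(sorted(impacted))
```

In this toy graph, a change to `service.process` marks its callers and its callee for regeneration, while the unrelated `utils.*` entities are left alone. A real system would presumably also bound or weight the traversal and map entities back to documentation modules; the abstract does not specify those details.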

Related papers