Fugu-MT 論文翻訳(概要): RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving

論文の概要: RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving

arxiv url: http://arxiv.org/abs/2505.21577v2
Date: Fri, 06 Jun 2025 05:35:21 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-09 17:28:43.007981
Title: RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving
Title（参考訳）: RepoMaster: 複雑なタスク解決のためのGitHubリポジトリの自律的な探索と理解
Authors: Huacan Wang, Ziyi Ni, Shuo Zhang, Shuo Lu, Sen Hu, Ziyang He, Chen Hu, Jiaye Lin, Yifu Guo, Yuntao Du, Pin Lyu,
Abstract要約: RepoMasterは、複雑なタスクを解決するためにGitHubリポジトリを探索、再利用するために設計された、自律的なエージェントフレームワークである。 RepoMasterは関数呼び出しグラフ、モジュール依存グラフ、階層的なコードツリーを構築し、必須コンポーネントを特定する。新たにリリースしたGitTaskBenchでは、RepoMasterがタスクパス率を24.1%から62.9%に引き上げ、トークン使用率を95%削減しました。
参考スコア（独自算出の注目度）: 9.477917878478188
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: The ultimate goal of code agents is to solve complex tasks autonomously. Although large language models (LLMs) have made substantial progress in code generation, real-world tasks typically demand full-fledged code repositories rather than simple scripts. Building such repositories from scratch remains a major challenge. Fortunately, GitHub hosts a vast, evolving collection of open-source repositories, which developers frequently reuse as modular components for complex tasks. Yet, existing frameworks like OpenHands and SWE-Agent still struggle to effectively leverage these valuable resources. Relying solely on README files provides insufficient guidance, and deeper exploration reveals two core obstacles: overwhelming information and tangled dependencies of repositories, both constrained by the limited context windows of current LLMs. To tackle these issues, we propose RepoMaster, an autonomous agent framework designed to explore and reuse GitHub repositories for solving complex tasks. For efficient understanding, RepoMaster constructs function-call graphs, module-dependency graphs, and hierarchical code trees to identify essential components, providing only identified core elements to the LLMs rather than the entire repository. During autonomous execution, it progressively explores related components using our exploration tools and prunes information to optimize context usage. Evaluated on the adjusted MLE-bench, RepoMaster achieves a 110% relative boost in valid submissions over the strongest baseline OpenHands. On our newly released GitTaskBench, RepoMaster lifts the task-pass rate from 24.1% to 62.9% while reducing token usage by 95%. Our code and demonstration materials are publicly available at https://github.com/wanghuacan/RepoMaster.
Abstract（参考訳）: コードエージェントの最終的な目標は、複雑なタスクを自律的に解決することだ。大きな言語モデル(LLM)はコード生成に大きな進歩を遂げているが、現実のタスクは通常、単純なスクリプトではなく、本格的なコードリポジトリを必要とする。このようなリポジトリをゼロから構築することは、依然として大きな課題です。幸いなことに、GitHubは進化を続けるオープンソースリポジトリのコレクションをホストしており、開発者は複雑なタスクのモジュールコンポーネントとして頻繁に再利用している。しかし、OpenHandsやSWE-Agentのような既存のフレームワークは、これらの貴重なリソースを効果的に活用するのに苦労しています。 READMEファイルのみをリライニングすることは十分なガイダンスを提供し、より深い調査によって2つの中核的な障害が明らかになった。これらの問題に対処するために、複雑なタスクを解決するためにGitHubリポジトリを探索および再利用するために設計された、自律的なエージェントフレームワークであるRepoMasterを提案する。効率的な理解のために、RepoMasterは関数呼び出しグラフ、モジュール依存グラフ、階層的なコードツリーを構築し、必須コンポーネントを特定する。自律実行中に、我々の探索ツールを使用して関連するコンポーネントを徐々に探索し、コンテキスト使用を最適化する。調整済みのMLEベンチに基づいて評価されたRepoMasterは、最強のベースラインであるOpenHandsよりも110%の相対的なアップを実現している。新たにリリースしたGitTaskBenchでは、RepoMasterがタスクパス率を24.1%から62.9%に引き上げ、トークン使用率を95%削減しました。私たちのコードとデモ資料はhttps://github.com/wanghuacan/RepoMaster.comで公開されています。

論文の概要: RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving

関連論文リスト