Fugu-MT Paper Translation (Abstract): SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning

Paper summary: SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning

arxiv url: http://arxiv.org/abs/2601.21649v1
Date: Thu, 29 Jan 2026 12:49:25 GMT
Status: Translation complete
Last updated in system: 2026-01-30 16:22:49.821062
Title: SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning
Title (reference translation): SWE-Spot: Building Small Repo-Experts through Repository-Centric Learning
Authors: Jinjun Peng, Magnus Saebo, Tianjun Zhong, Yi-Jie Cheng, Junfeng Yang, Baishakhi Ray, Simin Chen, Yangruibo Ding
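Abstract summary: Small language models lack the strong inference-time generalization needed to work with complicated, unfamiliar codebases. This paper proposes Repository-Centric Learning (RCL), a paradigm shift that prioritizes vertical repository depth over horizontal task breadth. RCL yields higher training sample efficiency and lower inference costs.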
Reference score (independently computed attention score): 26.404563042035395
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: The deployment of coding agents in privacy-sensitive and resource-constrained environments drives the demand for capable open-weight Small Language Models (SLMs). However, they suffer from a fundamental capability gap: unlike frontier large models, they lack the strong inference-time generalization needed to work with complicated, unfamiliar codebases. We identify that the prevailing Task-Centric Learning (TCL) paradigm, which scales exposure across disparate repositories, fails to address this limitation. In response, we propose Repository-Centric Learning (RCL), a paradigm shift that prioritizes vertical repository depth over horizontal task breadth, suggesting that SLMs must internalize the "physics" of a target software environment through parametric knowledge acquisition rather than attempting to recover it via costly inference-time search. Following this new paradigm, we design a four-unit Repository-Centric Experience, which transforms static codebases into interactive learning signals, to train SWE-Spot-4B, a family of highly compact models built as repo-specialized experts. SWE-Spot-4B breaks established scaling trends, outperforming larger open-weight models (e.g., CWM by Meta, Qwen3-Coder-30B) and surpassing or matching efficiency-focused commercial models (e.g., GPT-4.1-mini, GPT-5-nano) across multiple SWE tasks. Further analysis reveals that RCL yields higher training sample efficiency and lower inference costs, emphasizing that for building efficient intelligence, repository mastery is a distinct and necessary dimension that complements general coding capability.
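Abstract (reference translation): The deployment of coding agents in privacy-sensitive, resource-constrained environments is accelerating demand for capable open-weight Small Language Models (SLMs). However, these models suffer from a fundamental capability gap: unlike frontier large models, they lack the strong inference-time generalization needed to handle complicated, unfamiliar codebases. The Task-Centric Learning (TCL) paradigm, which scales exposure across disparate repositories, cannot address this limitation. This work therefore proposes Repository-Centric Learning (RCL), a paradigm shift that prioritizes vertical repository depth over horizontal task breadth. Following this paradigm, the authors design a four-unit Repository-Centric Experience that transforms static codebases into interactive learning signals, and use it to train SWE-Spot-4B. SWE-Spot-4B breaks established scaling trends, outperforming larger open-weight models (e.g., CWM by Meta, Qwen3-Coder-30B) and surpassing or matching efficiency-focused commercial models (e.g., GPT-4.1-mini, GPT-5-nano) across multiple SWE tasks. Further analysis shows that RCL improves training sample efficiency and lowers inference costs, underscoring that repository mastery is a distinct and necessary dimension that complements general coding capability when building efficient intelligence.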


Related papers