Fugu-MT 論文翻訳(概要): Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

論文の概要: Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

arxiv url: http://arxiv.org/abs/2603.11103v1
Date: Wed, 11 Mar 2026 09:23:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-13 14:46:25.515809
Title: Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining
Title（参考訳）: 再構築による理解 - LLM事前学習のソフトウェア開発プロセスの逆転
Authors: Zhiyuan Zeng, Yichi Zhang, Yong Shan, Kai Hua, Siyuan Fang, Zhaiyu Liu, Jiaheng Liu, Haozhe Wang, Yining Zheng, Ming Ding, Ke Shen, Ge Zhang, Wenhao Huang, Xipeng Qiu,
Abstract要約: 大規模言語モデル(LLM)は、複雑なソフトウェア工学に必要な、深く、長期にわたる推論に苦しむことが多い。本稿では,再構築による理解という,新しいパラダイムを提案する。マルチエージェントシミュレーションを用いて潜在エージェント軌道を合成するフレームワークを提案する。
参考スコア（独自算出の注目度）: 66.89012795621349
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While Large Language Models (LLMs) have achieved remarkable success in code generation, they often struggle with the deep, long-horizon reasoning required for complex software engineering. We attribute this limitation to the nature of standard pre-training data: static software repositories represent only the terminal state of an intricate intellectual process, abstracting away the intermediate planning, debugging, and iterative refinement. To bridge this gap, we propose a novel paradigm: understanding via reconstruction. We hypothesize that reverse-engineering the latent agentic trajectories -- the planning, reasoning, and debugging steps -- behind static repositories provides a far richer supervision signal than raw code alone. To operationalize this, we introduce a framework that synthesizes these trajectories using a multi-agent simulation. This process is grounded in the structural realities of the source repositories (e.g., dependency graphs and file hierarchies) to ensure fidelity. Furthermore, to guarantee the logical rigor of the synthetic data, we employ a search-based optimization technique that iteratively refines the Chain-of-Thought (CoT) reasoning to maximize the likelihood of the ground-truth code. Empirical results demonstrate that continuous pre-training on these reconstructed trajectories significantly enhances Llama-3-8B's performance across diverse benchmarks, including long-context understanding, coding proficiency, and agentic capabilities.
Abstract（参考訳）: LLM(Large Language Models)はコード生成において顕著な成功を収めていますが、複雑なソフトウェアエンジニアリングに必要な、深い、長い水平推論に悩まされることが少なくありません。静的ソフトウェアリポジトリは複雑な知的プロセスの終端状態を表現し、中間的な計画、デバッグ、反復的な改善を抽象化します。このギャップを埋めるために、再構築による理解という新しいパラダイムを提案する。静的リポジトリの背後にある、遅延エージェントの軌道 -- 計画、推論、デバッグステップ -- のリバースエンジニアリングは、生コード単独よりもはるかにリッチな監視信号を提供する、という仮説を立てています。これを運用するために,マルチエージェントシミュレーションを用いてこれらの軌道を合成するフレームワークを提案する。このプロセスは、ソースリポジトリ(例えば、依存グラフやファイル階層)の構造的現実に基礎を置いている。さらに, 合成データの論理的厳密性を保証するため, 探索に基づく最適化手法を用いて, 基幹符号の確率を最大化するために, 繰り返し改良を行う。実験により、再構成された軌道上での連続事前学習がLlama-3-8Bの性能を大幅に向上させることが実証された。

論文の概要: Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

関連論文リスト