Fugu-MT 論文翻訳(概要): CWM: An Open-Weights LLM for Research on Code Generation with World Models

論文の概要: CWM: An Open-Weights LLM for Research on Code Generation with World Models

arxiv url: http://arxiv.org/abs/2510.02387v1
Date: Tue, 30 Sep 2025 21:47:10 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-06 16:35:52.067479
Title: CWM: An Open-Weights LLM for Research on Code Generation with World Models
Title（参考訳）: CWM:世界モデルを用いたコード生成研究のためのオープンウェイトLLM
Authors: FAIR CodeGen team, Quentin Carbonneaux, Gal Cohen, Jonas Gehring, Jacob Kahn, Jannik Kossen, Felix Kreuk, Emily McMilin, Michel Meyer, Yuxiang Wei, David Zhang, Kunhao Zheng, Jordi Armengol-Estapé, Pedram Bashiri, Maximilian Beck, Pierre Chambon, Abhishek Charnalia, Chris Cummins, Juliette Decugis, Zacharias V. Fisches, François Fleuret, Fabian Gloeckle, Alex Gu, Michael Hassid, Daniel Haziza, Badr Youbi Idrissi, Christian Keller, Rahul Kindi, Hugh Leather, Gallil Maimon, Aram Markosyan, Francisco Massa, Pierre-Emmanuel Mazaré, Vegard Mella, Naila Murray, Keyur Muzumdar, Peter O'Hearn, Matteo Pagliardini, Dmitrii Pedchenko, Tal Remez, Volker Seeker, Marco Selvi, Oren Sultan, Sida Wang, Luca Wehrstedt, Ori Yoran, Lingming Zhang, Taco Cohen, Yossi Adi, Gabriel Synnaeve,
Abstract要約: 我々は、Code World Model (CWM) をリリースし、世界モデルを用いたコード生成の研究を進めます。我々はPythonインタプリタとエージェントDocker環境から大量の観察-アクショントラジェクトリをトレーニング中である。我々は,ワールドモデルがエージェントプログラミングの恩恵を受けるための第一歩として,Pythonコード実行のステップバイステップシミュレーションを実現し,推論が後者にどのようなメリットをもたらすかを示す。
参考スコア（独自算出の注目度）: 78.0342683953353
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We release Code World Model (CWM), a 32-billion-parameter open-weights LLM, to advance research on code generation with world models. To improve code understanding beyond what can be learned from training on static code alone, we mid-train CWM on a large amount of observation-action trajectories from Python interpreter and agentic Docker environments, and perform extensive multi-task reasoning RL in verifiable coding, math, and multi-turn software engineering environments. With CWM, we provide a strong testbed for researchers to explore the opportunities world modeling affords for improving code generation with reasoning and planning in computational environments. We present first steps of how world models can benefit agentic coding, enable step-by-step simulation of Python code execution, and show early results of how reasoning can benefit from the latter. CWM is a dense, decoder-only LLM trained with a context size of up to 131k tokens. Independent of its world modeling capabilities, CWM offers strong performance on general coding and math tasks: it reaches pass@1 scores of 65.8% on SWE-bench Verified (with test-time scaling), 68.6% on LiveCodeBench, 96.6% on Math-500, and 76.0% on AIME 2024. To support further research on code world modeling, we release model checkpoints after mid-training, SFT, and RL.
Abstract（参考訳）: 我々は,32ビリオンパラメタのオープンウェイトLLMであるCode World Model (CWM) をリリースし,世界モデルを用いたコード生成の研究を進めた。静的コードだけでのトレーニングから学んだことを超えて、コード理解を改善するために、PythonインタプリタとエージェントDocker環境から大量の観察-アクショントラジェクトリをトレーニングし、検証可能なコーディング、数学、マルチターンソフトウェアエンジニアリング環境において、広範なマルチタスク推論RLを実行しました。 CWMでは、研究者が計算環境における推論と計画によるコード生成を改善するための世界モデリングの機会を探るための強力なテストベッドを提供する。我々は,ワールドモデルがエージェントプログラミングの恩恵を受けるための第一歩として,Pythonコード実行のステップバイステップシミュレーションを実現し,推論が後者のメリットを享受する方法の早期結果を示す。 CWMは、最大131kのトークンでトレーニングされた、密度の高いデコーダのみのLLMである。ワールドモデリング機能とは独立して、CWMは一般的なコーディングタスクと数学タスクで強力なパフォーマンスを提供している。パス@1スコアはSWE-bench Verifiedで65.8%、LiveCodeBenchで68.6%、Math-500で96.6%、AIME 2024で76.0%である。コードワールドモデリングのさらなる研究を支援するため、中級トレーニング後のモデルチェックポイント、SFT、RLをリリースする。

論文の概要: CWM: An Open-Weights LLM for Research on Code Generation with World Models

関連論文リスト