Fugu-MT paper translation (abstract): One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models

Paper summary: One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models

arxiv url: http://arxiv.org/abs/2604.18839v1
Date: Mon, 20 Apr 2026 21:06:12 GMT
Status: translation complete
Last updated in system: 2026-04-22 22:41:49.495471
Title: One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models
Title (reference translation): Taking One Step Forward and K Steps Back: Improving Reasoning with Denoising Recursion Models
Authors: Chris Cameron, Wangzheng Wang, Nikita Ivanov, Ashmita Bhattacharyya, Didier Chételat, Yingxue Zhang
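Abstract summary: Denoising Recursion Models corrupt data with noise but train the model to reverse the corruption over multiple recursive steps. This strategy provides a tractable curriculum of intermediate states while better aligning training with testing and incentivizing non-greedy, forward-looking generation. The approach outperforms the Tiny Recursion Model (TRM) on ARC-AGI, where TRM recently achieved breakthrough performance.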
Reference score (independently computed attention score): 4.903188186588148
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Looped transformers scale computational depth without increasing parameter count by repeatedly applying a shared transformer block and can be used for iterative refinement, where each loop rewrites a full fixed-size prediction in parallel. On difficult problems, such as those that require search-like computation, reaching a highly structured solution starting from noise can require long refinement trajectories. Learning such trajectories is challenging when training specifies only the target solution and provides no supervision over the intermediate refinement path. Diffusion models tackle this issue by corrupting data with varying magnitudes of noise and training the model to reverse it in a single step. However, this process misaligns training and testing behaviour. We introduce Denoising Recursion Models, a method that similarly corrupts data with noise but trains the model to reverse the corruption over multiple recursive steps. This strategy provides a tractable curriculum of intermediate states, while better aligning training with testing and incentivizing non-greedy, forward-looking generation. Through extensive experiments, we show this approach outperforms the Tiny Recursion Model (TRM) on ARC-AGI, where it recently achieved breakthrough performance.
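Abstract (reference translation): Looped transformers scale computational depth without increasing the parameter count by repeatedly applying a shared transformer block, and they can be used for iterative refinement, in which each loop rewrites a full fixed-size prediction in parallel. On difficult problems, such as those requiring search-like computation, reaching a highly structured solution from a noisy start can require long refinement trajectories. Learning such trajectories is hard when training specifies only the target solution and gives no supervision over the intermediate refinement path. Diffusion models address this by corrupting data with noise of varying magnitude and training the model to reverse it in a single step. However, this process misaligns training and test-time behaviour. Denoising Recursion Models likewise corrupt data with noise, but train the model to reverse the corruption over multiple recursive steps. This strategy provides a tractable curriculum of intermediate states while better aligning training with testing and incentivizing non-greedy, forward-looking generation. Extensive experiments show that the approach outperforms the Tiny Recursion Model (TRM) on ARC-AGI.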
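
The contrast the abstract draws between a single-step denoising objective and a multi-step recursive one can be sketched in a few lines. This is only an illustration under stated assumptions, not the paper's implementation: the abstract does not specify the noise type, where the loss is applied, or how the step count is chosen. Gaussian noise, a loss on the final state only, and the names `corrupt`, `single_step_loss`, `recursive_loss`, and `make_toy_block` are all hypothetical, and the toy block is a hand-written stand-in for a trained shared transformer block.

```python
import random


def corrupt(target, sigma, rng):
    """Add Gaussian noise of scale sigma to each entry (assumed noise model)."""
    return [t + rng.gauss(0.0, sigma) for t in target]


def mse(a, b):
    """Mean squared error between two equal-length lists."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def single_step_loss(block, target, sigma, rng):
    """Diffusion-style objective: one application of the block must undo the noise."""
    noisy = corrupt(target, sigma, rng)
    return mse(block(noisy), target)


def recursive_loss(block, target, sigma, k, rng):
    """Multi-step objective: the same shared block is applied k times to the
    corrupted input; only the final state is scored (an assumption here)."""
    state = corrupt(target, sigma, rng)
    for _ in range(k):
        state = block(state)
    return mse(state, target)


def make_toy_block(answer):
    """Hypothetical stand-in for a trained block: moves each entry halfway
    toward a known answer. A real block would be a learned network."""
    def block(state):
        return [0.5 * s + 0.5 * a for s, a in zip(state, answer)]
    return block


target = [1.0, 0.0, 1.0, 1.0]
block = make_toy_block(target)

# Same seed for both calls, so both objectives see identical noise.
one = single_step_loss(block, target, sigma=1.0, rng=random.Random(0))
four = recursive_loss(block, target, sigma=1.0, k=4, rng=random.Random(0))
print(one, four)
```

With this toy block each pass halves the remaining error, so the four-step loss comes out smaller than the one-step loss on the same noisy input. In actual training the gradient would flow back through all k applications of the shared block, which is what lets the refinement loop seen at test time also be exercised during training.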

Related papers