Fugu-MT 論文翻訳(概要): Tiny Autoregressive Recursive Models

論文の概要: Tiny Autoregressive Recursive Models

arxiv url: http://arxiv.org/abs/2603.08082v1
Date: Mon, 09 Mar 2026 08:22:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:15.694793
Title: Tiny Autoregressive Recursive Models
Title（参考訳）: Tiny Autoregressive Recursive Models
Authors: Paulius Rauba, Claudio Fanconi, Mihaela van der Schaar,
Abstract要約: Tiny Recursive Models (TRMs) は最近のARC-AGIで顕著な性能を示した。本稿では、自己回帰TRMを提案し、小さな自己回帰タスクで評価する。期待に反して、完全なAutoregressive TRMアーキテクチャによる信頼性の高いパフォーマンス向上は見つからない。
参考スコア（独自算出の注目度）: 48.902738432071935
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Tiny Recursive Models (TRMs) have recently demonstrated remarkable performance on ARC-AGI, showing that very small models can compete against large foundation models through a two-step refinement mechanism that updates an internal reasoning state $z$ and the predicted output $y$. Naturally, such refinement is of interest for any predictor; it is therefore natural to wonder whether the TRM mechanism could be effectively re-adopted in autoregressive models. However, TRMs cannot be simply compared to standard models because they lack causal predictive structures and contain persistent latent states that make it difficult to isolate specific performance gains. In this paper, we propose the Autoregressive TRM and evaluate it on small autoregressive tasks. To understand its efficacy, we propose a suite of models that gradually transform a standard Transformer to a Tiny Autoregressive Recursive Model in a controlled setting that fixes the block design, token stream, and next-token objective. Across compute-matched experiments on character-level algorithmic tasks, we surprisingly find that there are some two-level refinement baselines that show strong performance. Contrary to expectations, we find no reliable performance gains from the full Autoregressive TRM architecture. These results offer potential promise for two-step refinement mechanisms more broadly but caution against investing in the autoregressive TRM-specific model as a fruitful research direction.
Abstract（参考訳）: Tiny Recursive Models (TRMs) は最近のARC-AGIで顕著な性能を示しており、非常に小さなモデルでは内部推論状態の$z$と予測出力$y$を更新する2段階の改良機構によって大きな基盤モデルと競合できることを示している。当然、このような改良は任意の予測子にとって興味深いものであり、TRM機構が自己回帰モデルで効果的に再適応できるかどうか疑問に思うのは当然である。しかし、TRMは因果予測構造が欠如し、特定の性能向上の分離が困難になる永続的な潜伏状態を含むため、標準モデルと単純に比較することはできない。本稿では, 自己回帰TRMを提案し, 小さな自己回帰タスクで評価する。そこで本研究では,標準トランスフォーマーからTiny Autoregressive Recursive Modelへ,ブロック設計やトークンストリーム,次に注意すべき目的を段階的に修正するモデルセットを提案する。文字レベルのアルゴリズムタスクに関する計算マッチング実験全体において、強い性能を示す2段階の洗練されたベースラインがあることは驚くべきことである。期待に反して、完全なAutoregressive TRMアーキテクチャによる信頼性の高いパフォーマンス向上は見つからない。これらの結果は、より広範に2段階の精錬機構を期待できるが、実りある研究方向として自己回帰的TRM特化モデルへの投資には注意が必要である。

論文の概要: Tiny Autoregressive Recursive Models

関連論文リスト