Fugu-MT 論文翻訳(概要): Self-Aware Markov Models for Discrete Reasoning

論文の概要: Self-Aware Markov Models for Discrete Reasoning

arxiv url: http://arxiv.org/abs/2603.16661v1
Date: Tue, 17 Mar 2026 15:30:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.372711
Title: Self-Aware Markov Models for Discrete Reasoning
Title（参考訳）: 離散推論のための自己認識マルコフモデル
Authors: Gregor Kornhardt, Jannis Chemseddine, Christian Wald, Gabriele Steidl,
Abstract要約: 本稿では,Markovトランジションカーネルの学習方法を紹介する。この設計によりトークンを再マッピングすることができ、モデルが以前のミスを修正することができる。 Sudoku-Extremeデータセットでは、95%の妥当性で、他のフローベース手法よりも明らかに優れている。
参考スコア（独自算出の注目度）: 8.161697757509701
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Standard masked discrete diffusion models face limitations in reasoning tasks due to their inability to correct their own mistakes on the masking path. Since they rely on a fixed number of denoising steps, they are unable to adjust their computation to the complexity of a given problem. To address these limitations, we introduce a method based on learning a Markov transition kernel that is trained on its own outputs. This design enables tokens to be remasked, allowing the model to correct its previous mistakes. Furthermore, we do not need a fixed time schedule but use a trained stopping criterion. This allows for adaptation of the number of function evaluations to the difficulty of the reasoning problem. Our adaptation adds two lightweight prediction heads, enabling reuse and fine-tuning of existing pretrained models. On the Sudoku-Extreme dataset we clearly outperform other flow based methods with a validity of 95%. For the Countdown-4 we only need in average of 10 steps to solve almost 96% of them correctly, while many problems can be solved already in 2 steps.
Abstract（参考訳）: 標準的なマスク付き離散拡散モデルは、マスキングパスにおける自身の誤りを修正することができないため、推論タスクの制限に直面している。与えられた問題の複雑さに合わせて計算を調整することはできない。これらの制約に対処するために,Markovトランジションカーネルを学習し,自身の出力に基づいて学習する手法を提案する。この設計によりトークンを再マッピングすることができ、モデルが以前のミスを修正することができる。さらに、固定時間スケジュールは必要とせず、訓練された停止基準を使用する。これにより、関数評価の回数を推論問題の難しさに適応させることができる。我々の適応は2つの軽量な予測ヘッドを追加し、既存の事前学習モデルの再利用と微調整を可能にした。 Sudoku-Extremeデータセットでは、95%の妥当性で、他のフローベース手法よりも明らかに優れている。 Countdown-4では、およそ96%の問題を正しく解くのに平均10ステップしか必要とせず、2ステップですでに多くの問題が解決できる。

論文の概要: Self-Aware Markov Models for Discrete Reasoning

関連論文リスト