Fugu-MT 論文翻訳(概要): Beyond Meta-Reasoning: Metacognitive Consolidation for Self-Improving LLM Reasoning

論文の概要: Beyond Meta-Reasoning: Metacognitive Consolidation for Self-Improving LLM Reasoning

arxiv url: http://arxiv.org/abs/2604.17399v1
Date: Sun, 19 Apr 2026 12:02:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.506287
Title: Beyond Meta-Reasoning: Metacognitive Consolidation for Self-Improving LLM Reasoning
Title（参考訳）: メタ推論を超えて:自己改善LDM推論のためのメタ認知統合
Authors: Ziqing Zhuang, Linhai Zhang, Jiasheng Si, Deyu Zhou, Yulan He,
Abstract要約: 大規模言語モデル(LLM)は強力な推論能力を示している。さらなる改善に向けた有望な方向として、メタレアソン化に注目が移っている。メタ認知的統合(Metacognitive Consolidation)は,メタ認知的体験を再利用可能な知識に集約する新しいフレームワークである。
参考スコア（独自算出の注目度）: 42.534586884018076
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have demonstrated strong reasoning capabilities, and as existing approaches for enhancing LLM reasoning continue to mature, increasing attention has shifted toward meta-reasoning as a promising direction for further improvement. However, most existing meta-reasoning methods remain episodic: they focus on executing complex meta-reasoning routines within individual instances, but ignore the accumulation of reusable meta-reasoning skills across instances, leading to recurring failure modes and repeatedly high metacognitive effort. In this paper, we introduce Metacognitive Consolidation, a novel framework in which a model consolidates metacognitive experience from past reasoning episodes into reusable knowledge that improves future meta-reasoning. We instantiate this framework by structuring instance-level problem solving into distinct roles for reasoning, monitoring, and control to generate rich, attributable meta-level traces. These traces are then consolidated through a hierarchical, multi-timescale update mechanism that gradually forms evolving meta-knowledge. Experimental results demonstrate consistent performance gains across benchmarks and backbone models, and show that performance improves as metacognitive experience accumulates over time.
Abstract（参考訳）: 大規模言語モデル(LLM)は、強力な推論能力を示しており、LLM推論を強化する既存のアプローチが成熟を続けるにつれ、さらなる改善に向けた将来的な方向としてメタ推論へと注目が移りつつある。個々のインスタンス内で複雑なメタ推論ルーチンを実行することに重点を置いているが、インスタンス全体で再利用可能なメタ推論スキルの蓄積を無視し、繰り返し発生する障害モードと、繰り返し発生するメタ認知の取り組みを無視する。本稿では,メタ認知統合(Metacognitive Consolidation)について紹介する。メタ認知統合(Metacognitive Consolidation)は,モデルが過去の推論エピソードからメタ認知体験を,将来的なメタ推論を改善する再利用可能な知識に集約する新しいフレームワークである。我々は、このフレームワークをインスタンスレベルの問題を推論、監視、制御の異なる役割に構造化して、リッチで帰属可能なメタレベルのトレースを生成することでインスタンス化する。これらのトレースは、進化するメタ知識を徐々に形成する階層的なマルチタイム更新メカニズムによって統合される。実験の結果,ベンチマークモデルとバックボーンモデル間で一貫した性能向上が示され,メタ認知経験が時間の経過とともに蓄積されるにつれて,性能が向上することが示された。

論文の概要: Beyond Meta-Reasoning: Metacognitive Consolidation for Self-Improving LLM Reasoning

関連論文リスト