Fugu-MT 論文翻訳(概要): ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs

論文の概要: ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs

arxiv url: http://arxiv.org/abs/2506.15211v1
Date: Wed, 18 Jun 2025 07:44:09 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-19 19:35:51.576204
Title: ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs
Title（参考訳）: プロト推論 : LLMにおける一般化可能な推論の基礎としてのプロトタイプ
Authors: Feng He, Zijun Chen, Xinnian Liang, Tingting Ma, Yunqi Qiu, Shuangzhi Wu, Junchi Yan,
Abstract要約: ProtoReasoningは、大規模推論モデルの推論能力を高めるフレームワークである。 ProtoReasoningは問題を対応するプロトタイプ表現に変換する。 ProtoReasoningは論理的推論に基づくベースラインモデルよりも4.7%改善されている。
参考スコア（独自算出の注目度）: 54.154593699263074
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advances in Large Reasoning Models (LRMs) trained with Long Chain-of-Thought (Long CoT) reasoning have demonstrated remarkable cross-domain generalization capabilities. However, the underlying mechanisms supporting such transfer remain poorly understood. We hypothesize that cross-domain generalization arises from shared abstract reasoning prototypes -- fundamental reasoning patterns that capture the essence of problems across domains. These prototypes minimize the nuances of the representation, revealing that seemingly diverse tasks are grounded in shared reasoning structures.Based on this hypothesis, we propose ProtoReasoning, a framework that enhances the reasoning ability of LLMs by leveraging scalable and verifiable prototypical representations (Prolog for logical reasoning, PDDL for planning).ProtoReasoning features: (1) an automated prototype construction pipeline that transforms problems into corresponding prototype representations; (2) a comprehensive verification system providing reliable feedback through Prolog/PDDL interpreters; (3) the scalability to synthesize problems arbitrarily within prototype space while ensuring correctness. Extensive experiments show that ProtoReasoning achieves 4.7% improvement over baseline models on logical reasoning (Enigmata-Eval), 6.3% improvement on planning tasks, 4.0% improvement on general reasoning (MMLU) and 1.0% on mathematics (AIME24). Significantly, our ablation studies confirm that learning in prototype space also demonstrates enhanced generalization to structurally similar problems compared to training solely on natural language representations, validating our hypothesis that reasoning prototypes serve as the foundation for generalizable reasoning in large language models.
Abstract（参考訳）: 近年,Long Chain-of-Thought (Long CoT) 推論で訓練されたLarge Reasoning Models (LRMs) の進歩により,ドメイン間の一般化能力が著しく向上した。しかし、そのような転移を支えるメカニズムはいまだに理解されていない。ドメイン間の一般化は、ドメイン間の問題の本質を捉える基本的な推論パターンである、共通の抽象的推論のプロトタイプから生じる、という仮説を立てる。これらのプロトタイプは、表現のニュアンスを最小限に抑え、一見多様なタスクが共有推論構造に基礎を置いていることを明らかにする。この仮説に基づいて、スケーラブルで検証可能なプロトタイプ表現(論理推論のProlog、計画のためのPDDL)を活用してLLMの推論能力を高めるフレームワークであるProtoReasoningを提案する。 ProtoReasoning の特徴は,(1) 問題を対応するプロトタイプ表現に変換する自動プロトタイプ構築パイプライン,(2) Prolog/PDDLインタプリタによる信頼性の高いフィードバックを提供する総合的な検証システム,(3) プロトタイプ空間内で問題を任意に合成し,正確性を確保するスケーラビリティである。大規模実験により, 論理的推論に基づくベースラインモデル(Enigmata-Eval)よりも4.7%, 計画タスクが6.3%, 一般推論(MMLU)が4.0%, 数学が1.0%向上した(AIME24)。本研究は, 原型空間における学習が, 自然言語表現のみによる学習に比べて, 構造的に類似した問題への一般化の促進を証明し, 推論プロトタイプが大規模言語モデルにおける一般化可能な推論の基礎となるという仮説を検証した。

論文の概要: ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs

関連論文リスト