Fugu-MT 論文翻訳(概要): The Periodic Table of LLM Reasoning: A Structured Survey of Reasoning Paradigms, Methods, and Failure Modes

論文の概要: The Periodic Table of LLM Reasoning: A Structured Survey of Reasoning Paradigms, Methods, and Failure Modes

arxiv url: http://arxiv.org/abs/2606.11470v1
Date: Tue, 09 Jun 2026 21:59:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-11 16:42:38.190538
Title: The Periodic Table of LLM Reasoning: A Structured Survey of Reasoning Paradigms, Methods, and Failure Modes
Title（参考訳）: LLM推論の周期表:推論パラダイム,方法,障害モードの構造的調査
Authors: Avinash Anand, Mahisha Ramesh, Avni Mittal, Ashutosh Kumar, Erik Cambria, Zhengkui Wang, Timothy Liu, Aik Beng Ng, Simon See, Rajiv Ratn Shah,
Abstract要約: この調査は、arXiv、Semantic Scholar、Google Scholar、Papers with Code、ACL Anthologyの300以上の最近の論文の体系的な分析を提供する。本稿では、推論研究の構造的分類を導入し、チェーン・オブ・ソート推論、マルチホップ推論、数学的推論、常識推論、視覚的および時間的推論、コードとアルゴリズム的推論、検索強化推論、ツール強化およびエージェント的推論、強化学習に基づく推論について紹介する。我々は、繰り返し発生する制限と失敗モードを合成する。例えば、幻覚の推論、脆い多段階推論、弱い因果的抽象、貧弱な交差などである。
参考スコア（独自算出の注目度）: 55.42597391158023
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) have achieved strong performance across natural language processing tasks, yet reliable reasoning remains an open challenge. Although modern LLMs show progress in structured inference, multi-step problem solving, and contextual understanding, their reasoning behavior is often inconsistent and sensitive to prompting strategies, task design, and model scale. This survey provides a systematic analysis of more than 300 recent papers from arXiv, Semantic Scholar, Google Scholar, Papers with Code, and the ACL Anthology to examine how reasoning capabilities emerge in LLMs and where they fail. We make three main contributions. First, we introduce a structured taxonomy of LLM reasoning research, covering Chain-of-Thought reasoning, multi-hop reasoning, mathematical reasoning, common sense reasoning, visual and temporal reasoning, code and algorithmic reasoning, retrieval-augmented reasoning, tool-augmented and agentic reasoning, and reinforcement learning-based reasoning. Second, we analyze methodological trends across these paradigms, including prompting methods, model architectures, training objectives, reward modeling, and evaluation benchmarks. Third, we synthesize recurring limitations and failure modes, such as reasoning hallucinations, brittle multi-step inference, weak causal abstraction, and poor cross-domain generalization. By organizing a rapidly expanding literature, this survey offers a unified view of the current capabilities and limitations of reasoning in LLMs. We also identify emerging research directions, including meta-reasoning, self-evolving reasoning frameworks, multimodal reasoning, and socially grounded reasoning. Overall, this work aims to serve as a reference for developing more robust, interpretable, and generalizable reasoning systems in future language models.
Abstract（参考訳）: 大きな言語モデル(LLM)は、自然言語処理タスク全体にわたって強力なパフォーマンスを実現していますが、信頼できる推論は依然としてオープンな課題です。現代のLCMは、構造化推論、多段階問題解決、文脈理解の進歩を示すが、それらの推論動作は、しばしば矛盾し、戦略、タスクデザイン、モデルスケールの促進に敏感である。この調査では、arXiv、Semantic Scholar、Google Scholar、Papers with Code、ACL Anthologyの300以上の最近の論文を体系的に分析し、LCMにおける推論機能がどのように出現し、どこで失敗するのかを調べた。主な貢献は3つある。まず,LLM推論研究の構造的分類を導入し,階層推論,マルチホップ推論,数学的推論,共通感覚推論,視覚的・時間的推論,コードとアルゴリズム推論,検索強化推論,ツール強化・エージェント推論,強化学習に基づく推論について紹介する。第2に,手法,モデルアーキテクチャ,トレーニング目標,報酬モデリング,評価ベンチマークなど,これらのパラダイムの方法論的傾向を分析する。第三に、幻覚の推論、脆弱な多段階推論、因果抽象化の弱さ、ドメイン間の一般化の弱さなど、繰り返し発生する制限や障害モードを合成する。急速に拡大する文献を整理することにより、この調査はLLMにおける現在の能力と推論の限界の統一的な見解を提供する。また,メタ推論,自己進化的推論フレームワーク,マルチモーダル推論,社会的基盤的推論など,新たな研究の方向性を明らかにした。全体として、この研究は将来の言語モデルにおいてより堅牢で解釈可能で一般化可能な推論システムを開発するためのリファレンスとして機能することを目的としている。

論文の概要: The Periodic Table of LLM Reasoning: A Structured Survey of Reasoning Paradigms, Methods, and Failure Modes

関連論文リスト