Fugu-MT 論文翻訳(概要): Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs

論文の概要: Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs

arxiv url: http://arxiv.org/abs/2604.08016v1
Date: Thu, 09 Apr 2026 09:16:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-10 18:34:05.835858
Title: Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs
Title（参考訳）: 理由」を問う : 統一分類学とLCMにおける帰納的推論の調査
Authors: Moein Salimi, Shaygan Adim, Danial Parnian, Nima Alighardashi, Mahdi Jafari Siavoshani, Mohammad Hossein Rohban,
Abstract要約: 本稿では,Large Language Models (LLMs) における帰納的推論に関する最初の調査について述べる。この分野で広く普及している概念的混乱とタスク定義の合成に対処するため,我々は,先行作業を正式に分類する統合された2段階定義を確立する。文献を包括的に分類し,それらの帰納的タスク,データセット,基礎となる方法論,評価戦略に基づいて事前作業の分類を行う。
参考スコア（独自算出の注目度）: 6.369429101077227
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Regardless of its foundational role in human discovery and sense-making, abductive reasoning--the inference of the most plausible explanation for an observation--has been relatively underexplored in Large Language Models (LLMs). Despite the rapid advancement of LLMs, the exploration of abductive reasoning and its diverse facets has thus far been disjointed rather than cohesive. This paper presents the first survey of abductive reasoning in LLMs, tracing its trajectory from philosophical foundations to contemporary AI implementations. To address the widespread conceptual confusion and disjointed task definitions prevalent in the field, we establish a unified two-stage definition that formally categorizes prior work. This definition disentangles abduction into \textit{Hypothesis Generation}, where models bridge epistemic gaps to produce candidate explanations, and \textit{Hypothesis Selection}, where the generated candidates are evaluated and the most plausible explanation is chosen. Building upon this foundation, we present a comprehensive taxonomy of the literature, categorizing prior work based on their abductive tasks, datasets, underlying methodologies, and evaluation strategies. In order to ground our framework empirically, we conduct a compact benchmark study of current LLMs on abductive tasks, together with targeted comparative analyses across model sizes, model families, evaluation styles, and the distinct generation-versus-selection task typologies. Moreover, by synthesizing recent empirical results, we examine how LLM performance on abductive reasoning relates to deductive and inductive tasks, providing insights into their broader reasoning capabilities. Our analysis reveals critical gaps in current approaches--from static benchmark design and narrow domain coverage to narrow training frameworks and limited mechanistic understanding of abductive processes...
Abstract（参考訳）: 人間の発見と感覚形成における基礎的役割にかかわらず、帰納的推論 - 観察の最も確実な説明の推測 - は、Large Language Models (LLMs) において比較的過小評価されている。 LLMの急速な進歩にもかかわらず、帰納的推論とその多様な面の探索は、結合性よりもむしろ解離している。本稿では,LLMにおける帰納的推論に関する最初の調査を行い,哲学的基礎から現代AI実装への道筋をたどる。この分野で広く普及している概念的混乱とタスク定義の相違に対処するため、我々は、前処理を正式に分類する統合された2段階の定義を確立する。この定義は、推論を \textit{Hypothesis Generation} に分解し、モデルがエピステミックギャップをブリッジして候補説明を生成し、生成した候補を評価し、最も妥当な説明を選択する \textit{Hypothesis Selection} に分解する。本財団を基盤として,文献の包括的分類を提示し,それらの帰納的タスク,データセット,基礎となる方法論,評価戦略に基づいて事前作業の分類を行う。本研究では, モデルサイズ, モデルファミリ, 評価スタイル, 世代逆選択タスクのタイプを対象とする比較分析とともに, 現在のLCMの帰納的タスクに関するコンパクトなベンチマーク調査を行う。さらに,最近の経験的結果の合成により,帰納的推論におけるLCM性能が帰納的および帰納的タスクとどのように関連し,より広範な推論能力に関する洞察を与えるかを検討する。私たちの分析では、静的なベンチマーク設計や狭いドメインカバレッジから、狭いトレーニングフレームワーク、誘拐プロセスの機械的理解の制限まで、現在のアプローチにおける重要なギャップが明らかになりました。

論文の概要: Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs

関連論文リスト