Fugu-MT 論文翻訳(概要): Where Reasoning Breaks: Logic-Aware Path Selection by Controlling Logical Connectives in LLMs Reasoning Chains

論文の概要: Where Reasoning Breaks: Logic-Aware Path Selection by Controlling Logical Connectives in LLMs Reasoning Chains

arxiv url: http://arxiv.org/abs/2604.20564v1
Date: Wed, 22 Apr 2026 13:45:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-23 15:36:11.150759
Title: Where Reasoning Breaks: Logic-Aware Path Selection by Controlling Logical Connectives in LLMs Reasoning Chains
Title（参考訳）: LLMにおける論理接続制御による論理認識経路選択
Authors: Seunghyun Park, Yuanyuan Lei,
Abstract要約: 我々は、この構造的脆弱性の主要なポイントとして、論理的結合体を同定する。推論過程におけるこれらの論理クリティカルな接合に特異的に介入するフレームワークを提案する。
参考スコア（独自算出の注目度）: 7.740591992262573
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While LLMs demonstrate impressive reasoning capabilities, they remain fragile in multi-step logical deduction, where a single transition error can propagate through the entire reasoning chain, leading to unstable performance. In this work, we identify logical connectives as primary points of this structural fragility. Through empirical analysis, we show that connective tokens function as high entropy forking points, at which models frequently struggle to determine the correct logical direction. Motivated by this observation, we hypothesize that intervening in logical connective selection can guide LLMs toward more correct logical direction, thereby improving the overall reasoning chain. To validate this hypothesis, we propose a multi-layered framework that intervenes specifically at these logic-critical junctions in the reasoning process. Our framework includes (1) Gradient-based Logical Steering to guide LLMs internal representations towards valid reasoning subspaces, (2) Localized Branching to resolve ambiguity via targeted look-ahead search, and (3) Targeted Transition Preference Optimization, a surgical reinforcement learning objective that selectively optimizes single-token preferences at logical pivots. Crucially, by concentrating intervention solely on logic-critical transitions, our framework achieves a favorable accuracy--efficiency trade-off compared to global inference time scaling methods like beam search and self-consistency.
Abstract（参考訳）: LLMは印象的な推論能力を示すが、多段階論理推論において脆弱なままであり、単一の遷移エラーが推論チェーン全体を通して伝播し、不安定な性能をもたらす。この研究では、この構造的不安定性の主点として論理的連結体を同定する。経験的分析により、連結トークンは高いエントロピーフォークポイントとして機能し、モデルが正しい論理的方向を決定するのにしばしば苦労することを示す。本研究は, 論理結合選択の介入により, LLMをより正しい論理方向に導くことができ, 全体としての推論連鎖を改善することができると仮定した。この仮説を検証するために、推論過程におけるこれらの論理クリティカルジャンクションに特異的に介入する多層フレームワークを提案する。本フレームワークは,(1)LLMの内部表現を有効な推論部分空間に導くためのグラディエントベースの論理的ステアリング,(2)目的のルックアヘッド探索によるあいまいさを解消するための局所分岐,(3)論理ピボットにおける単一トークンの選好を選択的に最適化する外科的強化学習目的であるTargeted Transition Preference Optimizationを含む。重要なことは、論理クリティカルな遷移のみに介入を集中させることで、ビームサーチや自己整合性のようなグローバルな推論時間スケーリング手法と比較して、我々のフレームワークは良好な精度-効率のトレードオフを達成する。

論文の概要: Where Reasoning Breaks: Logic-Aware Path Selection by Controlling Logical Connectives in LLMs Reasoning Chains

関連論文リスト