Fugu-MT 論文翻訳(概要): TrigReason: Trigger-Based Collaboration between Small and Large Reasoning Models

論文の概要: TrigReason: Trigger-Based Collaboration between Small and Large Reasoning Models

arxiv url: http://arxiv.org/abs/2604.14847v1
Date: Thu, 16 Apr 2026 10:33:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.842693
Title: TrigReason: Trigger-Based Collaboration between Small and Large Reasoning Models
Title（参考訳）: TrigReason: 小規模と大規模の推論モデル間のトリガーベースコラボレーション
Authors: Yi Zhao, Yajuan Peng, Cam-Tu Nguyen, Zuchao Li, Xiaoliang Wang, Xiaoming Fu, Hai Zhao,
Abstract要約: 大規模推論モデル(LRM)は複雑なタスクにおいて高い性能を達成するが、自己回帰的推論によって高い推論遅延に悩まされる。最近の研究は、小型共振モデル(SRM)を用いてLEM推論を高速化する。 TrigReasonは、継続的ポーリングを選択的介入で置き換えるトリガーベースの協調推論フレームワークである。
参考スコア（独自算出の注目度）: 59.83398270447905
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Reasoning Models (LRMs) achieve strong performance on complex tasks through extended chains of thought but suffer from high inference latency due to autoregressive reasoning. Recent work explores using Small Reasoning Models (SRMs) to accelerate LRM inference. In this paper, we systematically characterize the capability boundaries of SRMs and identify three common types of reasoning risks: (1) path divergence, where SRMs lack the strategic ability to construct an initial plan, causing reasoning to deviate from the most probable path; (2) cognitive overload, where SRMs fail to solve particularly difficult steps; and (3) recovery inability, where SRMs lack robust self-reflection and error correction mechanisms. To address these challenges, we propose TrigReason, a trigger-based collaborative reasoning framework that replaces continuous polling with selective intervention. TrigReason delegates most reasoning to the SRM and activates LRM intervention only when necessary-during initial strategic planning (strategic priming trigger), upon detecting extraordinary overconfidence (cognitive offload trigger), or when reasoning falls into unproductive loops (intervention request trigger). The evaluation results on AIME24, AIME25, and GPQA-D indicate that TrigReason matches the accuracy of full LRMs and SpecReason, while offloading 1.70x - 4.79x more reasoning steps to SRMs. Under edge-cloud conditions, TrigReason reduces latency by 43.9\% and API cost by 73.3\%. Our code is available at \href{https://github.com/QQQ-yi/TrigReason}{https://github.com/QQQ-yi/TrigReason}
Abstract（参考訳）: 大規模推論モデル(LRM)は思考の連鎖を延長することで複雑なタスクにおいて高い性能を達成するが、自己回帰的推論による高い推論遅延に悩まされる。最近の研究は、小型共振モデル(SRM)を用いてLEM推論を高速化する。本稿では,SRMの能力境界を体系的に評価し,(1)SRMが初期計画を構築する戦略的能力に欠ける経路分散,(2)SRMが特に難解なステップを解けない認知的過負荷,(3)SRMが堅牢な自己回帰と誤り訂正機構を欠く回復不能,の3つの共通タイプの推論リスクを同定する。これらの課題に対処するため、我々はトリガーベースの協調推論フレームワークであるTrigReasonを提案し、継続的ポーリングを選択的介入で置き換える。 TrigReasonはSRMにほとんどの推論を委譲し、必要な戦略計画(戦略プライミング・トリガー)、異常な過信(認知的オフロード・トリガー)の検出、あるいは推論が非生産的ループ(干渉要求トリガー)に陥る時にのみLEMの介入を活性化する。 AIME24, AIME25, GPQA-Dの評価結果から, TrigReason は完全な LRM と SpecReason の精度と一致し, 1.70x - 4.79倍の推理ステップを SRM にオフロードした。エッジクラウド環境では、TrigReasonはレイテンシを43.9\%、APIコストを73.3\%削減する。我々のコードは \href{https://github.com/QQQ-yi/TrigReason}{https://github.com/QQQQ-yi/TrigReason} で入手できる。

論文の概要: TrigReason: Trigger-Based Collaboration between Small and Large Reasoning Models

関連論文リスト