Fugu-MT Paper Translation (Abstract): SCALE: Selective Resource Allocation for Overcoming Performance Bottlenecks in Mathematical Test-time Scaling

Paper overview: SCALE: Selective Resource Allocation for Overcoming Performance Bottlenecks in Mathematical Test-time Scaling

arxiv url: http://arxiv.org/abs/2512.00466v1
Date: Sat, 29 Nov 2025 12:38:07 GMT
Status: Translation complete
Last updated in system: 2025-12-02 19:46:34.251776
Title: SCALE: Selective Resource Allocation for Overcoming Performance Bottlenecks in Mathematical Test-time Scaling
Authors: Yang Xiao, Chunpu Xu, Ruifeng Yuan, Jiashuo Wang, Wenjie Li, Pengfei Liu
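Abstract summary: Test-time compute scaling has emerged as a powerful paradigm for enhancing mathematical reasoning in large language models. The paper proposes SCALE (Selective Resource Allocation), a framework that selectively allocates computational resources based on sub-problem difficulty.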
Reference score (attention level, computed independently by this site): 38.48815459263562
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Test-time compute scaling has emerged as a powerful paradigm for enhancing mathematical reasoning in large language models (LLMs) by allocating additional computational resources during inference. However, current methods employ uniform resource distribution across all reasoning sub-problems, creating fundamental bottlenecks where challenging sub-problems receive insufficient attention while routine operations consume disproportionate resources. This uniform allocation creates performance bottlenecks where additional computational resources yield diminishing returns. Inspired by dual-process theory, we propose SCALE (Selective Resource Allocation), a framework that selectively allocates computational resources based on sub-problem difficulty. SCALE operates through four stages: (1) problem decomposition into sequential reasoning sub-problems, (2) difficulty assessment of each sub-problem to distinguish between routine operations and computationally challenging sub-problems, (3) selective processing mode assignment between System 1 for simple sub-problems and System 2 for complex ones, and (4) sequential execution with context propagation. By concentrating resources on challenging sub-problems while processing routine operations efficiently, SCALE achieves substantial performance improvements with superior resource utilization. Extensive experiments demonstrate that SCALE significantly outperforms uniform scaling baselines, achieving accuracy improvements of up to 13.75 percentage points (57.50% to 71.25% on AIME25) while reducing computational costs by 33%-53%, representing a major advance in test-time scaling that addresses fundamental limitations of current approaches.
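The four stages listed in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function `scale_pipeline`, the `threshold` parameter, the difficulty score in [0, 1], and all four `demo_*` stubs are hypothetical names introduced here for illustration. In a real system the stubs would presumably be replaced by LLM calls, which the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class SubProblem:
    text: str
    difficulty: float = 0.0  # hypothetical score in [0, 1]; not from the paper


def scale_pipeline(
    problem: str,
    decompose: Callable[[str], List[str]],
    assess: Callable[[str, List[str]], float],
    system1: Callable[[str, List[str]], str],
    system2: Callable[[str, List[str]], str],
    threshold: float = 0.5,  # assumed cut-off between "routine" and "hard"
) -> Tuple[List[str], List[str]]:
    """Toy version of the four stages described in the abstract."""
    # Stage 1: decompose the problem into sequential sub-problems.
    steps = [SubProblem(text) for text in decompose(problem)]

    context: List[str] = []  # results of earlier sub-problems
    modes: List[str] = []    # which mode handled each sub-problem

    for step in steps:
        # Stage 2: assess the difficulty of this sub-problem.
        step.difficulty = assess(step.text, context)

        # Stage 3: pick the cheap mode (System 1) or the expensive one (System 2).
        use_system2 = step.difficulty >= threshold
        solver = system2 if use_system2 else system1
        modes.append("system2" if use_system2 else "system1")

        # Stage 4: execute in order and propagate the result as context.
        context.append(solver(step.text, context))

    return context, modes


# Hypothetical stand-ins so the sketch runs without any model.
def demo_decompose(problem: str) -> List[str]:
    return [part.strip() for part in problem.split(";") if part.strip()]


def demo_assess(step: str, context: List[str]) -> float:
    return 0.9 if "prove" in step.lower() else 0.1


def demo_system1(step: str, context: List[str]) -> str:
    return f"fast({step})"


def demo_system2(step: str, context: List[str]) -> str:
    return f"slow({step}, seen={len(context)})"


answers, modes = scale_pipeline(
    "add 2 and 3; prove the bound holds; report the result",
    demo_decompose, demo_assess, demo_system1, demo_system2,
)
print(modes)
```

In this toy run only the middle sub-problem is routed to the expensive mode, which is the selective behaviour the abstract contrasts with uniform allocation. The reported headline gain is likewise a simple difference: 71.25 - 57.50 = 13.75 percentage points on AIME25.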


Related papers