Fugu-MT 論文翻訳(概要): Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty

論文の概要: Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty

arxiv url: http://arxiv.org/abs/2508.19069v1
Date: Tue, 26 Aug 2025 14:26:32 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-27 17:42:38.880093
Title: Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty
Title（参考訳）: 構造化テンプレートは難解なタスクにLLMを役立てることができるか? : 難易度によるスケーリング法則の探索
Authors: Zhichao Yang, Zhaoxin Fan, Gen Li, Yuanze Hu, Xinyu Wang, Ye Qiu, Xin Wang, Yifan Sun, Wenjun Wu,
Abstract要約: 後学習法は複雑なタスクの深層手続き論理を捉えるのに不足する。本稿では,SST(Structured Solution template)フレームワークを提案する。 GSM8K、AIME24、および新しいDynamic Enベンチマークの実験により、SSTは特に難しい問題において、精度と効率の両方を著しく改善することが示された。
参考スコア（独自算出の注目度）: 20.9633647829495
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Structured, procedural reasoning is essential for Large Language Models (LLMs), especially in mathematics. While post-training methods have improved LLM performance, they still fall short in capturing deep procedural logic on complex tasks. To tackle the issue, in this paper, we first investigate this limitation and uncover a novel finding: a Scaling Law by Difficulty, which reveals that model performance follows a U-shaped curve with respect to training data complexity -- excessive low-difficulty data impedes abstraction, while high-difficulty data significantly enhances reasoning ability. Motivated by this, we propose the Structured Solution Template (SST) framework, which uses solution templates and a curriculum of varied difficulty to explicitly teach procedural reasoning. Specifically, SST comprises (1) fine-tuning with structured solution-template chains and dynamically weighted loss to prioritize procedural logic, (2) prompt-time injection of solution templates as cognitive scaffolds to guide inference, and (3) integrated curriculum fine-tuning that explicitly teaches the model to self-plan - execute - self-correct. Experiments on GSM8K, AIME24, and new Dynamic En benchmark show that SST significantly improves both accuracy and efficiency, especially on harder problems.
Abstract（参考訳）: 構造化された手続き的推論は大規模言語モデル(LLM)、特に数学において不可欠である。ポストトレーニング手法はLLMの性能を向上したが、複雑なタスクにおける深い手続き論理の取得には依然として不足している。そこで,本論文ではまず,この制限を検証し,新たな発見を明らかにする。Difficultyによるスケーリング法則(Scaling Law by Difficulty)は,モデルの性能がトレーニングデータの複雑性に関して,U字型曲線に従っていることを明らかにする。そこで本研究では,構造化ソリューションテンプレート(Structured Solution Template, SST)フレームワークを提案する。具体的には,(1)構造化された解テンプレートチェーンによる微調整と,(2)手続き論理を優先するために動的に重み付けされた損失,(2)推論をガイドする認知的足場としての解テンプレートの迅速な注入,(3)自己計画 - 実行 - 自己修正を明示的に教える統合的なカリキュラム微調整を含む。 GSM8K、AIME24、および新しいDynamic Enベンチマークの実験により、SSTは特に難しい問題において、精度と効率の両方を著しく改善することが示された。

論文の概要: Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty

関連論文リスト