Fugu-MT 論文翻訳(概要): DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction

論文の概要: DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction

arxiv url: http://arxiv.org/abs/2510.09211v1
Date: Fri, 10 Oct 2025 09:45:35 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 00:38:48.613345
Title: DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction
Title（参考訳）: DICE:SLM-Guided Chain-of-Thought CorrectionによるLCMの構造的推論
Authors: Yiqi Li, Yusheng Liao, Zhe Chen, Yanfeng Wang, Yu Wang,
Abstract要約: 大規模言語モデル(LLM)は、しばしば詳細な命令の遵守よりも推論を優先する。これに対応するための教師付きデータセットの微調整 LLM は、高い計算コストと限られたパラメータアクセスのために実用的ではない。我々は,小型言語モデル(SLM)をガイドする軽量フレームワークであるDICEを提案し,LLMの出力をチェーン・オブ・シンク(CoT)補正により改善する。
参考スコア（独自算出の注目度）: 29.22321321753093
License: http://creativecommons.org/licenses/by/4.0/
Abstract: When performing reasoning tasks with user-specific requirements, such as strict output formats, large language models (LLMs) often prioritize reasoning over adherence to detailed instructions. Fine-tuning LLMs on supervised datasets to address this is impractical due to high computational costs and limited parameter access. To tackle this, we propose DICE, a lightweight framework that guides small language models (SLMs) to refine LLMs' outputs through chain-of-thought (CoT) correction. DICE decouples the process by first prompting LLMs to generate natural language responses, then using trained SLMs to analyze and refine these outputs to meet structured output specifications. This framework preserves LLMs' broad knowledge and reasoning capabilities while ensuring the outputs conform to user demands. Specifically, DICE first constructs structured CoT adaptation datasets via a two-stage method and subsequently applies a dual-tuning strategy to fine-tune SLMs for generating structured outputs in an analyze-then-answer pattern. Experiments demonstrate that DICE improves the average format accuracy and content correctness of LLM outputs by 35.4\% and 29.4\%, respectively, achieving state-of-the-art (SOTA) performance over other competitive baselines.
Abstract（参考訳）: 厳格な出力形式などのユーザ固有の要件で推論タスクを実行する場合、大きな言語モデル(LLM)は詳細な命令への準拠よりも推論を優先することが多い。これに対応するための教師付きデータセットの微調整 LLM は、高い計算コストと限られたパラメータアクセスのために実用的ではない。これを解決するために,我々は小型言語モデル(SLM)をガイドする軽量フレームワークであるDICEを提案し,LLMの出力をチェーン・オブ・シント(CoT)補正により改善する。 DICEは、まずLLMに自然言語応答を生成するように促し、次に訓練されたSLMを使用してこれらの出力を分析し、洗練し、構造化された出力仕様を満たすことによって、プロセスを分離する。このフレームワークは、LCMの幅広い知識と推論能力を維持しながら、出力がユーザの要求に合致することを保証します。具体的には、DICEはまず2段階法を用いて構造化CoT適応データセットを構築し、その後、構造化出力を生成するための微調整SLMに二重チューニング戦略を適用した。実験により、DICE は LLM 出力の平均フォーマット精度と内容の正しさを 35.4 % と 29.4 % に改善し、他の競争基準よりも最先端(SOTA) の性能を達成することを示した。

論文の概要: DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction

関連論文リスト