Fugu-MT 論文翻訳(概要): Decoupling Task-Solving and Output Formatting in LLM Generation

論文の概要: Decoupling Task-Solving and Output Formatting in LLM Generation

arxiv url: http://arxiv.org/abs/2510.03595v1
Date: Sat, 04 Oct 2025 00:52:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-07 16:52:59.140249
Title: Decoupling Task-Solving and Output Formatting in LLM Generation
Title（参考訳）: LLM生成におけるタスクソルビングと出力フォーマッティングの分離
Authors: Haikang Deng, Po-Nien Kung, Nanyun Peng,
Abstract要約: Deco-Gは、タスク解決からフォーマットのアテンデンスを明確に分離するデコードフレームワークである。 Deco-Gは、分離されたトラクタブル確率モデル(TPM)でフォーマットコンプライアンスを処理する Deco-Gの有効性を,多種多様なフォーマット要求を伴う多種多様なタスクで実証する。
参考スコア（独自算出の注目度）: 44.40087140333511
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) are increasingly adept at following instructions containing task descriptions to solve complex problems, such as mathematical reasoning and automatic evaluation (LLM-as-a-Judge). However, as prompts grow more complex, models often struggle to adhere to all instructions. This difficulty is especially common when instructive prompts intertwine reasoning directives -- specifying what the model should solve -- with rigid formatting requirements that dictate how the solution must be presented. The entanglement creates competing goals for the model, suggesting that more explicit separation of these two aspects could lead to improved performance. To this front, we introduce Deco-G, a decoding framework that explicitly decouples format adherence from task solving. Deco-G handles format compliance with a separate tractable probabilistic model (TPM), while prompts LLMs with only task instructions. At each decoding step, Deco-G combines next token probabilities from the LLM with the TPM calculated format compliance likelihood to form the output probability. To make this approach both practical and scalable for modern instruction-tuned LLMs, we introduce three key innovations: instruction-aware distillation, a flexible trie-building algorithm, and HMM state pruning for computational efficiency. We demonstrate the effectiveness of Deco-G across a wide range of tasks with diverse format requirements, including mathematical reasoning, LLM-as-a-judge, and event argument extraction. Overall, our approach yields 1.0% to 6.0% relative gain over regular prompting practice with guaranteed format compliance.
Abstract（参考訳）: 大規模言語モデル(LLM)は、数学的推論や自動評価(LLM-as-a-Judge)といった複雑な問題を解くためのタスク記述を含む命令に適応する傾向にある。しかしながら、プロンプトがより複雑になるにつれて、モデルはしばしば全ての命令に従うのに苦労する。この難しさは、インストラクティブがインタートウィン推論指示 -- モデルが何を解決するべきかを指定する -- を、どのようにソリューションを提示するかを規定する厳格なフォーマット要件によって促す場合、特に一般的である。この絡み合いはモデルの競合する目標を生み出し、これらの2つの側面のより明確な分離によってパフォーマンスが向上する可能性があることを示唆している。本稿では,デコードフレームワークであるDeco-Gを紹介する。 Deco-Gは、個別のトラクタブル確率モデル(TPM)でフォーマットコンプライアンスを処理し、タスク命令のみでLLMをプロンプトする。各復号ステップにおいて、Deco-G は LLM からの次のトークン確率と TPM 計算フォーマット適合確率を組合せて出力確率を形成する。提案手法は,命令認識蒸留,フレキシブルトリエ構築アルゴリズム,計算効率向上のためのHMM状態プルーニングという3つの重要な革新をもたらす。本稿では, 数学的推論, LLM-as-a-judge, イベント引数抽出など, 多様な形式要件を持つ多種多様なタスクを対象としたDeco-Gの有効性を示す。提案手法は,形式順守が保証された定期的なプロンプトよりも1.0%から6.0%の相対的な利得が得られる。

論文の概要: Decoupling Task-Solving and Output Formatting in LLM Generation

関連論文リスト