Fugu-MT 論文翻訳(概要): Schema Key Wording as an Instruction Channel in Structured Generation under Constrained Decoding

論文の概要: Schema Key Wording as an Instruction Channel in Structured Generation under Constrained Decoding

arxiv url: http://arxiv.org/abs/2604.14862v1
Date: Thu, 16 Apr 2026 10:52:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.850945
Title: Schema Key Wording as an Instruction Channel in Structured Generation under Constrained Decoding
Title（参考訳）: 制約付きデコードにおける構造生成における命令チャネルとしてのスキーマキーワード
Authors: Yifan Le,
Abstract要約: 本研究では,命令配置が構造生成におけるモデル性能に与える影響について検討する。提案手法では,プロンプトやモデルパラメータを変更することなく,単にスキーマキーのワードを変更するだけで,制約付き復号化時のモデル性能を著しく変化させることができることを示す。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Constrained decoding has been widely adopted for structured generation with large language models (LLMs), ensuring that outputs satisfy predefined formats such as JSON and XML. However, existing approaches largely treat schemas as purely structural constraints and overlook the possibility that their linguistic formulation may affect model behavior. In this work, we study how instruction placement influences model performance in structured generation and show that merely changing the wording of schema keys, without modifying the prompt or model parameters, can significantly alter model performance under constrained decoding. Based on this observation, we propose to reinterpret structured generation as a multi-channel instruction problem, where instructions can be conveyed explicitly through prompts and implicitly through schema keys during decoding. To the best of our knowledge, this is the first work to systematically study how schema key formulation acts as an implicit instruction channel and affects model performance under constrained decoding. Experiments on multiple mathematical reasoning benchmarks show that different model families exhibit distinct sensitivities to these instruction channels: Qwen models consistently benefit from schema-level instructions, while LLaMA models rely more heavily on prompt-level guidance. We further observe non-additive interaction effects between instruction channels, showing that combining multiple channels does not always lead to further improvement. These findings suggest that schema design not only determines output structure, but also carries instruction signals, offering a new perspective on structured generation in LLMs.
Abstract（参考訳）: 制約付きデコーディングは、大きな言語モデル(LLM)で構造化された世代に広く採用されており、出力がJSONやXMLのような事前定義されたフォーマットを満たすことが保証されている。しかし、既存のアプローチはスキーマを純粋に構造的な制約として扱い、それらの言語的定式化がモデル行動に影響を与える可能性を見落としている。本研究では,命令配置が構造的生成におけるモデル性能に与える影響について検討し,プロンプトやモデルパラメータを変更することなく,単にスキーマキーの単語化を変更するだけで,制約付き復号化時のモデル性能を著しく変化させることができることを示す。そこで本研究では,マルチチャネル命令問題として構造生成を再解釈し,命令をプロンプトで明示的に伝達し,デコード中のスキーマキーで暗黙的に命令を伝達する手法を提案する。我々の知る限りでは、スキーマキーの定式化が暗黙の命令チャネルとしてどのように振る舞うかを体系的に研究し、制約付き復号化の下でモデル性能に影響を及ぼすのはこれが初めてである。複数の数学的推論ベンチマークの実験は、異なるモデルファミリーがこれらの命令チャネルに対して異なる感受性を示すことを示している: Qwenモデルは一貫してスキーマレベルの命令から恩恵を受け、LLaMAモデルはプロンプトレベルのガイダンスに大きく依存する。さらに、命令チャネル間の非付加的な相互作用効果を観察し、複数のチャネルの組み合わせが必ずしも更なる改善につながるとは限らないことを示す。これらの結果は、スキーマ設計が出力構造を決定するだけでなく、命令信号も持てることを示唆し、LLMにおける構造生成の新しい視点を提供する。

論文の概要: Schema Key Wording as an Instruction Channel in Structured Generation under Constrained Decoding

関連論文リスト