Fugu-MT 論文翻訳(概要): Synthesizing Instruction-Tuning Datasets with Contrastive Decoding

論文の概要: Synthesizing Instruction-Tuning Datasets with Contrastive Decoding

arxiv url: http://arxiv.org/abs/2604.13538v1
Date: Wed, 15 Apr 2026 06:37:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-16 20:38:32.416514
Title: Synthesizing Instruction-Tuning Datasets with Contrastive Decoding
Title（参考訳）: コントラストデコーディングによる命令調整データセットの合成
Authors: Tatsuya Ichinose, Youmi Ma, Masanari Oi, Ryuto Koike, Naoaki Okazaki,
Abstract要約: 応答生成において,学習後モデルと事前学習後のモデル間のコントラストデコーディングを適用する手法を提案する。実験の結果、CoDITによって構築されたデータセットでトレーニングされたモデルは、直接生成されたレスポンスでトレーニングされたモデルよりも一貫して優れていた。
参考スコア（独自算出の注目度）: 17.127903764198084
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Using responses generated by high-performing large language models (LLMs) for instruction tuning has become a widely adopted approach. However, the existing literature overlooks a property of LLM-generated responses: they conflate world knowledge acquired during pre-training with instruction-following capabilities acquired during post-training. We hypothesize that disentangling the instruction-following capabilities from pre-trained knowledge improves the effectiveness of instruction tuning. To this end, we propose CoDIT, a method that applies contrastive decoding between a post-trained model and its pre-trained counterpart during response generation. The method suppresses pre-trained knowledge shared between the two models while amplifying the instruction-following behavior acquired via post-training, resulting in responses that more purely reflect instruction-following capabilities. Experiment results demonstrate that models trained on datasets constructed via CoDIT consistently outperform those trained on directly generated responses. Training on our datasets also yields better performance than on existing publicly available instruction-tuning datasets across multiple benchmarks. Furthermore, we theoretically and empirically show that CoDIT can be interpreted as distilling the chat vector from parameter space to text space, enabling the transfer of instruction-tuning capabilities across models of different architectures.
Abstract（参考訳）: 高い性能の大規模言語モデル(LLM)によって生成された応答を命令チューニングに利用することは、広く採用されているアプローチである。しかし、既存の文献は、LLM生成応答の特性を軽視しており、事前学習中に得られた世界知識と、後学習時に取得した指導追従能力とを要約している。我々は、事前学習した知識から命令追従能力を引き離すことで、命令チューニングの有効性が向上すると仮定する。そこで本研究では,後学習モデルと事前学習モデルとの間で,応答生成時にコントラストデコードを適用する手法であるCoDITを提案する。この方法は、2つのモデル間で共有される事前訓練された知識を抑えつつ、ポストトレーニングによって得られた命令追従動作を増幅し、結果として命令追従能力をより純粋に反映する応答をもたらす。実験の結果、CoDITによって構築されたデータセットでトレーニングされたモデルは、直接生成されたレスポンスでトレーニングされたモデルよりも一貫して優れていた。データセットをトレーニングすることで、既存の複数のベンチマークで利用可能な命令チューニングデータセットよりもパフォーマンスが向上します。さらに,CoDITは,パラメータ空間からテキスト空間へのチャットベクトルの蒸留であり,異なるアーキテクチャのモデル間での命令調整能力の伝達を可能にすることを理論的かつ実証的に示す。

論文の概要: Synthesizing Instruction-Tuning Datasets with Contrastive Decoding

関連論文リスト