Fugu-MT 論文翻訳(概要): Importance-Aware Data Selection for Efficient LLM Instruction Tuning

論文の概要: Importance-Aware Data Selection for Efficient LLM Instruction Tuning

arxiv url: http://arxiv.org/abs/2511.07074v1
Date: Mon, 10 Nov 2025 13:06:30 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-11 21:18:45.260296
Title: Importance-Aware Data Selection for Efficient LLM Instruction Tuning
Title（参考訳）: LLM指導の効率化のための重要度を考慮したデータ選択
Authors: Tingyu Jiang, Shen Li, Yiyao Song, Lan Zhang, Hualei Zhu, Yuan Zhao, Xiaohang Xu, Kenjiro Taura, Hao Henry Wang,
Abstract要約: 本稿では,モデルの能力向上における命令データの重要性を定量化するための新しい指標として,MIWV(Model Instruction Weakness Value)を提案する。実験の結果,MIWVに基づくデータのトップ1%のみを選択すると,全データセットでのトレーニングよりも優れることがわかった。
参考スコア（独自算出の注目度）: 12.894727887191621
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Instruction tuning plays a critical role in enhancing the performance and efficiency of Large Language Models (LLMs). Its success depends not only on the quality of the instruction data but also on the inherent capabilities of the LLM itself. Some studies suggest that even a small amount of high-quality data can achieve instruction fine-tuning results that are on par with, or even exceed, those from using a full-scale dataset. However, rather than focusing solely on calculating data quality scores to evaluate instruction data, there is a growing need to select high-quality data that maximally enhances the performance of instruction tuning for a given LLM. In this paper, we propose the Model Instruction Weakness Value (MIWV) as a novel metric to quantify the importance of instruction data in enhancing model's capabilities. The MIWV metric is derived from the discrepancies in the model's responses when using In-Context Learning (ICL), helping identify the most beneficial data for enhancing instruction tuning performance. Our experimental results demonstrate that selecting only the top 1\% of data based on MIWV can outperform training on the full dataset. Furthermore, this approach extends beyond existing research that focuses on data quality scoring for data selection, offering strong empirical evidence supporting the effectiveness of our proposed method.
Abstract（参考訳）: インストラクションチューニングは、大規模言語モデル(LLM)の性能と効率を高める上で重要な役割を果たす。その成功は、命令データの品質だけでなく、LLM自体の固有の能力にも依存する。一部の研究では、少量の高品質なデータでさえ、フルスケールのデータセットを使用することで、命令の微調整結果に匹敵する、あるいは超える結果が得られることを示唆している。しかし、命令データを評価するためにデータ品質スコアの計算にのみ焦点をあてるのではなく、与えられたLLMの命令チューニング性能を最大限に向上させる高品質なデータを選択する必要がある。本稿では,モデルの能力向上における命令データの重要性を定量化するための新しい指標として,MIWV(Model Instruction Weakness Value)を提案する。 MIWVメトリックは、インコンテキスト学習(ICL)を使用する際のモデルの応答の相違から派生し、命令チューニング性能を向上させる上で最も有益なデータを特定するのに役立つ。実験の結果,MIWVに基づくデータのトップ1\%のみを選択すると,全データセットでのトレーニングよりも優れることがわかった。さらに、本手法は、データ選択のためのデータ品質スコアリングに焦点を当てた既存の研究を超えて、提案手法の有効性を裏付ける強力な実証的証拠を提供する。

論文の概要: Importance-Aware Data Selection for Efficient LLM Instruction Tuning

関連論文リスト