Fugu-MT 論文翻訳(概要): SLAP: Stratified Loss-based Pruning for On-Policy Data-Efficient Instruction Tuning

論文の概要: SLAP: Stratified Loss-based Pruning for On-Policy Data-Efficient Instruction Tuning

arxiv url: http://arxiv.org/abs/2605.23969v1
Date: Wed, 13 May 2026 06:36:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-01 02:55:42.986874
Title: SLAP: Stratified Loss-based Pruning for On-Policy Data-Efficient Instruction Tuning
Title（参考訳）: SLAP: オンデマンドデータ効率なインストラクションチューニングのための階層化ロスベースプルーニング
Authors: Run Zou, Jianhang Ding, Yifan Ding, Wen Wu, Hao Chen, Renshu Gu,
Abstract要約: 我々は,新しいバッチ対応データ選択フレームワークである textbfSLAP を提案する。 SLAPは、個別ではなく、バッチ構成全体の学習可能性を評価する。 SLAPは完全なデータセットトレーニングよりも20～40%少ないトレーニングデータで優れたパフォーマンスを実現している。
参考スコア（独自算出の注目度）: 14.87050258251449
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Instruction tuning has optimized the specialized capabilities of large language models (LLMs), but it often requires extensive datasets and prolonged training times. The challenge lies in developing specific capabilities by identifying useful data and efficiently fine-tuning. High-quality and diverse pruned data can help models achieve lossless performance at a lower cost. In this paper, we propose \textbf{SLAP}, a novel batch-aware data selection framework that evaluates the learnability of entire batch compositions rather than individual. SLAP ensures comprehensive data distribution coverage through distribution-aware stratified sampling while maximizing intra-batch diversity through relative distance optimization. By leveraging Hessian-approximated gradient information for dynamic batch selection, SLAP significantly outperforms existing state-of-the-art methods across multiple model architectures (LLaMA, ChatGLM) and diverse downstream tasks including multi-turn dialogue, multilingual translation, and question answering. Most notably, SLAP achieves superior performance with 20-40\% less training data compared to full dataset training, substantially reducing computational costs while maintaining or improving model capabilities. These results establish SLAP as a powerful approach for efficient and effective instruction tuning of large language models.
Abstract（参考訳）: インストラクションチューニングは、大規模言語モデル(LLM)の特殊機能に最適化されている。この課題は、有用なデータを特定し、効率的に微調整することで、特定の機能を開発することである。高品質で多様なデータは、モデルが損失のないパフォーマンスを低コストで達成するのに役立ちます。本稿では,個別ではなく,バッチ構成全体の学習性を評価する新しいバッチ対応データ選択フレームワークである \textbf{SLAP} を提案する。 SLAPは、相対的距離最適化によりバッチ内多様性を最大化しつつ、分布認識型階層化サンプリングによる包括的データ配信のカバレッジを保証する。動的バッチ選択にヘシアン近似勾配情報を活用することで、SLAPは、複数のモデルアーキテクチャ(LLaMA、ChatGLM)とマルチターン対話、多言語翻訳、質問応答を含む様々な下流タスクにおいて、既存の最先端メソッドよりも大幅に優れている。最も注目すべきは、SLAPが完全なデータセットトレーニングに比べて20～40倍のトレーニングデータで優れたパフォーマンスを実現し、モデル機能を維持したり改善したりしながら、計算コストを大幅に削減することです。これらの結果から、SLAPは大規模言語モデルの効率的かつ効果的な命令チューニングのための強力なアプローチとして確立された。

論文の概要: SLAP: Stratified Loss-based Pruning for On-Policy Data-Efficient Instruction Tuning

関連論文リスト