Fugu-MT 論文翻訳(概要): Accelerate Scaling of LLM Alignment via Quantifying the Coverage and Depth of Instruction Set

論文の概要: Accelerate Scaling of LLM Alignment via Quantifying the Coverage and Depth of Instruction Set

arxiv url: http://arxiv.org/abs/2509.06463v1
Date: Mon, 08 Sep 2025 09:22:57 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-09 14:07:04.030863
Title: Accelerate Scaling of LLM Alignment via Quantifying the Coverage and Depth of Instruction Set
Title（参考訳）: 命令セットの被覆と深さの定量化によるLLMアライメントの高速化
Authors: Chengwei Wu, Li Du, Hanyu Zhao, Yiming Ju, Jiapu Wang, Tengfei Pan,
Abstract要約: 本稿では,命令データセットの分布とアライメントモデルの性能の関係について検討する。次に,選択した命令の深さと意味的カバレッジを同時に最大化するために,命令選択アルゴリズムを設計する。実験結果から, 最先端のベースライン法と比較して, モデル性能を高速に向上し, 高速スケーリングを実現することが可能である。」
参考スコア（独自算出の注目度）: 22.846835046470517
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the growing demand for applying large language models to downstream tasks, improving model alignment performance and efficiency has become crucial. Such a process involves selecting informative instructions from a candidate pool. However, due to the complexity of instruction set distributions, the key factors driving the performance of aligned models remain unclear. As a result, current instruction set refinement methods fail to improve performance as the instruction pool expands continuously. To address this issue, we first investigate the key factors that influence the relationship between instruction dataset distribution and aligned model performance. Based on these insights, we propose a novel instruction data selection method. We identify that the depth of instructions and the coverage of the semantic space are the crucial factors determining downstream performance, which could explain over 70\% of the model loss on the development set. We then design an instruction selection algorithm to simultaneously maximize the depth and semantic coverage of the selected instructions. Experimental results demonstrate that, compared to state-of-the-art baseline methods, it can sustainably improve model performance at a faster pace and thus achieve \emph{``Accelerated Scaling''}.
Abstract（参考訳）: 下流タスクに大規模言語モデルを適用する必要性が高まっているため、モデルアライメントのパフォーマンスと効率が改善されている。このようなプロセスでは、候補プールから情報的指示を選択する。しかし、命令セット分布の複雑さのため、整列モデルの性能を駆動する重要な要因はいまだ不明である。その結果、命令プールが連続的に拡大するにつれて、現在の命令セットリファインメント法は性能改善に失敗する。この問題に対処するために、まず、命令データセットの分布と整列モデルの性能の関係に影響を及ぼす重要な要因について検討する。これらの知見に基づいて,新しい命令データ選択法を提案する。提案手法の深度と意味空間のカバレッジが下流の性能を決定する重要な要因であり, 開発環境におけるモデル損失の70%以上を説明できる。次に,選択した命令の深さと意味的カバレッジを同時に最大化するために,命令選択アルゴリズムを設計する。実験結果は,最先端のベースライン手法と比較して,モデル性能を高速に向上し,その結果,'emph{``Accelerated Scaling'} が得られることを示した。

論文の概要: Accelerate Scaling of LLM Alignment via Quantifying the Coverage and Depth of Instruction Set

関連論文リスト