Fugu-MT Paper Translation (Abstract): EvoSelect: Data-Efficient LLM Evolution for Targeted Task Adaptation

Paper overview: EvoSelect: Data-Efficient LLM Evolution for Targeted Task Adaptation

arxiv url: http://arxiv.org/abs/2604.26170v1
Date: Tue, 28 Apr 2026 23:26:16 GMT
Status: Translation complete
System update date: 2026-04-30 15:59:36.195276
Title: EvoSelect: Data-Efficient LLM Evolution for Targeted Task Adaptation
Title (reference translation): EvoSelect: Data-Efficient LLM Evolution for Targeted Task Adaptation
Authors: Ting-Wei Li, Sirui Chen, Jiaru Zou, Yingbing Huang, Tianxin Wei, Jingrui He, Hanghang Tong
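Abstract summary: Adapting large language models (LLMs) to a targeted task efficiently and effectively is a fundamental challenge. One straightforward approach is an iterative generation-training loop in which candidate data are synthesized through an external generator. The paper introduces a refined paradigm, an iterative generation-selection-training loop, which incorporates a selection step prior to model updates.
Reference score (independently calculated attention score): 79.71802168256542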
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Adapting large language models (LLMs) to a targeted task efficiently and effectively remains a fundamental challenge. Such adaptation often requires iteratively improving the model toward a targeted task, yet collecting high-quality human-labeled data to support this process is costly and difficult to scale. As a result, synthetic data generation has emerged as a flexible and scalable alternative. One straightforward approach is through an iterative generation-training loop, where candidate data are synthesized through an external generator, the model is updated using these data and the process is repeated over iterations. However, generated samples can be noisy, highly redundant, or even misaligned with the targeted task distribution. Training indiscriminately on such data can dilute useful learning signals and even degrade model performance. To address this, we introduce a refined paradigm, namely an iterative generation-selection-training loop, which incorporates a selection step prior to model updates. Building on this paradigm, we propose EvoSelect, a data-efficient framework to evolve LLM effectively. Given candidate samples produced by the data generator, EvoSelect selects training data by jointly modeling targeted task alignment and diversity. We estimate task relevance through optimal transport with proxy gradient representations, which quantifies how well candidate samples align with the targeted task distribution. To mitigate redundancy, we incorporate a diversification mechanism that promotes coverage of complementary training samples. By interleaving alignment and diversification, EvoSelect enables progressive LLM evolution toward targeted tasks. Extensive experiments on various benchmarks demonstrate that with either weak or strong data generators, EvoSelect consistently improves adaptation efficacy over existing data selection methods.
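Abstract (reference translation): Adapting large language models (LLMs) to a targeted task efficiently and effectively is a fundamental challenge. Such adaptation often requires iteratively improving the model toward the targeted task, but collecting high-quality human-labeled data to support this process is costly and difficult to scale. As a result, synthetic data generation has emerged as a flexible and scalable alternative. One straightforward approach is an iterative generation-training loop, in which candidate data are synthesized by an external generator, the model is updated on these data, and the process is repeated over iterations. However, generated samples can be noisy, highly redundant, or misaligned with the targeted task distribution. Training indiscriminately on such data can dilute useful learning signals and degrade model performance. We therefore introduce a refined paradigm, an iterative generation-selection-training loop, which adds a selection step before model updates. Building on this paradigm, we propose EvoSelect, a data-efficient framework for evolving LLMs effectively. Given candidate samples produced by the data generator, EvoSelect selects training data by jointly considering targeted task alignment and diversity. The proposed method quantifies how well candidate samples align with the targeted task distribution. To mitigate redundancy, it incorporates a diversification mechanism that promotes coverage of complementary training samples. By interleaving alignment and diversification, EvoSelect enables progressive LLM evolution toward targeted tasks. Experiments on various benchmarks show that, with either weak or strong data generators, EvoSelect consistently improves adaptation efficacy over existing data selection methods.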
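
The selection step described in the abstract (score candidates for alignment with the targeted task via optimal transport, then discourage redundancy) can be illustrated with a small sketch. This is not the authors' algorithm: the abstract gives no implementation details, so everything below is an assumption made for illustration. Plain feature vectors stand in for the "proxy gradient representations", the transport cost is cosine distance, the plan comes from basic Sinkhorn iterations with uniform marginals, each candidate's relevance is taken to be its negated share of the transport cost, and diversification is a simple greedy penalty on similarity to already selected samples. The names `sinkhorn_plan`, `unit_rows`, `select_aligned_diverse`, and `diversity_weight` are hypothetical.

```python
import numpy as np


def sinkhorn_plan(cost, reg=0.1, n_iters=200):
    """Entropy-regularized transport plan between two uniform distributions."""
    n, m = cost.shape
    a = np.full(n, 1.0 / n)          # uniform mass on candidates
    b = np.full(m, 1.0 / m)          # uniform mass on target-task samples
    kernel = np.exp(-cost / reg)
    u = np.ones(n)
    for _ in range(n_iters):
        v = b / (kernel.T @ u)       # rescale to match column marginals
        u = a / (kernel @ v)         # rescale to match row marginals
    return u[:, None] * kernel * v[None, :]


def unit_rows(x):
    """Normalize each row to unit length so dot products are cosines."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def select_aligned_diverse(candidates, targets, k, diversity_weight=0.5):
    """Greedily pick k candidate indices, trading relevance against redundancy.

    Assumption: relevance is the negated per-candidate transport cost, and
    redundancy is the highest cosine similarity to anything already picked.
    """
    c, t = unit_rows(candidates), unit_rows(targets)
    cost = 1.0 - c @ t.T                              # cosine distance
    plan = sinkhorn_plan(cost)
    relevance = -len(c) * (plan * cost).sum(axis=1)   # higher is better
    similarity = c @ c.T

    selected = []
    remaining = list(range(len(c)))
    while remaining and len(selected) < k:
        def score(i):
            redundancy = max((similarity[i, j] for j in selected), default=0.0)
            return relevance[i] - diversity_weight * redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


if __name__ == "__main__":
    # Toy data: two near-duplicate candidates, one complementary, one off-task.
    cands = np.array([[1.0, 0.0], [1.0, 0.01], [0.0, 1.0], [-1.0, -1.0]])
    targs = np.array([[1.0, 0.0], [0.0, 1.0]])
    print(select_aligned_diverse(cands, targs, k=2))
```

In an iterative generation-selection-training loop, a function of this kind would sit between the generator and the model update, with the representations recomputed each round as the model changes. How EvoSelect actually interleaves alignment and diversification is specified in the paper, not here.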

Related papers