Fugu-MT 論文翻訳(概要): SIEVE: Sample-Efficient Parametric Learning from Natural Language

論文の概要: SIEVE: Sample-Efficient Parametric Learning from Natural Language

arxiv url: http://arxiv.org/abs/2604.02339v1
Date: Mon, 02 Feb 2026 18:52:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-19 19:09:11.297997
Title: SIEVE: Sample-Efficient Parametric Learning from Natural Language
Title（参考訳）: SIEVE: 自然言語からのサンプル有効パラメトリック学習
Authors: Parth Asawa, Alexandros G. Dimakis, Matei Zaharia,
Abstract要約: 本稿では,自然言語の文脈からサンプル効率の高いパラメトリック学習手法であるSIEVEを提案する。我々は、コンテキストが分解可能であるという洞察を活用する新しい合成データ生成パイプライン、SIEVE-GENを使用する。以上の結果から,SIEVEは従来の文脈蒸留法よりも優れていることがわかった。
参考スコア（独自算出の注目度）: 66.46967795931559
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Natural language context-such as instructions, knowledge, or feedback-contains rich signal for adapting language models. While in-context learning provides adaptation via the prompt, parametric learning persists into model weights and can improve performance further, though is data hungry and heavily relies on either high-quality traces or automated verifiers. We propose SIEVE, a method for sample-efficient parametric learning from natural language context that requires as few as three query examples. SIEVE uses a novel synthetic data generation pipeline, SIEVE-GEN, that leverages the insight that context is decomposable. Decomposing context allows us to generate higher quality rollouts by pairing synthetic queries with only the applicable context rather than the entirety, then using context distillation to internalize context into the model. We evaluate in reasoning settings where context is necessary, including custom domains and the RuleArena and Machine Translation from One Book tasks. Our results show that SIEVE outperforms prior context distillation methods using just three query examples, demonstrating how to achieve sample-efficient parametric learning from natural language.
Abstract（参考訳）: 自然言語の文脈、例えば指示、知識、フィードバックは、言語モデルに適応するための豊かな信号を含む。文脈内学習はプロンプトを介して適応するが、パラメトリック学習はモデルウェイトに留まり、さらにパフォーマンスを向上させることができる。 SIEVEは,3つのクエリサンプルを必要とする自然言語コンテキストから,サンプル効率のよいパラメトリック学習手法である。 SIEVEは、コンテキストが分解可能であるという洞察を活用する新しい合成データ生成パイプラインであるSIEVE-GENを使用している。コンテキストを分解することで、合成クエリを全体ではなく、適用可能なコンテキストのみとペアリングし、コンテキスト蒸留を用いてコンテキストをモデルに内在化することで、高品質なロールアウトを生成することができる。我々は、カスタムドメインやルールアリーナ、ワンブックタスクからの機械翻訳など、コンテキストが必要な推論設定で評価する。以上の結果から,SIEVEは3つのクエリ例を用いて,従来の文脈蒸留法よりも優れた性能を示し,サンプル効率のよいパラメトリック学習を自然言語から行う方法を示した。

論文の概要: SIEVE: Sample-Efficient Parametric Learning from Natural Language

関連論文リスト