Fugu-MT 論文翻訳(概要): Can language models learn from explanations in context?

論文の概要: Can language models learn from explanations in context?

arxiv url: http://arxiv.org/abs/2204.02329v1
Date: Tue, 5 Apr 2022 16:33:44 GMT
ステータス: 翻訳完了
システム内更新日: 2022-04-06 13:12:36.651270
Title: Can language models learn from explanations in context?
Title（参考訳）: 言語モデルは文脈の説明から学ぶことができるか?
Authors: Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. McClelland, Jane X. Wang, and Felix Hill
Abstract要約: 大規模言語モデルは、いくつかのコンテキスト内例に適応することで、新しいタスクを実行することができる。人間にとって、例からの素早い学習は、例とタスク原則を結びつける説明の恩恵を受けることができる。少数例の説明によって言語モデルがより効果的に適応できるかどうかを考察する。
参考スコア（独自算出の注目度）: 21.67788893486215
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models can perform new tasks by adapting to a few in-context examples. For humans, rapid learning from examples can benefit from explanations that connect examples to task principles. We therefore investigate whether explanations of few-shot examples can allow language models to adapt more effectively. We annotate a set of 40 challenging tasks from BIG-Bench with explanations of answers to a small subset of questions, as well as a variety of matched control explanations. We evaluate the effects of various zero-shot and few-shot prompts that include different types of explanations, instructions, and controls on the performance of a range of large language models. We analyze these results using statistical multilevel modeling techniques that account for the nested dependencies among conditions, tasks, prompts, and models. We find that explanations of examples can improve performance. Adding untuned explanations to a few-shot prompt offers a modest improvement in performance; about 1/3 the effect size of adding few-shot examples, but twice the effect size of task instructions. We then show that explanations tuned for performance on a small validation set offer substantially larger benefits; building a prompt by selecting examples and explanations together substantially improves performance over selecting examples alone. Hand-tuning explanations can substantially improve performance on challenging tasks. Furthermore, even untuned explanations outperform carefully matched controls, suggesting that the benefits are due to the link between an example and its explanation, rather than lower-level features of the language used. However, only large models can benefit from explanations. In summary, explanations can support the in-context learning abilities of large language models on
Abstract（参考訳）: 大規模言語モデルは、いくつかのコンテキスト内例に適応することで、新しいタスクを実行することができる。人間にとって、例からの迅速な学習は、例とタスク原則を結びつける説明の恩恵を受ける。そこで本研究では,少数例の説明によって言語モデルがより効果的に適応できるかどうかを検討する。我々は,BIG-Benchの課題40のセットに,質問のサブセットに対する回答の説明と,一致したさまざまな制御説明を加えて注釈付けする。本研究では,多種多様な説明,指示,制御を含むゼロショットおよび少数ショットのプロンプトが,多種多様な言語モデルの性能に与える影響を評価する。条件,タスク,プロンプト,モデル間のネスト依存性を考慮した統計的マルチレベルモデリング手法を用いて,これらの結果を分析する。例の説明がパフォーマンスを向上できることが分かりました。数ショットのプロンプトに修正されていない説明を追加することで、パフォーマンスがわずかに向上する。次に、小検証セットのパフォーマンスに調整された説明は、かなり大きな利点をもたらすことを示し、例と説明を一緒に選択することでプロンプトを構築することで、例のみを選択するよりもパフォーマンスを大幅に改善する。ハンドチューニングの説明は、困難なタスクのパフォーマンスを大幅に改善する。さらに、トレーニングされていない説明でさえ、注意深くマッチした制御よりも優れており、その利点は、使用される言語の低レベルな特徴よりも、例と説明とのリンクによるものであることを示唆している。しかし、大きなモデルだけが説明の恩恵を受けることができる。要約すると、説明は大規模言語モデルの文脈内学習能力をサポートすることができる。

論文の概要: Can language models learn from explanations in context?

関連論文リスト