Fugu-MT Paper Translation (Summary): Self-Instruct: Aligning Language Models with Self-Generated Instructions

Paper summary: Self-Instruct: Aligning Language Models with Self-Generated Instructions

arxiv url: http://arxiv.org/abs/2212.10560v2
Date: Thu, 25 May 2023 23:50:07 GMT
Status: Translation complete
Last updated in system: 2023-05-29 22:49:23.634648
Title: Self-Instruct: Aligning Language Models with Self-Generated Instructions
Title (reference translation): Self-Instruct: Aligning Language Models with Self-Generated Instructions
Authors: Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi
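Abstract summary: Self-Instruct is a framework for improving the instruction-following capabilities of pretrained language models. Its pipeline generates instruction, input, and output samples from a language model, then filters out invalid or similar samples before using the rest to finetune the original model. For further evaluation, the authors curate a set of expert-written instructions for novel tasks and show that tuning GPT3 with Self-Instruct outperforms tuning on existing public instruction datasets by a large margin.
Reference score (independently computed attention measure): 76.42871502364697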
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off their own generations. Our pipeline generates instructions, input, and output samples from a language model, then filters invalid or similar ones before using them to finetune the original model. Applying our method to the vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT-001, which was trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT-001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning. Our code and data are available at https://github.com/yizhongw/self-instruct.
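Abstract (reference translation): Large "instruction-tuned" language models (finetuned to respond to instructions) have shown a remarkable ability to generalize zero-shot to new tasks. However, they depend heavily on human-written instruction data, which is often limited in quantity, diversity, and creativity, and this hinders the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models. The pipeline generates instructions, inputs, and outputs from a language model, then filters out invalid or similar samples before using them to finetune the original model. Applying the method to vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT-001, which was trained with private user data and human annotations. In addition, we curate a set of expert-written instructions for novel tasks and show that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT-001. Self-Instruct provides an almost annotation-free method for aligning pretrained language models with instructions. Code and data are available at https://github.com/yizhongw/self-instruct.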
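The filtering step mentioned in the abstract (discarding invalid or similar samples before finetuning) can be illustrated with a minimal sketch. This is not the authors' implementation: the validity rule, the character-level similarity from Python's standard `difflib`, the threshold value of 0.7, and the names `is_valid`, `similarity`, and `filter_samples` are all assumptions made here for illustration only.

```python
from difflib import SequenceMatcher


def is_valid(sample: dict) -> bool:
    """Hypothetical validity rule: instruction and output must be non-empty."""
    instruction = sample.get("instruction", "").strip()
    output = sample.get("output", "").strip()
    return bool(instruction) and bool(output)


def similarity(a: str, b: str) -> float:
    """Character-level similarity ratio in [0, 1]; a stand-in metric, not the paper's."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def filter_samples(candidates: list[dict], pool: list[dict], threshold: float = 0.7) -> list[dict]:
    """Keep valid candidates whose instruction is not too similar to any
    instruction already in the pool or already kept in this pass."""
    kept: list[dict] = []
    for sample in candidates:
        if not is_valid(sample):
            continue  # drop invalid samples
        existing = pool + kept
        if any(similarity(sample["instruction"], e["instruction"]) >= threshold for e in existing):
            continue  # drop near-duplicates
        kept.append(sample)
    return kept


# Illustrative data (made up for this example).
pool = [
    {"instruction": "Translate the sentence into French.", "input": "Good morning.", "output": "Bonjour."},
]
candidates = [
    # Near-duplicate of the pool instruction.
    {"instruction": "Translate the sentence into French!", "input": "Thank you.", "output": "Merci."},
    # Novel and valid.
    {"instruction": "List three prime numbers.", "input": "", "output": "2, 3, 5"},
    # Invalid: empty output.
    {"instruction": "Summarize the paragraph.", "input": "Some text.", "output": ""},
]

kept = filter_samples(candidates, pool)
print([s["instruction"] for s in kept])
```

In a full pipeline the surviving samples would be added to the pool and eventually used as finetuning data; the generation and finetuning steps are omitted here because the abstract does not specify them.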

Related papers