Fugu-MT 論文翻訳(概要): Self-Instruct: Aligning Language Model with Self Generated Instructions

論文の概要: Self-Instruct: Aligning Language Model with Self Generated Instructions

arxiv url: http://arxiv.org/abs/2212.10560v1
Date: Tue, 20 Dec 2022 18:59:19 GMT
ステータス: 翻訳完了
システム内更新日: 2022-12-21 14:00:17.051829
Title: Self-Instruct: Aligning Language Model with Self Generated Instructions
Title（参考訳）: self-instruct: 言語モデルと自己生成命令の整合
Authors: Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi
Abstract要約: Self-Instructは、事前訓練された言語モデルの命令フォロー機能を改善するためのフレームワークである。私たちのパイプラインは、言語モデルからインストラクション、インプット、およびアウトプットを生成し、それを使用して元のモデルを微調整する。さらなる評価のために、新規タスクのエキスパートによる指示のセットをキュレートし、GPT3とセルフインストラクトのチューニングが既存の公開インストラクションデータセットを大きなマージンで向上することを示す。
参考スコア（独自算出の注目度）: 76.42871502364697
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large "instruction-tuned" language models (finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off its own generations. Our pipeline generates instruction, input, and output samples from a language model, then prunes them before using them to finetune the original model. Applying our method to vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT_001, which is trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT_001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning.
Abstract（参考訳）: 命令に応答するために微調整された)大規模な「命令調整」言語モデルは、ゼロショットを新しいタスクに一般化する驚くべき能力を示している。それでも、それらは量、多様性、創造性に制限された人間による命令データに大きく依存しているため、調整されたモデルの一般化を妨げる。我々は,事前学習された言語モデルの命令追従能力を向上させるためのフレームワークであるself-instructを紹介する。私たちのパイプラインは、言語モデルからインストラクション、インプット、およびアウトプットを生成し、それを使用して元のモデルを微調整する。提案手法をバニラGPT3に適用することにより,個人のユーザデータと人間のアノテーションをトレーニングしたInstructGPT_001の性能に匹敵する,Super-Natural Instructionsのオリジナルモデルに対する33%の絶対的な改善を実演する。さらに,新しいタスクに対する専門家による指示の集合をキュレートし,既存の公開命令データセットを用いてGPT3とセルフインストラクトのチューニング性能を大きなマージンで向上させ,InstructGPT_001の背後には5%の絶対差しか残っていないことを示す。 Self-Instructは、事前訓練された言語モデルを命令と整合させるほとんどアノテーションのない方法を提供する。

論文の概要: Self-Instruct: Aligning Language Model with Self Generated Instructions

関連論文リスト