Fugu-MT Paper Translation (Abstract): Automatic Instruction Optimization for Open-source LLM Instruction Tuning

Paper overview: Automatic Instruction Optimization for Open-source LLM Instruction Tuning

arxiv url: http://arxiv.org/abs/2311.13246v1
Date: Wed, 22 Nov 2023 09:04:57 GMT
Status: Translation complete
Last updated in system: 2023-11-23 15:44:23.758001
Title: Automatic Instruction Optimization for Open-source LLM Instruction Tuning
Title (reference translation): Automatic instruction optimization for open-source LLM instruction tuning
Authors: Yilun Liu, Shimin Tao, Xiaofeng Zhao, Ming Zhu, Wenbing Ma, Junhao Zhu, Chang Su, Yutai Hou, Miao Zhang, Min Zhang, Hongxia Ma, Li Zhang, Hao Yang, Yanfei Jiang
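Abstract summary: We propose CoachLM, a novel approach that enhances the quality of instruction datasets by automatically revising the samples in the dataset. CoachLM is trained on samples revised by human experts and significantly increases the proportion of high-quality samples in the dataset from 17.7% to 78.9%. The results show that CoachLM improves the instruction-following capabilities of the instruction-tuned LLM by an average of 29.9%.
Reference score (independently computed attention measure): 33.27796882562961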
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Instruction tuning is crucial for enabling large language models (LLMs) to respond to human instructions. The quality of the instruction pairs used for tuning greatly affects the performance of LLMs. However, the manual creation of high-quality instruction datasets is costly, so automatic generation of instruction pairs by LLMs has become a popular alternative in the training of open-source LLMs. Several approaches have been proposed to ensure the quality of LLM-generated instruction datasets. Nevertheless, existing methods either compromise dataset integrity by filtering out a large proportion of samples, or are unsuitable for industrial applications. In this paper, instead of discarding low-quality samples, we propose CoachLM, a novel approach that enhances the quality of instruction datasets through automatic revision of the samples in the dataset. CoachLM is trained on samples revised by human experts and significantly increases the proportion of high-quality samples in the dataset from 17.7% to 78.9%. The effectiveness of CoachLM is further assessed on various real-world instruction test sets. The results show that CoachLM improves the instruction-following capabilities of the instruction-tuned LLM by an average of 29.9%, which even surpasses larger LLMs with nearly twice the number of parameters. Furthermore, CoachLM has been successfully deployed in a data management system for LLMs at Huawei, resulting in an efficiency improvement of up to 20% in the cleaning of 40k real-world instruction pairs. We release the training data and code of CoachLM (https://github.com/lunyiliu/CoachLM).
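Abstract (reference translation): Instruction tuning is essential for enabling large language models (LLMs) to respond to human instructions. The quality of the instruction pairs used for tuning strongly affects LLM performance. However, creating high-quality instruction datasets by hand is costly, so automatic generation of instruction pairs by LLMs has become a common alternative when training open-source LLMs. Several approaches have been proposed to ensure the quality of LLM-generated instruction datasets. Nevertheless, existing methods either compromise dataset integrity by filtering out a large number of samples, or are unsuitable for industrial applications. In this paper, instead of discarding low-quality samples, we propose CoachLM, a novel approach that enhances the quality of instruction datasets by automatically revising the samples in the dataset. CoachLM is trained on samples revised by human experts and significantly increases the proportion of high-quality samples in the dataset from 17.7% to 78.9%. The effectiveness of CoachLM is further evaluated on various real-world instruction test sets. The results show that CoachLM improves the instruction-following capabilities of the instruction-tuned LLM by an average of 29.9%, surpassing even LLMs with nearly twice as many parameters. Furthermore, CoachLM has been deployed in a data management system for LLMs at Huawei, achieving an efficiency improvement of up to 20% in the cleaning of 40k real-world instruction pairs. The training data and code of CoachLM are released (https://github.com/lunyiliu/CoachLM).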
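
The abstract contrasts two ways of cleaning an instruction dataset: filtering out low-quality samples, which shrinks the dataset, and revising them, which keeps every sample. The Python sketch below illustrates only that contrast. It is not CoachLM: the names `InstructionPair`, `toy_is_high_quality`, and `toy_revise` are hypothetical, the quality check is a trivial string rule, and the reviser is a stand-in for a trained model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class InstructionPair:
    instruction: str
    response: str


QualityCheck = Callable[[InstructionPair], bool]
Reviser = Callable[[InstructionPair], InstructionPair]


def filter_dataset(pairs: List[InstructionPair], ok: QualityCheck) -> List[InstructionPair]:
    """Filtering baseline: drop every pair that fails the quality check."""
    return [p for p in pairs if ok(p)]


def revise_dataset(pairs: List[InstructionPair], ok: QualityCheck, revise: Reviser) -> List[InstructionPair]:
    """Revision approach: keep every pair, rewriting the ones that fail the check."""
    return [p if ok(p) else revise(p) for p in pairs]


def high_quality_share(pairs: List[InstructionPair], ok: QualityCheck) -> float:
    """Fraction of pairs that pass the quality check (0.0 for an empty list)."""
    if not pairs:
        return 0.0
    return sum(1 for p in pairs if ok(p)) / len(pairs)


def toy_is_high_quality(pair: InstructionPair) -> bool:
    # Assumption for illustration only: a "good" response has no stray
    # whitespace and ends with a period.
    return pair.response == pair.response.strip() and pair.response.endswith(".")


def toy_revise(pair: InstructionPair) -> InstructionPair:
    # Stand-in for a learned reviser: trim whitespace and add a final period.
    text = pair.response.strip()
    if not text.endswith("."):
        text += "."
    return InstructionPair(pair.instruction, text)


pairs = [
    InstructionPair("Name a primary color.", "Red."),
    InstructionPair("Add 2 and 3.", "5"),
    InstructionPair("Define an LLM.", " a large language model "),
    InstructionPair("Say hello.", "Hello."),
]

filtered = filter_dataset(pairs, toy_is_high_quality)
revised = revise_dataset(pairs, toy_is_high_quality, toy_revise)

print(len(filtered), high_quality_share(pairs, toy_is_high_quality))
print(len(revised), high_quality_share(revised, toy_is_high_quality))
```

In this toy data, filtering keeps only the two samples that already pass, while revision keeps all four and raises the passing share. The numbers here come from the made-up samples and say nothing about the figures reported in the abstract.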
