Fugu-MT 論文翻訳(概要): BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

論文の概要: BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

arxiv url: http://arxiv.org/abs/2306.10968v2
Date: Wed, 21 Jun 2023 11:31:50 GMT
ステータス: 翻訳完了
システム内更新日: 2023-06-22 10:37:19.635958
Title: BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models
Title（参考訳）: BayLing:大規模言語モデルの対話的翻訳による言語間アライメントと指示のブリッジ
Authors: Shaolei Zhang, Qingkai Fang, Zhuocheng Zhang, Zhengrui Ma, Yan Zhou, Langlin Huang, Mengyu Bu, Shangtong Gui, Yunji Chen, Xilin Chen, Yang Feng
Abstract要約: 大規模言語モデル (LLM) は言語理解と生成において顕著な進歩を見せている。我々は,LLaMAを基礎LLMとして活用し,命令追従型LLMであるBayLingを開発した。 BayLingのデモ、ホームページ、コード、モデルが利用可能だ。
参考スコア（独自算出の注目度）: 39.03467441090675
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Large language models (LLMs) have demonstrated remarkable prowess in language understanding and generation. Advancing from foundation LLMs to instructionfollowing LLMs, instruction tuning plays a vital role in aligning LLMs to human preferences. However, the existing LLMs are usually focused on English, leading to inferior performance in non-English languages. In order to improve the performance for non-English languages, it is necessary to collect language-specific training data for foundation LLMs and construct language-specific instructions for instruction tuning, both of which are heavy loads. To minimize human workload, we propose to transfer the capabilities of language generation and instruction following from English to other languages through an interactive translation task. We have developed BayLing, an instruction-following LLM by utilizing LLaMA as the foundation LLM and automatically constructing interactive translation instructions for instructing tuning. Extensive assessments demonstrate that BayLing achieves comparable performance to GPT-3.5-turbo, despite utilizing a considerably smaller parameter size of only 13 billion. Experimental results on translation tasks show that BayLing achieves 95% of single-turn translation capability compared to GPT-4 with automatic evaluation and 96% of interactive translation capability compared to GPT-3.5-turbo with human evaluation. To estimate the performance on general tasks, we created a multi-turn instruction test set called BayLing-80. The experimental results on BayLing-80 indicate that BayLing achieves 89% of performance compared to GPT-3.5-turbo. BayLing also demonstrates outstanding performance on knowledge assessment of Chinese GaoKao and English SAT, second only to GPT-3.5-turbo among a multitude of instruction-following LLMs. Demo, homepage, code and models of BayLing are available.
Abstract（参考訳）: 大規模言語モデル (LLM) は言語理解と生成において顕著な進歩を見せている。基礎的な LLM から命令に従う LLM への進化により、命令チューニングは LLM を人間の好みに合わせる上で重要な役割を果たす。しかし、既存のllmは通常英語に焦点を当てており、非英語言語のパフォーマンスは低下している。英語以外の言語の性能を向上させるためには,LLMの基礎となる言語固有の学習データを収集し,言語固有の指導命令を構築する必要がある。人間の作業量を最小限に抑えるため,対話型翻訳タスクを通じて,言語生成能力と指示文を他の言語に伝達することを提案する。我々は,LLaMAを基礎LLMとして活用し,学習指導のための対話型翻訳命令を自動構築する,命令追従LLMのBayLingを開発した。非常に小さなパラメータサイズがわずか13億であるにもかかわらず、ベイリングはGPT-3.5-turboに匹敵する性能を示した。翻訳タスクに関する実験結果から,ベイリングは自動評価によるGPT-4と比較して95%,対話型翻訳能力は96%,人的評価によるGPT-3.5-turboと比較して95%を実現していることがわかった。一般タスクの性能を推定するために,BayLing-80と呼ばれるマルチターン命令セットを開発した。 bayling-80の実験結果はgpt-3.5-turboと比較して89%の性能を達成したことを示している。ベイリングはまた、中国語のガオカオと英語のSATの知識評価に際し卓越した性能を示した。 BayLingのデモ、ホームページ、コード、モデルが利用可能だ。

論文の概要: BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

関連論文リスト