Fugu-MT 論文翻訳(概要): Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models

論文の概要: Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models

arxiv url: http://arxiv.org/abs/2410.05802v1
Date: Tue, 8 Oct 2024 08:35:16 GMT
ステータス: 翻訳完了
システム内更新日: 2024-11-01 12:39:56.772500
Title: Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models
Title（参考訳）: グラデーショナルラーニング:大規模言語モデルにおける知識の部分的習得による微調整の最適化
Authors: Bozhou Li, Hao Liang, Yang Li, Fangcheng Fu, Hongzhi Yin, Conghui He, Wentao Zhang,
Abstract要約: 大規模言語モデル(LLM)は、事前学習期間中に大量のテキストコーパスから膨大な量の知識を取得する。微調整や推論のような後段では、モデルは初期訓練でカバーされていない知識に遭遇する可能性がある。本稿では,モデル全体のテスト精度と知識保持性を改善するための2段階の微調整戦略を提案する。
参考スコア（独自算出の注目度）: 51.20499954955646
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: During the pretraining phase, large language models (LLMs) acquire vast amounts of knowledge from extensive text corpora. Nevertheless, in later stages such as fine-tuning and inference, the model may encounter knowledge not covered in the initial training, which can lead to hallucinations and degraded performance. This issue has a profound impact on the model's capabilities, as it will inevitably face out-of-scope knowledge after pretraining. Furthermore, fine-tuning is often required to adapt LLMs to domain-specific tasks. However, this phenomenon limits the model's ability to learn and integrate new information during fine-tuning. The effectiveness of fine-tuning largely depends on the type of knowledge involved. Existing research suggests that fine-tuning the model on partially mastered knowledge-for instance, question-answer pairs where the model has a chance of providing correct responses under non-greedy decoding-can enable the model to acquire new knowledge while mitigating hallucination. Notably, this approach can still lead to the forgetting of fully mastered knowledge, constraining the fine-tuning dataset to a narrower range and limiting the model's overall potential for improvement. Given the model's intrinsic reasoning abilities and the interconnectedness of different knowledge areas, it is likely that as the model's capacity to utilize existing knowledge improves during fine-tuning, previously unmastered knowledge may become more understandable. To explore this hypothesis, we conducted experiments and, based on the results, proposed a two-stage fine-tuning strategy. This approach not only improves the model's overall test accuracy and knowledge retention but also preserves its accuracy on previously mastered content. When fine-tuning on the WikiQA dataset, our method increases the amount of knowledge acquired by the model in this stage by 24%.
Abstract（参考訳）: 事前学習期間中、大規模言語モデル(LLM)は広範なテキストコーパスから膨大な量の知識を取得する。それでも、微調整や推論のような後の段階では、モデルは初期訓練でカバーされていない知識に遭遇し、幻覚や性能低下につながる可能性がある。この問題は、事前トレーニング後に必然的にスコープ外の知識に直面するため、モデルの能力に大きな影響を与えます。さらに、LLMをドメイン固有のタスクに適応させるためには、微調整が必要とされることが多い。しかし、この現象は、微調整中に新しい情報を学習し統合する能力を制限する。微調整の有効性は、主に関連する知識の種類に依存する。既存の研究では、部分的にマスターされた知識に基づいてモデルを微調整し、例えば、非グリーディデコードの下で正しい応答を提供することができる質問と回答のペアは、幻覚を緩和しながら新しい知識を得ることを可能にすることを示唆している。特に、このアプローチは、完全にマスターされた知識を忘れて、微調整データセットをより狭い範囲に制限し、モデル全体の改善の可能性を制限することにつながる可能性がある。モデルの本質的な推論能力と異なる知識領域の相互接続性を考えると、既存の知識を活用する能力が微調整中に向上するにつれて、以前は未習得の知識がより理解しやすくなる可能性がある。この仮説を探索するため,実験を行い,その結果に基づいて2段階の微調整戦略を提案した。このアプローチは、モデル全体のテスト精度と知識保持を向上するだけでなく、以前にマスターされたコンテンツの正確性も維持する。 WikiQAデータセットを微調整すると、この段階で得られた知識の量は24%増加する。

論文の概要: Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models

関連論文リスト