Paper summary: The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
- arxiv url: http://arxiv.org/abs/2309.12288v2
- Date: Fri, 22 Sep 2023 18:08:20 GMT
- Status: Processing complete
- Last updated in system: 2023-09-26 10:36:07.527226
- Title: The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
- Title (reference translation): The Reversal Curse: LLMs trained on "A is B" cannot learn "B is A".
- Authors: Lukas Berglund, Meg Tong, Max Kaufmann, Mikita Balesni, Asa Cooper
Stickland, Tomasz Korbak, Owain Evans
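- Abstract summary: We expose a surprising failure of generalization in auto-regressive large language models (LLMs).
For instance, if a model is trained on "Olaf Scholz was the ninth Chancellor of Germany", it will not automatically be able to answer the question "Who was the ninth Chancellor of Germany?".
- Reference score (independently computed attention score): 5.856975245105276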
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We expose a surprising failure of generalization in auto-regressive large
language models (LLMs). If a model is trained on a sentence of the form "A is
B", it will not automatically generalize to the reverse direction "B is A".
This is the Reversal Curse. For instance, if a model is trained on "Olaf Scholz
was the ninth Chancellor of Germany", it will not automatically be able to
answer the question, "Who was the ninth Chancellor of Germany?". Moreover, the
likelihood of the correct answer ("Olaf Scholz") will not be higher than for a
random name. Thus, models exhibit a basic failure of logical deduction and do
not generalize a prevalent pattern in their training set (i.e. if "A is B"
occurs, "B is A" is more likely to occur). We provide evidence for the Reversal
Curse by finetuning GPT-3 and Llama-1 on fictitious statements such as "Uriah
Hawthorne is the composer of 'Abyssal Melodies'" and showing that they fail to
correctly answer "Who composed 'Abyssal Melodies?'". The Reversal Curse is
robust across model sizes and model families and is not alleviated by data
augmentation. We also evaluate ChatGPT (GPT-3.5 and GPT-4) on questions about
real-world celebrities, such as "Who is Tom Cruise's mother? [A: Mary Lee
Pfeiffer]" and the reverse "Who is Mary Lee Pfeiffer's son?". GPT-4 correctly
answers questions like the former 79% of the time, compared to 33% for the
latter. This shows a failure of logical deduction that we hypothesize is caused
by the Reversal Curse. Code is available at
- Abstract (reference translation): We expose a surprising failure of generalization in auto-regressive large language models (LLMs).
If a model is trained on a sentence of the form "A is B", it will not automatically generalize to "B is A".
For instance, if a model is trained on "Olaf Scholz was the ninth Chancellor of Germany", it will not automatically be able to answer the question "Who was the ninth Chancellor of Germany?".
Moreover, the likelihood of the correct answer ("Olaf Scholz") will not be higher than for a random name.
Thus, models exhibit a basic failure of logical deduction and do not generalize a prevalent pattern in their training set (i.e. if "A is B" occurs, "B is A" is more likely to occur). We provide evidence for the Reversal Curse by finetuning GPT-3 and Llama-1 on fictitious statements such as "Uriah Hawthorne is the composer of 'Abyssal Melodies'" and showing that they fail to correctly answer "Who composed 'Abyssal Melodies?'". The Reversal Curse is robust across model sizes and model families and is not alleviated by data augmentation. We also evaluate ChatGPT (GPT-3.5 and GPT-4) on questions about real-world celebrities, such as "Who is Tom Cruise's mother?
[A: Mary Lee Pfeiffer]" and the reverse "Who is Mary Lee Pfeiffer's son?".
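The comparison described in the abstract (does the correct name score higher than other names when the fact is queried in reverse?) can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' code: `preference_rate`, `make_toy_scorer`, and the second fact ("Example Person") are hypothetical names introduced here, and the toy scorer is a verbatim string lookup standing in for a finetuned model's log-likelihood, not a language model.

```python
from typing import Callable, List, Tuple

# (name, description) pairs. The first is the fictitious example quoted in the
# abstract; the second is a placeholder invented here for illustration.
FACTS: List[Tuple[str, str]] = [
    ("Uriah Hawthorne", "the composer of 'Abyssal Melodies'"),
    ("Example Person", "the author of 'Example Book'"),
]

# Hypothetical scorer type: (prompt, completion) -> log-likelihood-like score.
Scorer = Callable[[str, str], float]
Item = Tuple[str, str, List[str]]  # (prompt, correct completion, distractors)


def forward_statement(name: str, description: str) -> str:
    """Training-style sentence in the "A is B" direction."""
    return f"{name} is {description}"


def reverse_prompt(description: str) -> str:
    """Prompt in the "B is A" direction; the completion should be the name."""
    return f"{description[0].upper()}{description[1:]} is"


def preference_rate(score: Scorer, items: List[Item]) -> float:
    """Fraction of items where the correct completion outscores every distractor."""
    hits = 0
    for prompt, correct, distractors in items:
        if all(score(prompt, correct) > score(prompt, d) for d in distractors):
            hits += 1
    return hits / len(items)


def make_toy_scorer(training_text: List[str]) -> Scorer:
    """Toy stand-in for a model: rewards only strings seen verbatim in training."""
    def score(prompt: str, completion: str) -> float:
        candidate = f"{prompt} {completion}"
        return 0.0 if any(candidate in t for t in training_text) else -10.0
    return score


names = [n for n, _ in FACTS]
descriptions = [d for _, d in FACTS]

# Same direction as training: complete the description given the name.
forward_items: List[Item] = [
    (f"{n} is", d, [x for x in descriptions if x != d]) for n, d in FACTS
]
# Reverse direction: complete the name given the description.
reverse_items: List[Item] = [
    (reverse_prompt(d), n, [x for x in names if x != n]) for n, d in FACTS
]

toy = make_toy_scorer([forward_statement(n, d) for n, d in FACTS])
print("forward:", preference_rate(toy, forward_items))  # 1.0
print("reverse:", preference_rate(toy, reverse_items))  # 0.0
```

The toy scorer fails in the reverse direction by construction, so it only shows the shape of the measurement. To test a real model, `score` would be replaced with that model's log-likelihood of the completion given the prompt.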
- Delving into the Reversal Curse: How Far Can Large Language Models Generalize? [40.64539467276017]
Paper, reference translation (metadata) (2024-10-24T14:55:09Z)
- SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights [89.56181323849512]
Paper, reference translation (metadata) (2024-10-11T17:25:52Z)
- Reverse Training to Nurse the Reversal Curse [42.8324011011372]
Large language models (LLMs) have a surprising failure: when trained on "A has a feature B", they do not generalize to "B is a feature of A", which is termed the Reversal Curse.
Paper, reference translation (metadata) (2024-03-20T17:55:35Z)
- Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training [57.771940716189114]
To address this issue, we propose Semantic-aware Permutation Training (SPT).
Paper, reference translation (metadata) (2024-03-01T18:55:20Z)
- An Analysis and Mitigation of the Reversal Curse [70.13419502543915]
Paper, reference translation (metadata) (2023-11-13T17:01:12Z)
- Physics of Language Models: Part 3.2, Knowledge Manipulation [51.68385617116854]
This also applies to modern pretrained language models such as GPT-4.
Paper, reference translation (metadata) (2023-09-25T17:50:41Z)
- TruthfulQA: Measuring How Models Mimic Human Falsehoods [2.7143159361691227]
Paper, reference translation (metadata) (2021-09-08T17:15:27Z)
- AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-directional Metric Learning [61.8003954296545]
We propose a novel adversarial-training-based model using Attention Guided Knowledge Distillation and Bi-directional Metric Learning (AGKD-BML).
Paper, reference translation (metadata) (2021-08-13T01:25:04Z)