Fugu-MT 論文翻訳(概要): Bilinear relational structure fixes reversal curse and enables consistent model editing

論文の概要: Bilinear relational structure fixes reversal curse and enables consistent model editing

arxiv url: http://arxiv.org/abs/2509.21993v1
Date: Fri, 26 Sep 2025 07:19:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-29 20:57:54.267043
Title: Bilinear relational structure fixes reversal curse and enables consistent model editing
Title（参考訳）: 双線形関係構造が逆の呪文を修正し、一貫したモデル編集を可能にする
Authors: Dong-Kyum Kim, Minsung Kim, Jea Kwon, Nakyeong Yang, Meeyoung Cha,
Abstract要約: 逆の呪いは本質的に失敗ではなく、モデルが知識をエンコードする方法の成果であることを示す。関係知識グラフの合成データセットをスクラッチからトレーニングすることにより、両線形関係構造が隠れ表現に現れることを示す。この構造は逆の呪いを著しく軽減し、LMが見えない逆の事実を推測することを可能にする。
参考スコア（独自算出の注目度）: 18.483285872202107
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The reversal curse -- a language model's (LM) inability to infer an unseen fact ``B is A'' from a learned fact ``A is B'' -- is widely considered a fundamental limitation. We show that this is not an inherent failure but an artifact of how models encode knowledge. By training LMs from scratch on a synthetic dataset of relational knowledge graphs, we demonstrate that bilinear relational structure emerges in their hidden representations. This structure substantially alleviates the reversal curse, enabling LMs to infer unseen reverse facts. Crucially, we also find that this bilinear structure plays a key role in consistent model editing. When a fact is updated in a LM with this structure, the edit correctly propagates to its reverse and other logically dependent facts. In contrast, models lacking this representation not only suffer from the reversal curse but also fail to generalize edits, further introducing logical inconsistencies. Our results establish that training on a relational knowledge dataset induces the emergence of bilinear internal representations, which in turn enable LMs to behave in a logically consistent manner after editing. This implies that the success of model editing depends critically not just on editing algorithms but on the underlying representational geometry of the knowledge being modified.
Abstract（参考訳）: 言語モデル(LM)では、学習された事実である ``A is B'' から ``B is A'' を推測できないという逆の呪いは、基本的な制限とみなされている。これは本質的に失敗ではなく、モデルが知識をエンコードする方法の成果物であることを示している。関係知識グラフの合成データセットをスクラッチからトレーニングすることにより、両線形関係構造が隠れ表現に現れることを示す。この構造は逆の呪いを著しく軽減し、LMが見えない逆の事実を推測することを可能にする。重要なことに、この双線形構造は一貫性のあるモデル編集において重要な役割を果たす。この構造を持つLMで事実が更新されると、編集はその逆や論理的に依存する事実に正しく伝播する。対照的に、この表現を欠いたモデルは、逆の呪いに苦しむだけでなく、編集の一般化にも失敗し、さらに論理的な矛盾がもたらされる。この結果から,関係知識データセットを用いたトレーニングは,二線形内部表現の出現を誘導し,その結果,LMが編集後に論理的に一貫した振る舞いをすることができることがわかった。これは、モデル編集の成功は、編集アルゴリズムだけでなく、修正される知識の基本的な表現幾何学にも大きく依存していることを意味する。

論文の概要: Bilinear relational structure fixes reversal curse and enables consistent model editing

関連論文リスト