Fugu-MT 論文翻訳(概要): User as Engram: Internalizing Per-User Memory as Local Parametric Edits

論文の概要: User as Engram: Internalizing Per-User Memory as Local Parametric Edits

arxiv url: http://arxiv.org/abs/2606.19172v1
Date: Wed, 17 Jun 2026 15:15:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-18 17:16:51.233438
Title: User as Engram: Internalizing Per-User Memory as Local Parametric Edits
Title（参考訳）: ユーザ・アズ・エングラム: ローカルパラメトリック・エディットとしてユーザ毎のメモリを内部化する
Authors: Bojie Li,
Abstract要約: 現在、ほとんどのパーソナライゼーションは、自然言語のメモリファイルや検索インデックスにおいて、ユーザの事実を重みの外側に保持している。ユーザ・アズ・エングラム(User as Engram)を提案する。エングラムモデルのハッシュキーメモリテーブルに,外科的編集としてユーザのコンテンツを格納する。この層状設計はユーザ毎のLoRAの直接リコールと一致し、平均して5.6倍高い間接推論精度を提供する。
参考スコア（独自算出の注目度）: 0.3951796994513004
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Personal memory in a language model is two problems: content and reasoning skill. The brain keeps the two apart (a sparse, local engram in the hippocampus for each episode, a slow neocortex for the shared skills that interpret it), so a new fact need not overwrite everything else. Most personalization today keeps a user's facts outside the weights, in a natural-language memory file or a retrieval index. When facts are written into the model instead, the standard recipe is the per-user LoRA adapter, which does the opposite of the brain, folding content and skill into one global weight delta. Writing a user's facts as a LoRA contaminates text unrelated to them; writing the same facts as local Engram rows leaves it mathematically untouched, resulting in a roughly 33,000x smaller memory footprint. We therefore propose User as Engram: store a user's content as surgical edits to the hash-keyed memory table of an Engram model, and carry the reasoning skill in one shared adapter. This layered design matches per-user LoRA's direct recall while delivering 5.6x higher indirect-reasoning accuracy on average, and never makes a single user worse at reasoning than the untouched base. The edit is a glass box: writing a fact switches on its lookup at exactly the trigger, adds the value the answer needs, leaves every other position unchanged to the last bit, and fails if written into the wrong layer. Because different users' facts land in disjoint hash slots, their edits compose: many users live in one shared table at once, stacking additively and losslessly, where a per-user LoRA, a single global weight delta, admits only one. Upon retrieval, a per-user Engram table does not grow with the population the retriever must search, so past ~100 facts it overtakes a retrieval pipeline on a 2.5x larger model.
Abstract（参考訳）: 言語モデルにおけるパーソナルメモリは、内容と推論のスキルの2つの問題である。脳は2つの部分(各エピソードの海馬の局所的なエングラム、それを解釈する共有スキルの低速な新皮質)を分離しているため、新しい事実は他の全てを上書きする必要はない。現在、ほとんどのパーソナライゼーションは、自然言語のメモリファイルや検索インデックスにおいて、ユーザの事実を重みの外側に保持している。代わりに事実がモデルに書き込まれるとき、標準的なレシピはユーザ毎のLoRAアダプタで、脳の反対、コンテンツ、スキルを1つのグローバルな重量差に分割する。ユーザの事実をLoRAとして書くと、それらと無関係なテキストが汚染され、ローカルのEngram行と同じ事実が数学的に無傷のまま残され、結果としてメモリフットプリントが約33,000倍小さくなる。そこで我々は,ユーザ・アズ・エングラムを提案する: ユーザのコンテンツをEngramモデルのハッシュキーメモリテーブルに外科的編集として保存し,その推論スキルを1つの共有アダプタに保持する。この階層化された設計は、ユーザ毎のLoRAの直接リコールと一致し、平均して5.6倍高い間接推論精度を実現している。編集はガラスボックスで、トリガーのルックアップにファクトスイッチを書き、必要な値を追加し、他のすべての位置を最後のビットに変更し、間違ったレイヤに書き込むとフェールする。多くのユーザが一度に1つの共有テーブルに住み、付加的かつ損失的に積み重ね、ユーザ毎のLoRA(グローバルな1つの重み付けデルタ)が1つしか認めない。検索時には、ユーザ当たりのEngramテーブルは、検索者が検索しなければならない人口とともに成長しないため、過去100年ほどで2.5倍のモデルで検索パイプラインを上回っている。

論文の概要: User as Engram: Internalizing Per-User Memory as Local Parametric Edits

関連論文リスト