Fugu-MT 論文翻訳(概要): Panini: Continual Learning in Token Space via Structured Memory

論文の概要: Panini: Continual Learning in Token Space via Structured Memory

arxiv url: http://arxiv.org/abs/2602.15156v1
Date: Mon, 16 Feb 2026 19:58:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-18 16:03:17.890924
Title: Panini: Continual Learning in Token Space via Structured Memory
Title（参考訳）: Panini: 構造化メモリによるトークン空間での継続的な学習
Authors: Shreyas Rajesh, Pavan Holur, Mehmet Yigit Turali, Chenda Duan, Vwani Roychowdhury,
Abstract要約: 言語モデルは、トレーニングされていないコンテンツを推論するために、ますます使われています。一般的なアプローチは検索拡張生成(RAG)であり、これは動詞の文書を(チャンクとして)外部に保存し、推論時に関連するサブセットのみを取得する。本稿では,基本モデルを固定したままで,新たな経験を外部セマンティックメモリ状態に統合することで学習を行う,人間のような非パラメトリック連続学習フレームワークを提案する。
参考スコア（独自算出の注目度）: 4.979820180013486
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Language models are increasingly used to reason over content they were not trained on, such as new documents, evolving knowledge, and user-specific data. A common approach is retrieval-augmented generation (RAG), which stores verbatim documents externally (as chunks) and retrieves only a relevant subset at inference time for an LLM to reason over. However, this results in inefficient usage of test-time compute (LLM repeatedly reasons over the same documents); moreover, chunk retrieval can inject irrelevant context that increases unsupported generation. We propose a human-like non-parametric continual learning framework, where the base model remains fixed, and learning occurs by integrating each new experience into an external semantic memory state that accumulates and consolidates itself continually. We present Panini, which realizes this by representing documents as Generative Semantic Workspaces (GSW) -- an entity- and event-aware network of question-answer (QA) pairs, sufficient for an LLM to reconstruct the experienced situations and mine latent knowledge via reasoning-grounded inference chains on the network. Given a query, Panini only traverses the continually-updated GSW (not the verbatim documents or chunks), and retrieves the most likely inference chains. Across six QA benchmarks, Panini achieves the highest average performance, 5%-7% higher than other competitive baselines, while using 2-30x fewer answer-context tokens, supports fully open-source pipelines, and reduces unsupported answers on curated unanswerable queries. The results show that efficient and accurate structuring of experiences at write time -- as achieved by the GSW framework -- yields both efficiency and reliability gains at read time. Code is available at https://github.com/roychowdhuryresearch/gsw-memory.
Abstract（参考訳）: 言語モデルは、新しいドキュメント、進化する知識、ユーザー固有のデータなど、トレーニングされていないコンテンツを推論するために、ますます使われています。一般的なアプローチは検索拡張生成(RAG)であり、これは(チャンクとして)冗長文書を外部に保存し、LLMが推論する時間にのみ関連するサブセットを検索する。しかし、この結果、テスト時間計算(LLMが同じ文書に対して繰り返し発生する理由)の非効率な使用が可能となり、さらにチャンク検索は、サポートされない生成を増大させる無関係なコンテキストを注入することができる。基本モデルが固定されたままであり,新たな経験を外部のセマンティックメモリ状態に統合することで学習を連続的に蓄積・統合する,人間のような非パラメトリック連続学習フレームワークを提案する。パニーニは、文書をジェネレーティブ・セマンティック・ワークスペース(GSW)として表現することでこれを実現し、質問応答(QA)ペアのエンティティとイベントを意識したネットワークであり、LCMが経験した状況を再構築し、ネットワーク上の推論連鎖を通じて潜伏知識をマイニングするのに十分なものである。クエリが与えられた場合、パニーニは継続的に更新されたGSW(動詞の文書やチャンクではない)を横切り、最も起こりそうな推論チェーンを検索する。 6つのQAベンチマークで、パニーニは、他の競争ベースラインよりも5%-7%高い平均パフォーマンスを達成し、2-30倍少ない回答コンテキストトークンを使用し、完全なオープンソースパイプラインをサポートし、キュレートされた未解決クエリに対するサポート回答を削減した。 GSWフレームワークが達成したように、書き込み時のエクスペリエンスの効率的かつ正確な構造化は、読み取り時の効率性と信頼性の両方を向上させる。コードはhttps://github.com/roychowdhuryresearch/gsw-Memoryで入手できる。

論文の概要: Panini: Continual Learning in Token Space via Structured Memory

関連論文リスト