Fugu-MT 論文翻訳(概要): DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention

論文の概要: DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention

arxiv url: http://arxiv.org/abs/2508.07185v1
Date: Sun, 10 Aug 2025 05:22:38 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-12 21:23:28.742521
Title: DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention
Title（参考訳）: DySK-Attn:動的スパース知識による大規模言語モデルの効率的なリアルタイム更新のためのフレームワーク
Authors: Kabir Khan, Priya Sharma, Arjun Mehta, Neha Gupta, Ravi Narayanan,
Abstract要約: 大きな言語モデル(LLM)は、その知識が静的で、すぐに時代遅れになるという限界に悩まされる。我々は,LLMが動的外部ソースからリアルタイム知識を効率的に統合できる新しいフレームワークであるDySK-Attnを提案する。
参考スコア（独自算出の注目度）: 2.168994673914057
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) suffer from a critical limitation: their knowledge is static and quickly becomes outdated. Retraining these massive models is computationally prohibitive, while existing knowledge editing techniques can be slow and may introduce unforeseen side effects. To address this, we propose DySK-Attn, a novel framework that enables LLMs to efficiently integrate real-time knowledge from a dynamic external source. Our approach synergizes an LLM with a dynamic Knowledge Graph (KG) that can be updated instantaneously. The core of our framework is a sparse knowledge attention mechanism, which allows the LLM to perform a coarse-to-fine grained search, efficiently identifying and focusing on a small, highly relevant subset of facts from the vast KG. This mechanism avoids the high computational cost of dense attention over the entire knowledge base and mitigates noise from irrelevant information. We demonstrate through extensive experiments on time-sensitive question-answering tasks that DySK-Attn significantly outperforms strong baselines, including standard Retrieval-Augmented Generation (RAG) and model editing techniques, in both factual accuracy for updated knowledge and computational efficiency. Our framework offers a scalable and effective solution for building LLMs that can stay current with the ever-changing world.
Abstract（参考訳）: 大きな言語モデル(LLM)は、その知識が静的で、すぐに時代遅れになるという限界に悩まされる。これらの膨大なモデルをリトレーニングすることは、計算的に禁止されるが、既存の知識編集技術は遅く、予期せぬ副作用をもたらす可能性がある。そこで我々は,LLMが動的外部ソースからリアルタイム知識を効率的に統合できる新しいフレームワークであるDySK-Attnを提案する。本手法は,リアルタイムに更新可能な動的知識グラフ(KG)を用いてLCMをシナジする。我々のフレームワークの中核は、粗い知識の注意機構であり、LLMは粗い粒度の探索を行い、膨大なKGから得られる、非常に関連性の高い事実の小さなサブセットを効率的に識別し、焦点を合わせることができる。このメカニズムは、知識ベース全体に対する高密度な注意の計算コストを回避し、無関係な情報からノイズを軽減します。我々は,DySK-Attnが標準検索・拡張生成(RAG)やモデル編集技術など,強いベースラインを著しく上回り,知識の更新と計算効率の両面において,時間に敏感な質問応答タスクに関する広範な実験を行った。当社のフレームワークは、常に変化する世界と現在の状態を維持するLLMを構築するための、スケーラブルで効果的なソリューションを提供します。

論文の概要: DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention

関連論文リスト