Fugu-MT 論文翻訳(概要): GraphKV: Breaking the Static Selection Paradigm with Graph-Based KV Cache Eviction

論文の概要: GraphKV: Breaking the Static Selection Paradigm with Graph-Based KV Cache Eviction

arxiv url: http://arxiv.org/abs/2509.00388v1
Date: Sat, 30 Aug 2025 06:56:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.210116
Title: GraphKV: Breaking the Static Selection Paradigm with Graph-Based KV Cache Eviction
Title（参考訳）: GraphKV: グラフベースのKVキャッシュによる静的選択パラダイムを破る
Authors: Xuelin Li, Xiangqi Jin, Linfeng Zhang,
Abstract要約: GraphKVは、KVキャッシュ圧縮のためのトークン選択を再定義するグラフベースのフレームワークである。 SnapKV や PyramidKV といった既存の KV キャッシュ消去手法をプラグイン・アンド・プレイ方式でシームレスに利用することができる。
参考スコア（独自算出の注目度）: 9.309829912599367
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Efficient Key-Value (KV) cache management is essential for processing long text sequences in large language models (LLMs), where memory constraints often limit performance. Conventional KV eviction strategies, such as top-k selection based on attention scores, depend on static heuristics that fail to capture the evolving implicit dependencies among tokens during inference. To overcome this, we propose GraphKV, a graph-based framework that redefines token selection for KV cache compression. In GraphKV, tokens are modeled as nodes with importance scores, and edges represent their similarity relationships. Through a decay-signal-propagation mechanism, token importance is dynamically updated by propagating information across the graph, enabling adaptive retention of the most contextually significant tokens. GraphKV can be seamlessly utilized in existing KV cache eviction methods such as SnapKV and PyramidKV in a plug-and-play manner. Codes will be released on Github.
Abstract（参考訳）: キーバリュー(KV)キャッシュ管理は、大きな言語モデル(LLM)で長いテキストシーケンスを処理するのに不可欠である。注意点に基づくトップk選択のような従来のKV排除戦略は、推論中にトークン間の暗黙的依存関係の進化を捉えるのに失敗する静的ヒューリスティックに依存している。そこで我々は,KVキャッシュ圧縮のためのトークン選択を再定義するグラフベースのフレームワークであるGraphKVを提案する。 GraphKVでは、トークンは重要なスコアを持つノードとしてモデル化され、エッジはその類似関係を表す。崩壊信号伝達機構を通じて、トークンの重要度はグラフ全体に情報を伝達することで動的に更新され、最も文脈的に重要なトークンの適応的保持を可能にする。 GraphKVは、SnapKVやPraamidKVといった既存のKVキャッシュ消去手法で、プラグイン・アンド・プレイ方式でシームレスに利用することができる。コードはGithubで公開される。

論文の概要: GraphKV: Breaking the Static Selection Paradigm with Graph-Based KV Cache Eviction

関連論文リスト