Fugu-MT 論文翻訳(概要): KVComm: Enabling Efficient LLM Communication through Selective KV Sharing

論文の概要: KVComm: Enabling Efficient LLM Communication through Selective KV Sharing

arxiv url: http://arxiv.org/abs/2510.03346v1
Date: Thu, 02 Oct 2025 16:01:54 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-07 16:52:58.917593
Title: KVComm: Enabling Efficient LLM Communication through Selective KV Sharing
Title（参考訳）: KVComm:選択的なKV共有による効率的なLLM通信の実現
Authors: Xiangyu Shi, Marco Chiesa, Gerald Q. Maguire Jr., Dejan Kostic,
Abstract要約: 大規模言語モデル(LLM)は、効果的なモデル間通信が不可欠であるマルチエージェントシステムにおいて、ますます多くデプロイされている。我々は,KVペアの選択的共有により,LLM間の効率的な通信を可能にする新しいフレームワークであるKVCommを提案する。本研究は,KVペアがLLM間通信に有効な媒体となる可能性を強調し,スケーラブルで効率的なマルチエージェントシステムを実現することを目的とした。
参考スコア（独自算出の注目度）: 5.08081184831007
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) are increasingly deployed in multi-agent systems, where effective inter-model communication is crucial. Existing communication protocols either rely on natural language, incurring high inference costs and information loss, or on hidden states, which suffer from information concentration bias and inefficiency. To address these limitations, we propose KVComm, a novel communication framework that enables efficient communication between LLMs through selective sharing of KV pairs. KVComm leverages the rich information encoded in the KV pairs while avoiding the pitfalls of hidden states. We introduce a KV layer-wise selection strategy based on attention importance scores with a Gaussian prior to identify the most informative KV pairs for communication. Extensive experiments across diverse tasks and model pairs demonstrate that KVComm achieves comparable performance to the upper-bound method, which directly merges inputs to one model without any communication, while transmitting as few as 30\% of layers' KV pairs. Our study highlights the potential of KV pairs as an effective medium for inter-LLM communication, paving the way for scalable and efficient multi-agent systems.
Abstract（参考訳）: 大規模言語モデル(LLM)は、効果的なモデル間通信が不可欠であるマルチエージェントシステムにおいて、ますます多くデプロイされている。既存の通信プロトコルは自然言語に依存するか、高い推論コストと情報損失をもたらすか、あるいは情報集中バイアスと非効率に悩まされる隠れた状態に依存する。これらの制約に対処するため,KVペアの選択的共有によるLLM間の効率的な通信を可能にする新しい通信フレームワークであるKVCommを提案する。 KVCommは、隠れた状態の落とし穴を避けながら、KVペアに符号化された豊富な情報を活用する。ガウシアンによる注目度スコアに基づくKV層選択戦略を導入し,コミュニケーションにおいて最も有意義なKV対を特定する。多様なタスクとモデルペアにわたる大規模な実験により、KVCommは上行法に匹敵する性能を達成し、通信なしで直接1つのモデルに入力をマージすると同時に、30倍の層のKVペアを伝達することを示した。本研究は,KVペアがLLM間通信に有効な媒体となる可能性を強調し,スケーラブルで効率的なマルチエージェントシステムを実現することを目的とした。

論文の概要: KVComm: Enabling Efficient LLM Communication through Selective KV Sharing

関連論文リスト