Fugu-MT 論文翻訳(概要): ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

論文の概要: ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

arxiv url: http://arxiv.org/abs/2603.24787v1
Date: Wed, 25 Mar 2026 20:00:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-27 20:52:47.977471
Title: ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing
Title（参考訳）: ReLope:マルチモーダルLCMルーティングのためのKL規則化LoRAプローブ
Authors: Yaopei Zeng, Congchao Wang, Blake JianHang Chen, Lu Lin,
Abstract要約: 本稿では,大規模言語モデル(LLM)システムにおけるプローブルーティングを改善するための2つの補完的アプローチを提案する。まず,アテンションスコアに基づいて先行層から隠れた状態を集約し,分散正当性信号の復元を行うemphAttention Probeを提案する。第二に、emphKL-Regularized LoRA Probe(ReLope)という軽量なLoRAアダプタを挿入し、KL正規化器を適用してルーティング対応表現を学習する。
参考スコア（独自算出の注目度）: 5.106806600400179
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Routing has emerged as a promising strategy for balancing performance and cost in large language model (LLM) systems that combine lightweight models with powerful but expensive large models. Recent studies show that \emph{probe routing}, which predicts the correctness of a small model using its hidden states, provides an effective solution in text-only LLMs. However, we observe that these probes degrade substantially when applied to multimodal LLMs (MLLMs). Through empirical analysis, we find that the presence of visual inputs weakens the separability of correctness signals in hidden states, making them harder to extract using standard probe designs. To address this challenge, we introduce two complementary approaches for improving probe routing in MLLMs. First, we propose the \emph{Attention Probe}, which aggregates hidden states from the preceding layer based on attention scores to recover distributed correctness signals. Second, we present the \emph{KL-Regularized LoRA Probe (ReLope)}, which inserts a lightweight LoRA adapter and applies a KL regularizer to learn routing-aware representations. Comprehensive experiments show that our methods consistently outperform baselines, suggesting that improving the quality of hidden states is key to effective routing in MLLMs. Our code is available at https://github.com/Spinozaaa/ReLope.
Abstract（参考訳）: ルーティングは、軽量モデルと強力だが高価な大規模モデルを組み合わせた大規模言語モデル(LLM)システムにおいて、パフォーマンスとコストのバランスをとるための有望な戦略として登場した。近年の研究では、隠れ状態を用いて小さなモデルの正しさを予測する「emph{probe routing}」が、テキストのみのLCMにおいて有効な解であることが示された。しかし,Multimodal LLM(MLLM)に適用した場合,これらのプローブは著しく劣化する。経験的解析により、視覚入力の存在が隠れ状態における正当性信号の分離性を弱め、標準プローブ設計による抽出が困難になることがわかった。この課題に対処するために、MLLMにおけるプローブルーティングを改善するための2つの補完的アプローチを提案する。まず,アテンションスコアに基づいて先行層から隠れた状態を集約し,分散正当性信号の復元を行う「emph{Attention Probe}」を提案する。次に、軽量なLoRAアダプタを挿入し、KL正規化器を適用してルーティング対応表現を学習する、emph{KL-Regularized LoRA Probe (ReLope)}を提案する。総合的な実験により,本手法はベースラインを一貫して上回り,隠れ状態の品質向上がMLLMの効率的なルーティングの鍵となることが示唆された。私たちのコードはhttps://github.com/Spinozaaa/ReLope.comで公開されています。

論文の概要: ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

関連論文リスト