Fugu-MT paper translation (abstract): Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning

Paper overview: Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning

arxiv url: http://arxiv.org/abs/2508.17488v1
Date: Sun, 24 Aug 2025 18:42:47 GMT
Status: translation complete
Last updated in system: 2025-08-26 18:43:45.550523
Title: Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning
Authors: Zhiwen Chen, Jinjian Wu, Zhiyu Zhu, Yifan Zhang, Guangming Shi, Junhui Hou
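Abstract summary: This paper addresses the challenge of optimizing multi-modal trackers by effectively adapting models pre-trained on RGB data. Existing fine-tuning paradigms oscillate between excessive freedom and over-restriction, both of which lead to a suboptimal plasticity-stability trade-off. The authors propose a sensitivity-aware regularized tuning framework that refines the learning process by incorporating intrinsic parameter sensitivities.
Reference score (attention score computed independently by this site): 112.12667472919723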
License: http://creativecommons.org/licenses/by/4.0/
Abstract: This paper tackles the critical challenge of optimizing multi-modal trackers by effectively adapting the pre-trained models for RGB data. Existing fine-tuning paradigms oscillate between excessive freedom and over-restriction, both leading to a suboptimal plasticity-stability trade-off. To mitigate this dilemma, we propose a novel sensitivity-aware regularized tuning framework, which delicately refines the learning process by incorporating intrinsic parameter sensitivities. Through a comprehensive investigation from pre-trained to multi-modal contexts, we identify that parameters sensitive to pivotal foundational patterns and cross-domain shifts are primary drivers of this issue. Specifically, we first analyze the tangent space of pre-trained weights to measure and orient prior sensitivities, dedicated to preserving generalization. Then, we further explore transfer sensitivities during the tuning phase, emphasizing adaptability and stability. By incorporating these sensitivities as regularization terms, our method significantly enhances the transferability across modalities. Extensive experiments showcase the superior performance of the proposed method, surpassing current state-of-the-art techniques across various multi-modal tracking. The source code and models will be publicly available at https://github.com/zhiwen-xdu/SRTrack.
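The abstract states that parameter sensitivities are incorporated as regularization terms but gives no formula. As a rough illustration of the general idea only, the sketch below penalizes drift from pre-trained weights more strongly for parameters with higher estimated sensitivity. The squared-gradient sensitivity proxy, the quadratic penalty, and the names `sensitivity_from_gradients`, `regularized_loss`, and `lam` are all assumptions made for this example; they are not taken from the paper or from the SRTrack code.

```python
# Illustrative sketch only: a generic sensitivity-weighted penalty.
# This is NOT the formulation used in the paper; every name here is hypothetical.


def sensitivity_from_gradients(grad_samples):
    """Estimate per-parameter sensitivity as the mean squared gradient.

    grad_samples: list of gradient vectors (one list of floats per sample).
    The squared-gradient proxy is an assumption chosen for illustration.
    """
    n_samples = len(grad_samples)
    n_params = len(grad_samples[0])
    return [
        sum(g[i] ** 2 for g in grad_samples) / n_samples
        for i in range(n_params)
    ]


def regularized_loss(task_loss, weights, pretrained, sensitivity, lam=1.0):
    """Return the task loss plus a sensitivity-weighted quadratic penalty.

    Parameters with higher sensitivity are anchored more strongly to their
    pre-trained values; low-sensitivity parameters are freer to adapt.
    """
    penalty = sum(
        s * (w - w0) ** 2
        for s, w, w0 in zip(sensitivity, weights, pretrained)
    )
    return task_loss + lam * penalty


# Toy usage: two parameters, two gradient samples.
grads = [[0.1, 2.0], [0.3, 1.0]]
sens = sensitivity_from_gradients(grads)
loss = regularized_loss(
    task_loss=0.5,
    weights=[1.0, 1.0],
    pretrained=[0.0, 0.0],
    sensitivity=sens,
    lam=0.1,
)
print(sens, loss)
```

In this toy setup the second parameter has much larger gradients, so it receives a larger sensitivity and contributes most of the penalty for the same amount of drift.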
