Fugu-MT 論文翻訳(概要): Multimodal Negative Learning

論文の概要: Multimodal Negative Learning

arxiv url: http://arxiv.org/abs/2510.20877v1
Date: Thu, 23 Oct 2025 11:47:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 09:00:15.280985
Title: Multimodal Negative Learning
Title（参考訳）: マルチモーダルネガティブラーニング
Authors: Baoquan Gong, Xiyuan Gao, Pengfei Zhu, Qinghua Hu, Bing Cao,
Abstract要約: 我々は新しい学習パラダイム"学習すべきでない"(Negative Learning)を提案する。弱いモダリティのターゲットクラス予測を強化する代わりに、支配的なモダリティは弱いモダリティを動的に導き、非ターゲットクラスを抑える。これは決定空間を安定化させ、モダリティ固有の情報を保存する。
参考スコア（独自算出の注目度）: 55.67017420486548
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multimodal learning systems often encounter challenges related to modality imbalance, where a dominant modality may overshadow others, thereby hindering the learning of weak modalities. Conventional approaches often force weak modalities to align with dominant ones in "Learning to be (the same)" (Positive Learning), which risks suppressing the unique information inherent in the weak modalities. To address this challenge, we offer a new learning paradigm: "Learning Not to be" (Negative Learning). Instead of enhancing weak modalities' target-class predictions, the dominant modalities dynamically guide the weak modality to suppress non-target classes. This stabilizes the decision space and preserves modality-specific information, allowing weak modalities to preserve unique information without being over-aligned. We proceed to reveal multimodal learning from a robustness perspective and theoretically derive the Multimodal Negative Learning (MNL) framework, which introduces a dynamic guidance mechanism tailored for negative learning. Our method provably tightens the robustness lower bound of multimodal learning by increasing the Unimodal Confidence Margin (UCoM) and reduces the empirical error of weak modalities, particularly under noisy and imbalanced scenarios. Extensive experiments across multiple benchmarks demonstrate the effectiveness and generalizability of our approach against competing methods. The code will be available at https://github.com/BaoquanGong/Multimodal-Negative-Learning.git.
Abstract（参考訳）: マルチモーダル学習システムは、しばしばモダリティの不均衡に関連する問題に遭遇し、支配的なモダリティが他のモダリティを覆す可能性があるため、弱いモダリティの学習を妨げる。従来のアプローチでは、弱いモダリティは、弱いモダリティに固有の独特な情報を抑圧するリスクを負う「(同じことを学ぶこと」 (Positive Learning) において、支配的なモダリティと整合せざるを得ない場合が多い。この課題に対処するため、私たちは"Learning Not to Be"(否定的学習)という新しい学習パラダイムを提供しています。弱いモダリティのターゲットクラス予測を強化する代わりに、支配的なモダリティは弱いモダリティを動的に導き、非ターゲットクラスを抑える。これにより、決定空間を安定化し、モダリティ固有の情報を保存し、弱いモダリティがオーバーアライメントされることなくユニークな情報を保存できる。我々は、ロバストネスの観点からマルチモーダル学習を明らかにし、理論的には、ネガティブ学習に適した動的誘導機構を導入するMNL(Multimodal Negative Learning)フレームワークを導出する。本手法は,Unimodal Confidence Margin (UCoM) を増大させることにより,マルチモーダル学習のロバスト性低下を確実に抑制し,特に雑音や不均衡シナリオ下での弱いモーダル性の実証誤差を低減する。複数のベンチマークにまたがる大規模な実験は、競合する手法に対するアプローチの有効性と一般化性を実証している。コードはhttps://github.com/BaoquanGong/Multimodal-Negative-Learning.gitで入手できる。

論文の概要: Multimodal Negative Learning

関連論文リスト