Fugu-MT 論文翻訳(概要): Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning

論文の概要: Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning

arxiv url: http://arxiv.org/abs/2603.17474v1
Date: Wed, 18 Mar 2026 08:28:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-19 18:32:57.580779
Title: Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning
Title（参考訳）: クロスアテンションメカニズムの再考:ドメイン適応学習における便益的ノイズの活用
Authors: Zelin Zang, Yehui Yang, Fei Wang, Liangyu Li, Baigui Sun,
Abstract要約: Unsupervised Domain Adaptation (UDA) は、ラベル付きソースドメインからラベル付きターゲットドメインへの知識の転送を試みる。既存のクロスアテンションベースのトランスフォーマーは、ドメイン間で機能を整列させることができるが、大きな外観とスケールのバリエーションの下でコンテンツセマンティクスを保存するのに苦労している。本稿では、制御された摂動を注入することで、相互注意を規則化する有益雑音の概念を導入する。
参考スコア（独自算出の注目度）: 19.18410733239502
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Unsupervised Domain Adaptation (UDA) seeks to transfer knowledge from a labeled source domain to an unlabeled target domain but often suffers from severe domain and scale gaps that degrade performance. Existing cross-attention-based transformers can align features across domains, yet they struggle to preserve content semantics under large appearance and scale variations. To explicitly address these challenges, we introduce the concept of beneficial noise, which regularizes cross-attention by injecting controlled perturbations, encouraging the model to ignore style distractions and focus on content. We propose the Domain-Adaptive Cross-Scale Matching (DACSM) framework, which consists of a Domain-Adaptive Transformer (DAT) for disentangling domain-shared content from domain-specific style, and a Cross-Scale Matching (CSM) module that adaptively aligns features across multiple resolutions. DAT incorporates beneficial noise into cross-attention, enabling progressive domain translation with enhanced robustness, yielding content-consistent and style-invariant representations. Meanwhile, CSM ensures semantic consistency under scale changes. Extensive experiments on VisDA-2017, Office-Home, and DomainNet demonstrate that DACSM achieves state-of-the-art performance, with up to +2.3% improvement over CDTrans on VisDA-2017. Notably, DACSM achieves a +5.9% gain on the challenging "truck" class of VisDA, evidencing the strength of beneficial noise in handling scale discrepancies. These results highlight the effectiveness of combining domain translation, beneficial-noise-enhanced attention, and scale-aware alignment for robust cross-domain representation learning.
Abstract（参考訳）: 教師なしドメイン適応(Unsupervised Domain Adaptation, UDA)は、ラベル付きソースドメインからラベルなしターゲットドメインに知識を転送しようとするが、しばしばパフォーマンスを低下させる厳しいドメインとスケールギャップに悩まされる。既存のクロスアテンションベースのトランスフォーマーは、ドメイン間で機能を整列させることができるが、大きな外観とスケールのバリエーションの下でコンテンツセマンティクスを保存するのに苦労している。これらの課題に明確に対処するために、我々は、制御された摂動を注入することで、相互注意を規則化し、スタイルの邪魔を無視し、コンテンツに集中するようにモデルに促す、有益ノイズの概念を導入する。ドメイン固有のスタイルからドメイン共有コンテンツを切り離すためのドメイン適応型変換器(DAT)と、複数の解像度で機能を適応的に整列するクロススケールマッチング(CSM)モジュールからなる、ドメイン適応型クロススケールマッチング(DACSM)フレームワークを提案する。 DATは有益ノイズをクロスアテンションに取り入れ、拡張された堅牢性を持つプログレッシブドメイン翻訳を可能にし、コンテンツ一貫性とスタイル不変表現をもたらす。一方、CSMはスケール変更時のセマンティック一貫性を保証する。 VisDA-2017、Office-Home、DomainNetの大規模な実験は、DACSMが最先端のパフォーマンスを達成し、VisDA-2017上のCDTransよりも最大で2.3%改善していることを示している。特に、DACSMはVisDAの挑戦的な「トラック」クラスで+5.9%上昇し、スケールの相違に対処する上で有益なノイズの強さを証明している。これらの結果は、堅牢なクロスドメイン表現学習のためのドメイン翻訳、有益ノイズ強調、スケールアライメントの併用の有効性を強調した。

論文の概要: Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning

関連論文リスト