Fugu-MT 論文翻訳(概要): RD-UIE: Relation-Driven State Space Modeling for Underwater Image Enhancement

論文の概要: RD-UIE: Relation-Driven State Space Modeling for Underwater Image Enhancement

arxiv url: http://arxiv.org/abs/2505.01224v1
Date: Fri, 02 May 2025 12:21:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-05 17:21:20.016194
Title: RD-UIE: Relation-Driven State Space Modeling for Underwater Image Enhancement
Title（参考訳）: RD-UIE:水中画像強調のための関係駆動状態空間モデリング
Authors: Kui Jiang, Yan Luo, Junjun Jiang, Xin Xu, Fei Ma, Fei Yu,
Abstract要約: 水中画像強調(UIE)は、海洋視覚応用のための重要な前処理ステップである。実効UIE(RD-UIE)のための新しい関係駆動型マンバフレームワークを開発した。水中強化ベンチマークの実験では、RD-UIEは最先端のアプローチであるWMambaよりも優れていた。
参考スコア（独自算出の注目度）: 59.364418120895
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Underwater image enhancement (UIE) is a critical preprocessing step for marine vision applications, where wavelength-dependent attenuation causes severe content degradation and color distortion. While recent state space models like Mamba show potential for long-range dependency modeling, their unfolding operations and fixed scan paths on 1D sequences fail to adapt to local object semantics and global relation modeling, limiting their efficacy in complex underwater environments. To address this, we enhance conventional Mamba with the sorting-based scanning mechanism that dynamically reorders scanning sequences based on statistical distribution of spatial correlation of all pixels. In this way, it encourages the network to prioritize the most informative components--structural and semantic features. Upon building this mechanism, we devise a Visually Self-adaptive State Block (VSSB) that harmonizes dynamic sorting of Mamba with input-dependent dynamic convolution, enabling coherent integration of global context and local relational cues. This exquisite design helps eliminate global focus bias, especially for widely distributed contents, which greatly weakens the statistical frequency. For robust feature extraction and refinement, we design a cross-feature bridge (CFB) to adaptively fuse multi-scale representations. These efforts compose the novel relation-driven Mamba framework for effective UIE (RD-UIE). Extensive experiments on underwater enhancement benchmarks demonstrate RD-UIE outperforms the state-of-the-art approach WMamba in both quantitative metrics and visual fidelity, averagely achieving 0.55 dB performance gain on the three benchmarks. Our code is available at https://github.com/kkoucy/RD-UIE/tree/main
Abstract（参考訳）: 水中画像強調(UIE)は、波長依存性の減衰が厳しい内容劣化と色歪みを引き起こす海洋視覚アプリケーションにとって重要な前処理ステップである。マンバのような最近の状態空間モデルは長距離依存モデリングの可能性を示しているが、その展開操作と1次元シーケンス上の固定されたスキャンパスは局所的なオブジェクトセマンティクスやグローバルな関係モデリングに適応できず、複雑な水中環境での有効性を制限している。そこで本研究では,全画素の空間相関の統計的分布に基づいて,スキャンシーケンスを動的に並べ替えるソートに基づくスキャン機構により,従来のマンバを改良する。このようにして、ネットワークは最も有意義なコンポーネントである構造的特徴と意味的特徴の優先順位付けを推奨する。このメカニズムを構築する際に、入力依存の動的畳み込みとMambaの動的ソートを調和させる視覚的自己適応状態ブロック(VSSB)を考案し、グローバルコンテキストとローカルリレーショナルキューのコヒーレントな統合を可能にする。この厳密な設計は、特に広く分散されたコンテンツにおいて、グローバルな焦点バイアスを排除し、統計周波数を著しく弱めるのに役立つ。頑健な特徴抽出と改善のために,多スケール表現を適応的に融合するクロス機能ブリッジ (CFB) を設計する。これらの取り組みは、効果的なUIE(RD-UIE)のための新しい関係駆動型Mambaフレームワークを構成する。水中強化ベンチマークに関する大規模な実験は、RD-UIEが3つのベンチマークで平均0.55dBの性能向上を達成し、測定値と視力の両方において最先端のアプローチであるWMambaよりも優れていることを示した。私たちのコードはhttps://github.com/kkoucy/RD-UIE/tree/mainで利用可能です。

論文の概要: RD-UIE: Relation-Driven State Space Modeling for Underwater Image Enhancement

関連論文リスト