Fugu-MT 論文翻訳(概要): SP-MoMamba: Superpixel-driven Mixture of State Space Experts for Efficient Image Super-Resolution

論文の概要: SP-MoMamba: Superpixel-driven Mixture of State Space Experts for Efficient Image Super-Resolution

arxiv url: http://arxiv.org/abs/2605.25892v1
Date: Mon, 25 May 2026 14:19:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:20.327194
Title: SP-MoMamba: Superpixel-driven Mixture of State Space Experts for Efficient Image Super-Resolution
Title（参考訳）: SP-MoMamba:高効率超解像のための超画素駆動による状態空間エキスパートの混合
Authors: Wenbin Zou, Yawen Cui, Yi Wang, Lap-Pui Chau, Liang Chen, Jinshan Pan, Huiping Zhuang, Guanbin Li,
Abstract要約: 状態空間モデル(SSM)は、効率的な単一画像超解像(SR)のための強力なパラダイムとして登場した。我々は、コンテント対応SRのための状態空間の専門家による超ピクセル駆動の混合である textbfSP-MoMamba を提案する。私たちの中核となる考え方は、従来の剛性スキャンを、スーパーピクセルを基本単位として扱うことによって、テキストのセマンティックなレベルのインタラクションに変換することです。
参考スコア（独自算出の注目度）: 99.56535031145211
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: State space models (SSMs) have emerged as a powerful paradigm for efficient single-image super-resolution (SR) due to their linear complexity and long-range modeling capabilities. However, existing Mamba-based methods typically rely on data-agnostic rigid scanning, which reshapes 2D images into 1D sequences over a fixed grid, inevitably disrupting spatial-semantic topology and introducing artifacts. Inspired by the \textbf{Gestalt perceptual grouping theory}, we propose \textbf{SP-MoMamba}, a superpixel-driven mixture of state space experts designed for content-aware SR. Our core idea is to transform the traditional rigid scanning into a \textbf{semantic-level interaction} by treating superpixels as fundamental units. Specifically, we introduce the \textbf{Superpixel-driven State Space Model (SP-SSM)}, which compresses semantically homogeneous regions into high-order tokens to preserve global topological consistency. To address the conflict between fixed scanning scales and diverse semantic granularities, we develop the \textbf{Multi-Scale Superpixel Mixture of State Space Experts (MSS-MoE)}. This module utilizes a dynamic routing mechanism to adaptively assign scale-specific experts, effectively capturing multi-scale textures while reducing computational redundancy. Furthermore, to prevent the loss of high-frequency details during global abstraction, we introduce a \textbf{Local Spatial Modulation Expert (LSME)} to complement the global modeling, ensuring a precise reconstruction of sharp edges and fine structures. Extensive experiments on standard benchmarks demonstrate that SP-MoMamba achieves superior reconstruction fidelity and a more favorable efficiency-performance trade-off compared to state-of-the-art efficient SR methods.
Abstract（参考訳）: 状態空間モデル(SSM)は、線形複雑性と長距離モデリング能力のため、効率的な単一画像超解像(SR)のための強力なパラダイムとして登場した。しかし、既存のMambaベースの手法は通常、データに依存しない剛性スキャンに依存しており、2D画像を固定格子上の1Dシーケンスに再結合し、必然的に空間意味的トポロジーを乱し、アーティファクトを導入する。感性グルーピング理論(英語版)に着想を得て、コンテント対応SRのために設計された状態空間の専門家による超ピクセル駆動の混合である \textbf{SP-MoMamba} を提案する。我々の中心となる考え方は、スーパーピクセルを基本単位として扱うことによって、従来の剛性走査を \textbf{semantic-level interaction} に変換することである。具体的には,大域的トポロジ的整合性を維持するために,意味的に同質な領域を高次トークンに圧縮する「textbf{Superpixel-driven State Space Model (SP-SSM)」を導入する。固定走査スケールと多種多様な意味的粒度の相違に対処するため,国家宇宙専門家の超画素混合法(MSS-MoE)}を開発した。このモジュールは動的ルーティング機構を使用して、スケール固有の専門家を適応的に割り当て、計算冗長性を低減しつつ、マルチスケールのテクスチャを効果的にキャプチャする。さらに,グローバル抽象化における高頻度の詳細の喪失を防止するため,グローバルモデリングを補完し,シャープエッジや微細構造を正確に再構築する「textbf{Local Spatial Modulation Expert (LSME)」を導入する。標準ベンチマーク実験により,SP-MoMambaは,最先端のSR手法と比較して,より優れた再構成精度と良好な効率トレードオフを実現することが示された。

論文の概要: SP-MoMamba: Superpixel-driven Mixture of State Space Experts for Efficient Image Super-Resolution

関連論文リスト