Fugu-MT 論文翻訳(概要): Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection

論文の概要: Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection

arxiv url: http://arxiv.org/abs/2604.27889v1
Date: Thu, 30 Apr 2026 14:06:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-01 16:31:54.124353
Title: Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection
Title（参考訳）: Noise2Map:セマンティックセグメンテーションと変化検出のためのエンドツーエンド拡散モデル
Authors: Ali Shibli, Andrea Nascetti, Yifang Ban,
Abstract要約: Noise2Mapは、差別学習のための統合拡散ベースのフレームワークである。タスク固有のノイズスケジュールとタイムステップ条件を用いて、セマンティックマップや変更マップを予測する。私たちのアーキテクチャは、共有バックボーンを通じて、タスク(SSとCD)の両方をサポートします。
参考スコア（独自算出の注目度）: 0.9558392439655014
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Semantic segmentation and change detection are two fundamental challenges in remote sensing, requiring models to capture either spatial semantics or temporal differences from satellite imagery. Existing deep learning models often struggle with temporal inconsistencies or in capturing fine-grained spatial structures, require extensive pretraining, and offer limited interpretability - especially in real-world remote sensing scenarios. Recent advances in diffusion models show that Gaussian noise can be systematically leveraged to learn expressive data representations through denoising. Motivated by this, we investigate whether the noise process in diffusion models can be effectively utilized for discriminative tasks. We propose Noise2Map, a unified diffusion-based framework that repurposes the denoising process for fast, end-to-end discriminative learning. Unlike prior work that uses diffusion only for generation or feature extraction, Noise2Map directly predicts semantic or change maps using task-specific noise schedules and timestep conditioning, avoiding the costly sampling procedures of traditional diffusion models. The model is pretrained via self-supervised denoising and fine-tuned with supervision, enabling both interpretability and robustness. Our architecture supports both tasks (SS and CD) through a shared backbone and task-specific noise schedulers. Extensive evaluations on the SpaceNet7, WHU, and xView2 buildings damaged by wildfires datasets demonstrate that Noise2Map ranks on average 1st among seven models on semantic segmentation and 1st on change detection by a cross-dataset rank metric (average F1 primary, IoU tie-break). Ablation studies highlight the robustness of our model against different training noise schedulers and timestep control in the diffusion process, as well as the ability of the model to perform multi-task learning.
Abstract（参考訳）: セマンティックセグメンテーションと変化検出はリモートセンシングにおける2つの基本的な課題であり、衛星画像から空間的意味論または時間的差異を捉える必要がある。既存のディープラーニングモデルは、時間的不整合やきめ細かい空間構造を捉え、広範囲の事前トレーニングを必要とし、特に現実世界のリモートセンシングシナリオにおいて、限定的な解釈性を提供する。拡散モデルの最近の進歩は、ガウスノイズを体系的に利用して表現的データ表現を学ぶことができることを示している。そこで本研究では,拡散モデルの雑音過程を識別的タスクに有効に活用できるかどうかを考察する。本研究では,高速でエンドツーエンドな識別学習のためのデノナイジングプロセスを再利用した統合拡散ベースフレームワークであるNoss2Mapを提案する。生成や特徴抽出にのみ拡散を利用する以前の作業とは異なり、Noss2Mapはタスク固有のノイズスケジュールとタイムステップ条件を使って、意味や変化マップを直接予測し、従来の拡散モデルの高価なサンプリング手順を避ける。このモデルは、自己監督された装飾によって事前訓練され、監督によって微調整され、解釈可能性と堅牢性の両方を可能にする。我々のアーキテクチャは、共有バックボーンとタスク固有のノイズスケジューラを通じて、タスク(SSとCD)の両方をサポートします。 SpaceNet7、WHU、xView2の建物が山火事のデータセットで損傷し、ノイズ2Mapはセマンティックセグメンテーションのモデル7つの中で平均1位、変更検出はクロスデータセットのランク(平均F1プライマリ、IoUタイブレーク)で1位となった。アブレーション研究では、異なるトレーニングノイズスケジューラに対するモデルの堅牢性、拡散過程におけるタイムステップ制御、マルチタスク学習を行うモデルの能力を強調した。

論文の概要: Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection

関連論文リスト