Weakly Supervised Change Detection via Knowledge Distillation and
Multiscale Sigmoid Inference
- URL: http://arxiv.org/abs/2403.05796v1
- Date: Sat, 9 Mar 2024 05:01:51 GMT
- Title: Weakly Supervised Change Detection via Knowledge Distillation and
Multiscale Sigmoid Inference
- Authors: Binghao Lu, Caiwen Ding, Jinbo Bi, Dongjin Song
- Abstract summary: We develop a novel weakly supervised change detection technique via Knowledge Distillation and Multiscale Sigmoid Inference (KD-MSI)
Our proposed technique, with its integrated training strategy, significantly outperforms the state-of-the-art.
- Score: 26.095501974608908
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Change detection, which aims to detect spatial changes from a pair of
multi-temporal images due to natural or man-made causes, has been widely
applied in remote sensing, disaster management, urban management, etc. Most
existing change detection approaches, however, are fully supervised and require
labor-intensive pixel-level labels. To address this, we develop a novel weakly
supervised change detection technique via Knowledge Distillation and Multiscale
Sigmoid Inference (KD-MSI) that leverages image-level labels. In our approach,
the Class Activation Maps (CAM) are utilized not only to derive a change
probability map but also to serve as a foundation for the knowledge
distillation process. This is done through a joint training strategy of the
teacher and student networks, enabling the student network to highlight
potential change areas more accurately than teacher network based on
image-level labels. Moreover, we designed a Multiscale Sigmoid Inference (MSI)
module as a post processing step to further refine the change probability map
from the trained student network. Empirical results on three public datasets,
i.e., WHU-CD, DSIFN-CD, and LEVIR-CD, demonstrate that our proposed technique,
with its integrated training strategy, significantly outperforms the
state-of-the-art.
Related papers
- HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images [6.890268321645873]
We propose a progressive foreground-balanced sampling strategy on the basis of not adding change information.
This strategy helps the model accurately learn the features of the changed pixels during the early training process.
We also design a discriminative Siamese network, hierarchical attention network (HANet), which can integrate multiscale features and refine detailed features.
arXiv Detail & Related papers (2024-04-14T08:01:27Z) - Direct Distillation between Different Domains [97.39470334253163]
We propose a new one-stage method dubbed Direct Distillation between Different Domains" (4Ds)
We first design a learnable adapter based on the Fourier transform to separate the domain-invariant knowledge from the domain-specific knowledge.
We then build a fusion-activation mechanism to transfer the valuable domain-invariant knowledge to the student network.
arXiv Detail & Related papers (2024-01-12T02:48:51Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - Towards Hierarchical Regional Transformer-based Multiple Instance
Learning [2.16656895298847]
We propose a Transformer-based multiple instance learning approach that replaces the traditional learned attention mechanism with a regional, Vision Transformer inspired self-attention mechanism.
We present a method that fuses regional patch information to derive slide-level predictions and show how this regional aggregation can be stacked to hierarchically process features on different distance levels.
Our approach is able to significantly improve performance over the baseline on two histopathology datasets and points towards promising directions for further research.
arXiv Detail & Related papers (2023-08-24T08:19:15Z) - Background-Mixed Augmentation for Weakly Supervised Change Detection [18.319961338185458]
Change detection (CD) is to decouple object changes (i.e., object missing or appearing) from background changes (i.e., environment variations)
Recent deep learning-based methods develop novel network architectures or optimization strategies with paired-training examples.
We develop a novel weakly supervised training algorithm that only needs image-level labels.
arXiv Detail & Related papers (2022-11-21T14:12:53Z) - Impact of a DCT-driven Loss in Attention-based Knowledge-Distillation
for Scene Recognition [64.29650787243443]
We propose and analyse the use of a 2D frequency transform of the activation maps before transferring them.
This strategy enhances knowledge transferability in tasks such as scene recognition.
We publicly release the training and evaluation framework used along this paper at http://www.vpu.eps.uam.es/publications/DCTBasedKDForSceneRecognition.
arXiv Detail & Related papers (2022-05-04T11:05:18Z) - SATS: Self-Attention Transfer for Continual Semantic Segmentation [50.51525791240729]
continual semantic segmentation suffers from the same catastrophic forgetting issue as in continual classification learning.
This study proposes to transfer a new type of information relevant to knowledge, i.e. the relationships between elements within each image.
The relationship information can be effectively obtained from the self-attention maps in a Transformer-style segmentation model.
arXiv Detail & Related papers (2022-03-15T06:09:28Z) - Activation to Saliency: Forming High-Quality Labels for Unsupervised
Salient Object Detection [54.92703325989853]
We propose a two-stage Activation-to-Saliency (A2S) framework that effectively generates high-quality saliency cues.
No human annotations are involved in our framework during the whole training process.
Our framework reports significant performance compared with existing USOD methods.
arXiv Detail & Related papers (2021-12-07T11:54:06Z) - Unsupervised Change Detection in Satellite Images with Generative
Adversarial Network [20.81970476609318]
We propose a novel change detection framework utilizing a special neural network architecture -- Generative Adversarial Network (GAN) to generate better coregistered images.
The optimized GAN model would produce better coregistered images where changes can be easily spotted and then the change map can be presented through a comparison strategy.
arXiv Detail & Related papers (2020-09-08T10:26:04Z) - Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision
Action Recognition [131.6328804788164]
We propose a framework, named Semantics-aware Adaptive Knowledge Distillation Networks (SAKDN), to enhance action recognition in vision-sensor modality (videos)
The SAKDN uses multiple wearable-sensors as teacher modalities and uses RGB videos as student modality.
arXiv Detail & Related papers (2020-09-01T03:38:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.