Toward distortion-aware change detection in realistic scenarios
- URL: http://arxiv.org/abs/2401.05157v1
- Date: Wed, 10 Jan 2024 13:43:06 GMT
- Title: Toward distortion-aware change detection in realistic scenarios
- Authors: Yitao Zhao, Heng-Chao Li, Nanqing Liu, Rui Wang
- Abstract summary: We propose a reusable self-supervised framework for bitemporal geometric distortion in CD tasks.
The framework is composed of Pretext Representation Pre-training, Bitemporal Image Alignment, and Down-stream Decoder Fine-Tuning.
With only single-stage pre-training, the key components of the framework can be reused for assistance in the bitemporal image alignment.
- Score: 16.192695430837443
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the conventional change detection (CD) pipeline, two manually registered
and labeled remote sensing datasets serve as the input of the model for
training and prediction. However, in realistic scenarios, data from different
periods or sensors could fail to be aligned as a result of various coordinate
systems. Geometric distortion caused by coordinate shifting remains a thorny
issue for CD algorithms. In this paper, we propose a reusable self-supervised
framework for bitemporal geometric distortion in CD tasks. The whole framework
is composed of Pretext Representation Pre-training, Bitemporal Image Alignment,
and Down-stream Decoder Fine-Tuning. With only single-stage pre-training, the
key components of the framework can be reused for assistance in the bitemporal
image alignment, while simultaneously enhancing the performance of the CD
decoder. Experimental results in 2 large-scale realistic scenarios demonstrate
that our proposed method can alleviate the bitemporal geometric distortion in
CD tasks.
Related papers
- EfficientCD: A New Strategy For Change Detection Based With Bi-temporal Layers Exchanged [3.3885253104046993]
We propose a novel deep learning framework named EfficientCD for remote sensing image change detection.
The framework employs EfficientNet as its backbone network for feature extraction.
The EfficientCD has been experimentally validated on four remote sensing datasets.
arXiv Detail & Related papers (2024-07-22T19:11:50Z) - UCDFormer: Unsupervised Change Detection Using a Transformer-driven
Image Translation [20.131754484570454]
Change detection (CD) by comparing two bi-temporal images is a crucial task in remote sensing.
We propose a change detection with domain shift setting for remote sensing images.
We present a novel unsupervised CD method using a light-weight transformer, called UCDFormer.
arXiv Detail & Related papers (2023-08-02T13:39:08Z) - Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image
Restoration in Under-Display Camera [84.41316720913785]
We revisit the classic stereo setup for training data collection -- capturing two images of the same scene with one UDC and one standard camera.
The key idea is to "copy" details from a high-quality reference image and "paste" them on the UDC image.
A novel Transformer-based framework generates well-aligned yet high-quality target data for the corresponding UDC input.
arXiv Detail & Related papers (2023-04-12T17:56:42Z) - Multi-scale Fusion Fault Diagnosis Method Based on Two-Dimensionaliztion
Sequence in Complex Scenarios [0.0]
Rolling bearings are critical components in rotating machinery, and their faults can cause severe damage.
Early detection of abnormalities is crucial to prevent catastrophic accidents.
Traditional and intelligent methods have been used to analyze time series data, but in real-life scenarios, sensor data is often noisy and cannot be accurately characterized in the time domain.
This paper proposes an improved convolutional neural network method with a multi-scale feature fusion model and deep learning compression techniques for deployment in industrial scenarios.
arXiv Detail & Related papers (2023-04-11T13:05:50Z) - Deep Metric Learning for Unsupervised Remote Sensing Change Detection [60.89777029184023]
Remote Sensing Change Detection (RS-CD) aims to detect relevant changes from Multi-Temporal Remote Sensing Images (MT-RSIs)
The performance of existing RS-CD methods is attributed to training on large annotated datasets.
This paper proposes an unsupervised CD method based on deep metric learning that can deal with both of these issues.
arXiv Detail & Related papers (2023-03-16T17:52:45Z) - Multitask AET with Orthogonal Tangent Regularity for Dark Object
Detection [84.52197307286681]
We propose a novel multitask auto encoding transformation (MAET) model to enhance object detection in a dark environment.
In a self-supervision manner, the MAET learns the intrinsic visual structure by encoding and decoding the realistic illumination-degrading transformation.
We have achieved the state-of-the-art performance using synthetic and real-world datasets.
arXiv Detail & Related papers (2022-05-06T16:27:14Z) - Revisiting Consistency Regularization for Semi-supervised Change
Detection in Remote Sensing Images [60.89777029184023]
We propose a semi-supervised CD model in which we formulate an unsupervised CD loss in addition to the supervised Cross-Entropy (CE) loss.
Experiments conducted on two publicly available CD datasets show that the proposed semi-supervised CD method can reach closer to the performance of supervised CD.
arXiv Detail & Related papers (2022-04-18T17:59:01Z) - Test-time Adaptation with Slot-Centric Models [63.981055778098444]
Slot-TTA is a semi-supervised scene decomposition model that at test time is adapted per scene through gradient descent on reconstruction or cross-view synthesis objectives.
We show substantial out-of-distribution performance improvements against state-of-the-art supervised feed-forward detectors, and alternative test-time adaptation methods.
arXiv Detail & Related papers (2022-03-21T17:59:50Z) - Bringing Rolling Shutter Images Alive with Dual Reversed Distortion [75.78003680510193]
Rolling shutter (RS) distortion can be interpreted as the result of picking a row of pixels from instant global shutter (GS) frames over time.
We develop a novel end-to-end model, IFED, to generate dual optical flow sequence through iterative learning of the velocity field during the RS time.
arXiv Detail & Related papers (2022-03-12T14:57:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.