A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image
Translation Task
- URL: http://arxiv.org/abs/2206.13042v2
- Date: Tue, 28 Jun 2022 02:35:48 GMT
- Title: A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image
Translation Task
- Authors: Fujian Cheng, Yashu Kang, Chunlei Chen, Kezhao Jiang
- Abstract summary: This report summarizes the analysis and approach on the image-to-image translation task in the Multimodal Learning for Earth and Environment Challenge (MultiEarth 2022)
In terms of strategy optimization, cloud classification is utilized to filter optical images with dense cloud coverage to aid the supervised learning alike approach.
The results indicate great potential towards SAR-to-optical translation in remote sensing tasks, specifically for the support of long-term environmental monitoring and protection.
- Score: 0.9176056742068814
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This technical report summarizes the analysis and approach on the
image-to-image translation task in the Multimodal Learning for Earth and
Environment Challenge (MultiEarth 2022). In terms of strategy optimization,
cloud classification is utilized to filter optical images with dense cloud
coverage to aid the supervised learning alike approach. The commonly used
pix2pix framework with a few optimizations is applied to build the model. A
weighted combination of mean squared error and mean absolute error is
incorporated in the loss function. As for evaluation, peak to signal ratio and
structural similarity were both considered in our preliminary analysis. Lastly,
our method achieved the second place with a final error score of 0.0412. The
results indicate great potential towards SAR-to-optical translation in remote
sensing tasks, specifically for the support of long-term environmental
monitoring and protection.
Related papers
- Corner-to-Center Long-range Context Model for Efficient Learned Image
Compression [70.0411436929495]
In the framework of learned image compression, the context model plays a pivotal role in capturing the dependencies among latent representations.
We propose the textbfCorner-to-Center transformer-based Context Model (C$3$M) designed to enhance context and latent predictions.
In addition, to enlarge the receptive field in the analysis and synthesis transformation, we use the Long-range Crossing Attention Module (LCAM) in the encoder/decoder.
arXiv Detail & Related papers (2023-11-29T21:40:28Z) - PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant
Semantic Segmentation [50.556961575275345]
We propose a perception-aware fusion framework to promote segmentation robustness in adversarial scenes.
We show that our scheme substantially enhances the robustness, with gains of 15.3% mIOU, compared with advanced competitors.
arXiv Detail & Related papers (2023-08-08T01:55:44Z) - Unsupervised Light Field Depth Estimation via Multi-view Feature
Matching with Occlusion Prediction [15.421219881815956]
It is costly to obtain sufficient depth labels for supervised training.
In this paper, we propose an unsupervised framework to estimate depth from LF images.
arXiv Detail & Related papers (2023-01-20T06:11:17Z) - Generative Adversarial Network (GAN) based Image-Deblurring [0.0]
We show the effective of spectral regularization methods, and point out the linking between the spectral filtering result and the solution of the regularization optimization objective.
For ill-posed problems like image deblurring, the optimization objective contains a regularization term that encodes our prior knowledge into the solution.
Based on the idea of Wasserstein generative adversarial models, we can train a CNN to learn the regularization functional.
arXiv Detail & Related papers (2022-08-24T15:46:09Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Provable Stochastic Optimization for Global Contrastive Learning: Small
Batch Does Not Harm Performance [53.49803579981569]
We consider a global objective for contrastive learning, which contrasts each positive pair with all negative pairs for an anchor point.
Existing methods such as SimCLR requires a large batch size in order to achieve a satisfactory result.
We propose a memory-efficient optimization algorithm for solving the Global Contrastive Learning of Representations, named SogCLR.
arXiv Detail & Related papers (2022-02-24T22:16:53Z) - Unpaired Image Super-Resolution with Optimal Transport Maps [128.1189695209663]
Real-world image super-resolution (SR) tasks often do not have paired datasets limiting the application of supervised techniques.
We propose an algorithm for unpaired SR which learns an unbiased OT map for the perceptual transport cost.
Our algorithm provides nearly state-of-the-art performance on the large-scale unpaired AIM-19 dataset.
arXiv Detail & Related papers (2022-02-02T16:21:20Z) - Perceptual Loss for Robust Unsupervised Homography Estimation [1.2891210250935146]
BiHomE minimizes the distance in the feature space between the warped image from the source viewpoint and the corresponding image from the target viewpoint.
We show that biHomE achieves state-of-the-art performance on synthetic COCO dataset, which is also comparable or better compared to supervised approaches.
arXiv Detail & Related papers (2021-04-20T14:41:54Z) - Greenhouse Segmentation on High-Resolution Optical Satellite Imagery
using Deep Learning Techniques [0.0]
This paper proposes a sound methodology for pixel-wise classification on images acquired by the Azersky (SPOT-7) optical satellite.
customized variations of U-Net-like architectures are employed to identify greenhouses.
Two models are proposed which uniquely incorporate dilated convolutions and skip connections.
arXiv Detail & Related papers (2020-07-22T06:12:57Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.