Deep Fusion of Local and Non-Local Features for Precision Landslide
Recognition
- URL: http://arxiv.org/abs/2002.08547v1
- Date: Thu, 20 Feb 2020 03:18:59 GMT
- Title: Deep Fusion of Local and Non-Local Features for Precision Landslide
Recognition
- Authors: Qing Zhu, Lin Chen, Han Hu, Binzhi Xu, Yeting Zhang, Haifeng Li
- Abstract summary: This paper proposes an effective approach to fuse both local and non-local features to surmount the contextual problem.
Built upon the U-Net architecture that is widely adopted in the remote sensing community, we utilize two additional modules.
Experimental evaluations revealed that the proposed method outperformed state-of-the-art general-purpose semantic segmentation approaches.
- Score: 17.896249114628336
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Precision mapping of landslide inventory is crucial for hazard mitigation.
Most landslides generally co-exist with other confusing geological features,
and the presence of such areas can only be inferred unambiguously at a large
scale. In addition, local information is also important for the preservation of
object boundaries. Aiming to solve this problem, this paper proposes an
effective approach to fuse both local and non-local features to surmount the
contextual problem. Built upon the U-Net architecture that is widely adopted in
the remote sensing community, we utilize two additional modules. The first one
uses dilated convolution and the corresponding atrous spatial pyramid pooling,
which enlarged the receptive field without sacrificing spatial resolution or
increasing memory usage. The second uses a scale attention mechanism to guide
the up-sampling of features from the coarse level by a learned weight map. In
implementation, the computational overhead against the original U-Net was only
a few convolutional layers. Experimental evaluations revealed that the proposed
method outperformed state-of-the-art general-purpose semantic segmentation
approaches. Furthermore, ablation studies have shown that the two models
afforded extensive enhancements in landslide-recognition performance.
Related papers
- Background Activation Suppression for Weakly Supervised Object
Localization and Semantic Segmentation [84.62067728093358]
Weakly supervised object localization and semantic segmentation aim to localize objects using only image-level labels.
New paradigm has emerged by generating a foreground prediction map to achieve pixel-level localization.
This paper presents two astonishing experimental observations on the object localization learning process.
arXiv Detail & Related papers (2023-09-22T15:44:10Z) - DETR Doesn't Need Multi-Scale or Locality Design [69.56292005230185]
This paper presents an improved DETR detector that maintains a "plain" nature.
It uses a single-scale feature map and global cross-attention calculations without specific locality constraints.
We show that two simple technologies are surprisingly effective within a plain design to compensate for the lack of multi-scale feature maps and locality constraints.
arXiv Detail & Related papers (2023-08-03T17:59:04Z) - Towards Generalizable Deepfake Detection by Primary Region
Regularization [52.41801719896089]
This paper enhances the generalization capability from a novel regularization perspective.
Our method consists of two stages, namely the static localization for primary region maps, and the dynamic exploitation of primary region masks.
We conduct extensive experiments over three widely used deepfake datasets - DFDC, DF-1.0, and Celeb-DF with five backbones.
arXiv Detail & Related papers (2023-07-24T05:43:34Z) - ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient
Self-Supervised Monocular Depth Estimation [6.923035780685481]
We propose an efficient local adaptive attention method for geometric aware representation enhancement.
We leverage geometric cues from semantic information to learn local adaptive bounding boxes to guide unsupervised feature aggregation.
Our proposed method establishes a new state-of-the-art in self-supervised monocular depth estimation task.
arXiv Detail & Related papers (2022-12-12T06:38:35Z) - Learning Second Order Local Anomaly for General Face Forgery Detection [7.6896977871741985]
We propose a novel method to improve the generalization ability of CNN-based face forgery detectors.
Specifically, we propose a weakly supervised Second Order Local Anomaly (SOLA) learning module to mine anomalies in local regions.
We also propose a Local Enhancement Module (LEM) to improve the discrimination between local features of real and forged regions.
arXiv Detail & Related papers (2022-09-30T14:27:11Z) - Deep Multi-Scale U-Net Architecture and Noise-Robust Training Strategies
for Histopathological Image Segmentation [6.236433671063744]
We propose to explicitly add multi-scale feature maps in each convolutional module of the U-Net encoder to improve segmentation of histology images.
In experiments on a private dataset of breast cancer lymph nodes, we observed substantial improvement over a U-Net baseline based on the two proposed augmentations.
arXiv Detail & Related papers (2022-05-03T21:00:44Z) - Detection of Deepfake Videos Using Long Distance Attention [73.6659488380372]
Most existing detection methods treat the problem as a vanilla binary classification problem.
In this paper, the problem is treated as a special fine-grained classification problem since the differences between fake and real faces are very subtle.
A spatial-temporal model is proposed which has two components for capturing spatial and temporal forgery traces in global perspective.
arXiv Detail & Related papers (2021-06-24T08:33:32Z) - Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation [62.29076080124199]
This paper proposes a novel coarse-to-fine feature adaptation approach to cross-domain object detection.
At the coarse-grained stage, foreground regions are extracted by adopting the attention mechanism, and aligned according to their marginal distributions.
At the fine-grained stage, we conduct conditional distribution alignment of foregrounds by minimizing the distance of global prototypes with the same category but from different domains.
arXiv Detail & Related papers (2020-03-23T13:40:06Z) - ASLFeat: Learning Local Features of Accurate Shape and Localization [42.70030492742363]
We present ASLFeat, with three light-weight yet effective modifications to mitigate above issues.
First, we resort to deformable convolutional networks to densely estimate and apply local transformation.
Second, we take advantage of the inherent feature hierarchy to restore spatial resolution and low-level details for accurate keypoint localization.
arXiv Detail & Related papers (2020-03-23T04:03:03Z) - Cross-layer Feature Pyramid Network for Salient Object Detection [102.20031050972429]
We propose a novel Cross-layer Feature Pyramid Network to improve the progressive fusion in salient object detection.
The distributed features per layer own both semantics and salient details from all other layers simultaneously, and suffer reduced loss of important information.
arXiv Detail & Related papers (2020-02-25T14:06:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.