Towards Adversarially Robust and Domain Generalizable Stereo Matching by
Rethinking DNN Feature Backbones
- URL: http://arxiv.org/abs/2108.00335v1
- Date: Sat, 31 Jul 2021 22:44:18 GMT
- Title: Towards Adversarially Robust and Domain Generalizable Stereo Matching by
Rethinking DNN Feature Backbones
- Authors: Kelvin Cheng, Christopher Healey, Tianfu Wu
- Abstract summary: This paper shows that a type of weak white-box attacks can fail state-of-the-art methods.
The proposed method is tested in the SceneFlow dataset and the KITTI2015 benchmark.
It significantly improves the adversarial robustness, while retaining accuracy performance comparable to state-of-the-art methods.
- Score: 14.569829985753346
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Stereo matching has recently witnessed remarkable progress using Deep Neural
Networks (DNNs). But, how robust are they? Although it has been well-known that
DNNs often suffer from adversarial vulnerability with a catastrophic drop in
performance, the situation is even worse in stereo matching. This paper first
shows that a type of weak white-box attacks can fail state-of-the-art methods.
The attack is learned by a proposed stereo-constrained projected gradient
descent (PGD) method in stereo matching. This observation raises serious
concerns for the deployment of DNN-based stereo matching. Parallel to the
adversarial vulnerability, DNN-based stereo matching is typically trained under
the so-called simulation to reality pipeline, and thus domain generalizability
is an important problem. This paper proposes to rethink the learnable DNN-based
feature backbone towards adversarially-robust and domain generalizable stereo
matching, either by completely removing it or by applying it only to the left
reference image. It computes the matching cost volume using the classic
multi-scale census transform (i.e., local binary pattern) of the raw input
stereo images, followed by a stacked Hourglass head sub-network solving the
matching problem. In experiments, the proposed method is tested in the
SceneFlow dataset and the KITTI2015 benchmark. It significantly improves the
adversarial robustness, while retaining accuracy performance comparable to
state-of-the-art methods. It also shows better generalizability from simulation
(SceneFlow) to real (KITTI) datasets when no fine-tuning is used.
Related papers
- VQUNet: Vector Quantization U-Net for Defending Adversarial Atacks by Regularizing Unwanted Noise [0.5755004576310334]
We introduce a novel noise-reduction procedure, Vector Quantization U-Net (VQUNet), to reduce adversarial noise and reconstruct data with high fidelity.
VQUNet features a discrete latent representation learning through a multi-scale hierarchical structure for both noise reduction and data reconstruction.
It outperforms other state-of-the-art noise-reduction-based defense methods under various adversarial attacks for both Fashion-MNIST and CIFAR10 datasets.
arXiv Detail & Related papers (2024-06-05T10:10:03Z) - A Geometrical Approach to Evaluate the Adversarial Robustness of Deep
Neural Networks [52.09243852066406]
Adversarial Converging Time Score (ACTS) measures the converging time as an adversarial robustness metric.
We validate the effectiveness and generalization of the proposed ACTS metric against different adversarial attacks on the large-scale ImageNet dataset.
arXiv Detail & Related papers (2023-10-10T09:39:38Z) - Evaluating Similitude and Robustness of Deep Image Denoising Models via
Adversarial Attack [60.40356882897116]
Deep neural networks (DNNs) have shown superior performance compared to traditional image denoising algorithms.
In this paper, we propose an adversarial attack method named denoising-PGD which can successfully attack all the current deep denoising models.
arXiv Detail & Related papers (2023-06-28T09:30:59Z) - Decoupled Mixup for Generalized Visual Recognition [71.13734761715472]
We propose a novel "Decoupled-Mixup" method to train CNN models for visual recognition.
Our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combines these regions to train CNN models.
Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts.
arXiv Detail & Related papers (2022-10-26T15:21:39Z) - Shuffled Patch-Wise Supervision for Presentation Attack Detection [12.031796234206135]
Face anti-spoofing is essential to prevent false facial verification by using a photo, video, mask, or a different substitute for an authorized person's face.
Most presentation attack detection systems suffer from overfitting, where they achieve near-perfect scores on a single dataset but fail on a different dataset with more realistic data.
We propose a new PAD approach, which combines pixel-wise binary supervision with patch-based CNN.
arXiv Detail & Related papers (2021-09-08T08:14:13Z) - Generalizing RNN-Transducer to Out-Domain Audio via Sparse
Self-Attention Layers [7.025709586759655]
Recurrent neural network transducers (RNN-T) are a promising end-to-end speech recognition framework.
The Conformer can effectively model the local-global context information via its convolution and self-attention layers.
The domain mismatch problem for Conformer RNN-T has not been intensively investigated yet.
arXiv Detail & Related papers (2021-08-22T08:06:15Z) - Discriminator-Free Generative Adversarial Attack [87.71852388383242]
Agenerative-based adversarial attacks can get rid of this limitation.
ASymmetric Saliency-based Auto-Encoder (SSAE) generates the perturbations.
The adversarial examples generated by SSAE not only make thewidely-used models collapse, but also achieves good visual quality.
arXiv Detail & Related papers (2021-07-20T01:55:21Z) - Detect and Defense Against Adversarial Examples in Deep Learning using
Natural Scene Statistics and Adaptive Denoising [12.378017309516965]
We propose a framework for defending DNN against ad-versarial samples.
The detector aims to detect AEs bycharacterizing them through the use of natural scenestatistic.
The proposed method outperforms the state-of-the-art defense techniques.
arXiv Detail & Related papers (2021-07-12T23:45:44Z) - Learning Stereo Matchability in Disparity Regression Networks [40.08209864470944]
This paper proposes a stereo matching network that considers pixel-wise matchability.
The proposed deep stereo matchability (DSM) framework can improve the matching result or accelerate the computation while still guaranteeing the quality.
arXiv Detail & Related papers (2020-08-11T15:55:49Z) - AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching [50.06646151004375]
A novel domain-adaptive pipeline called AdaStereo aims to align multi-level representations for deep stereo matching networks.
Our AdaStereo models achieve state-of-the-art cross-domain performance on multiple stereo benchmarks, including KITTI, Middlebury, ETH3D, and DrivingStereo.
arXiv Detail & Related papers (2020-04-09T16:15:13Z) - Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance
Disparity Estimation [51.17232267143098]
We propose a novel system named Disp R-CNN for 3D object detection from stereo images.
We use a statistical shape model to generate dense disparity pseudo-ground-truth without the need of LiDAR point clouds.
Experiments on the KITTI dataset show that, even when LiDAR ground-truth is not available at training time, Disp R-CNN achieves competitive performance and outperforms previous state-of-the-art methods by 20% in terms of average precision.
arXiv Detail & Related papers (2020-04-07T17:48:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.