MFPP: Morphological Fragmental Perturbation Pyramid for Black-Box Model
Explanations
- URL: http://arxiv.org/abs/2006.02659v3
- Date: Wed, 14 Oct 2020 06:59:37 GMT
- Title: MFPP: Morphological Fragmental Perturbation Pyramid for Black-Box Model
Explanations
- Authors: Qing Yang, Xia Zhu, Jong-Kae Fwu, Yun Ye, Ganmei You and Yuan Zhu
- Abstract summary: We propose a Morphological Fragmental Perturbation Pyramid (P) method to solve the Explainable AI problem.
In the DNNP method, we divide the input image into multi-scale fragments and randomly mask out fragments as perturbation to generate a saliency map.
Compared with the existing input sampling perturbation method, the pyramid structure fragment has proved to be more effective.
- Score: 7.051974163915314
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Deep neural networks (DNNs) have recently been applied and used in many
advanced and diverse tasks, such as medical diagnosis, automatic driving, etc.
Due to the lack of transparency of the deep models, DNNs are often criticized
for their prediction that cannot be explainable by human. In this paper, we
propose a novel Morphological Fragmental Perturbation Pyramid (MFPP) method to
solve the Explainable AI problem. In particular, we focus on the black-box
scheme, which can identify the input area that is responsible for the output of
the DNN without having to understand the internal architecture of the DNN. In
the MFPP method, we divide the input image into multi-scale fragments and
randomly mask out fragments as perturbation to generate a saliency map, which
indicates the significance of each pixel for the prediction result of the black
box model. Compared with the existing input sampling perturbation method, the
pyramid structure fragment has proved to be more effective. It can better
explore the morphological information of the input image to match its semantic
information, and does not need any value inside the DNN. We qualitatively and
quantitatively prove that MFPP meets and exceeds the performance of
state-of-the-art (SOTA) black-box interpretation method on multiple DNN models
and datasets.
Related papers
- MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling [64.09238330331195]
We propose a novel Multi-Modal Auto-Regressive (MMAR) probabilistic modeling framework.
Unlike discretization line of method, MMAR takes in continuous-valued image tokens to avoid information loss.
We show that MMAR demonstrates much more superior performance than other joint multi-modal models.
arXiv Detail & Related papers (2024-10-14T17:57:18Z) - Pixel-Aligned Multi-View Generation with Depth Guided Decoder [86.1813201212539]
We propose a novel method for pixel-level image-to-multi-view generation.
Unlike prior work, we incorporate attention layers across multi-view images in the VAE decoder of a latent video diffusion model.
Our model enables better pixel alignment across multi-view images.
arXiv Detail & Related papers (2024-08-26T04:56:41Z) - CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation [35.021331140484804]
Class activation maps (CAMs) and recent variants provide ways to visually explain the Deep Neural Networks (DNNs) decision-making process.
We propose CAPE, a novel reformulation of CAM that provides a unified and probabilistically meaningful assessment of the contributions of image regions.
arXiv Detail & Related papers (2024-04-03T01:13:05Z) - Interpreting Black-box Machine Learning Models for High Dimensional
Datasets [40.09157165704895]
We train a black-box model on a high-dimensional dataset to learn the embeddings on which the classification is performed.
We then approximate the behavior of the black-box model by means of an interpretable surrogate model on the top-k feature space.
Our approach outperforms state-of-the-art methods like TabNet and XGboost when tested on different datasets.
arXiv Detail & Related papers (2022-08-29T07:36:17Z) - Pyramid Grafting Network for One-Stage High Resolution Saliency
Detection [29.013012579688347]
We propose a one-stage framework called Pyramid Grafting Network (PGNet) to extract features from different resolution images independently.
An attention-based Cross-Model Grafting Module (CMGM) is proposed to enable CNN branch to combine broken detailed information more holistically.
We contribute a new Ultra-High-Resolution Saliency Detection dataset UHRSD, containing 5,920 images at 4K-8K resolutions.
arXiv Detail & Related papers (2022-04-11T12:22:21Z) - Inverting brain grey matter models with likelihood-free inference: a
tool for trustable cytoarchitecture measurements [62.997667081978825]
characterisation of the brain grey matter cytoarchitecture with quantitative sensitivity to soma density and volume remains an unsolved challenge in dMRI.
We propose a new forward model, specifically a new system of equations, requiring a few relatively sparse b-shells.
We then apply modern tools from Bayesian analysis known as likelihood-free inference (LFI) to invert our proposed model.
arXiv Detail & Related papers (2021-11-15T09:08:27Z) - Spatio-Temporal Inception Graph Convolutional Networks for
Skeleton-Based Action Recognition [126.51241919472356]
We design a simple and highly modularized graph convolutional network architecture for skeleton-based action recognition.
Our network is constructed by repeating a building block that aggregates multi-granularity information from both the spatial and temporal paths.
arXiv Detail & Related papers (2020-11-26T14:43:04Z) - Multiscale Detection of Cancerous Tissue in High Resolution Slide Scans [0.0]
We present an algorithm for multi-scale tumor (chimeric cell) detection in high resolution slide scans.
Our approach modifies the effective receptive field at different layers in a CNN so that objects with a broad range of varying scales can be detected in a single forward pass.
arXiv Detail & Related papers (2020-10-01T18:56:46Z) - Adaptive Context-Aware Multi-Modal Network for Depth Completion [107.15344488719322]
We propose to adopt the graph propagation to capture the observed spatial contexts.
We then apply the attention mechanism on the propagation, which encourages the network to model the contextual information adaptively.
Finally, we introduce the symmetric gated fusion strategy to exploit the extracted multi-modal features effectively.
Our model, named Adaptive Context-Aware Multi-Modal Network (ACMNet), achieves the state-of-the-art performance on two benchmarks.
arXiv Detail & Related papers (2020-08-25T06:00:06Z) - Improving the Interpretability of fMRI Decoding using Deep Neural
Networks and Adversarial Robustness [1.254120224317171]
A saliency map is a common approach for producing interpretable visualizations of the relative importance of input features for a prediction.
In this paper, we review a variety of methods for producing gradient-based saliency maps, and present a new adversarial training method we developed to make DNNs robust to input noise.
arXiv Detail & Related papers (2020-04-23T12:56:24Z) - Belief Propagation Reloaded: Learning BP-Layers for Labeling Problems [83.98774574197613]
We take one of the simplest inference methods, a truncated max-product Belief propagation, and add what is necessary to make it a proper component of a deep learning model.
This BP-Layer can be used as the final or an intermediate block in convolutional neural networks (CNNs)
The model is applicable to a range of dense prediction problems, is well-trainable and provides parameter-efficient and robust solutions in stereo, optical flow and semantic segmentation.
arXiv Detail & Related papers (2020-03-13T13:11:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.