Mutual Information for Explainable Deep Learning of Multiscale Systems
- URL: http://arxiv.org/abs/2009.04570v2
- Date: Wed, 19 May 2021 10:04:18 GMT
- Title: Mutual Information for Explainable Deep Learning of Multiscale Systems
- Authors: S{\o}ren Taverniers and Eric J. Hall and Markos A. Katsoulakis and
Daniel M. Tartakovsky
- Abstract summary: We develop a model-agnostic, moment-independent global sensitivity analysis (GSA)
GSA relies on differential mutual information to rank the effects of CVs on QoIs.
We demonstrate that the surrogate-driven mutual information GSA provides useful and distinguishable rankings on two applications of interest in energy storage.
- Score: 1.1470070927586016
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Timely completion of design cycles for complex systems ranging from consumer
electronics to hypersonic vehicles relies on rapid simulation-based
prototyping. The latter typically involves high-dimensional spaces of possibly
correlated control variables (CVs) and quantities of interest (QoIs) with
non-Gaussian and possibly multimodal distributions. We develop a
model-agnostic, moment-independent global sensitivity analysis (GSA) that
relies on differential mutual information to rank the effects of CVs on QoIs.
The data requirements of this information-theoretic approach to GSA are met by
replacing computationally intensive components of the physics-based model with
a deep neural network surrogate. Subsequently, the GSA is used to explain the
network predictions, and the surrogate is deployed to close design loops.
Viewed as an uncertainty quantification method for interrogating the surrogate,
this framework is compatible with a wide variety of black-box models. We
demonstrate that the surrogate-driven mutual information GSA provides useful
and distinguishable rankings on two applications of interest in energy storage.
Consequently, our information-theoretic GSA provides an "outer loop" for
accelerated product design by identifying the most and least sensitive input
directions and performing subsequent optimization over appropriately reduced
parameter subspaces.
Related papers
- Synesthesia of Machines (SoM)-Enhanced ISAC Precoding for Vehicular Networks with Double Dynamics [15.847713094328286]
Integrated sensing and communication (ISAC) technology plays a crucial role in vehicular networks.
Double dynamics present significant challenges for real-time ISAC precoding design.
We propose a synesthesia of machine (SoM)-enhanced precoding paradigm.
arXiv Detail & Related papers (2024-08-24T10:35:10Z) - ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and
Multispectral Data Fusion [54.668445421149364]
Deep learning-based hyperspectral image (HSI) super-resolution aims to generate high spatial resolution HSI (HR-HSI) by fusing hyperspectral image (HSI) and multispectral image (MSI) with deep neural networks (DNNs)
In this letter, we propose a novel adversarial automatic data augmentation framework ADASR that automatically optimize and augments HSI-MSI sample pairs to enrich data diversity for HSI-MSI fusion.
arXiv Detail & Related papers (2023-10-11T07:30:37Z) - Multi-Point Integrated Sensing and Communication: Fusion Model and
Functionality Selection [99.67715229413986]
This paper presents a multi-point ISAC (MPISAC) system that fuses the outputs from multiple ISAC devices for achieving higher sensing performance.
We adopt a fusion model that predicts the fusion accuracy via hypothesis testing and optimal voting analysis.
arXiv Detail & Related papers (2022-08-16T08:09:54Z) - Task-Oriented Sensing, Computation, and Communication Integration for
Multi-Device Edge AI [108.08079323459822]
This paper studies a new multi-intelligent edge artificial-latency (AI) system, which jointly exploits the AI model split inference and integrated sensing and communication (ISAC)
We measure the inference accuracy by adopting an approximate but tractable metric, namely discriminant gain.
arXiv Detail & Related papers (2022-07-03T06:57:07Z) - A Graph Deep Learning Framework for High-Level Synthesis Design Space
Exploration [11.154086943903696]
High-Level Synthesis is a solution for fast prototyping application-specific hardware.
We propose HLS, for the first time in the literature, graph neural networks that jointly predict acceleration performance and hardware costs.
We show that our approach achieves prediction accuracy comparable with that of commonly used simulators.
arXiv Detail & Related papers (2021-11-29T18:17:45Z) - Autoregressive Hidden Markov Models with partial knowledge on latent
space applied to aero-engines prognostics [2.179313476241343]
This paper describes an Autoregressive Partially-hidden Markov model (ARPHMM) for fault detection and prognostics of equipments based on sensors' data.
We show how to apply this model to estimate the remaining useful life based on health indicators.
arXiv Detail & Related papers (2021-05-01T10:23:22Z) - Cross-Attention in Coupled Unmixing Nets for Unsupervised Hyperspectral
Super-Resolution [79.97180849505294]
We propose a novel coupled unmixing network with a cross-attention mechanism, CUCaNet, to enhance the spatial resolution of HSI.
Experiments are conducted on three widely-used HS-MS datasets in comparison with state-of-the-art HSI-SR models.
arXiv Detail & Related papers (2020-07-10T08:08:20Z) - SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier
Detection [63.253850875265115]
Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples.
We propose a modular acceleration system, called SUOD, to address it.
arXiv Detail & Related papers (2020-03-11T00:22:50Z) - Multilinear Compressive Learning with Prior Knowledge [106.12874293597754]
Multilinear Compressive Learning (MCL) framework combines Multilinear Compressive Sensing and Machine Learning into an end-to-end system.
Key idea behind MCL is the assumption of the existence of a tensor subspace which can capture the essential features from the signal for the downstream learning task.
In this paper, we propose a novel solution to address both of the aforementioned requirements, i.e., How to find those tensor subspaces in which the signals of interest are highly separable?
arXiv Detail & Related papers (2020-02-17T19:06:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.