Related papers: Data-Driven Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations

Data-Driven Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations

URL: http://arxiv.org/abs/2512.11946v1
Date: Fri, 12 Dec 2025 15:28:17 GMT
Title: Data-Driven Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations
Authors: Pramudita Satria Palar, Paul Saves, Rommel G. Regis, Koji Shimoyama, Shigeru Obayashi, Nicolas Verstaevel, Joseph Morlier,
Abstract summary: We propose a global sensitivity metric based on Individual Conditional Expectation curves.<n>The method computes the expected feature importance across ICE curves, along with their standard deviation, to more effectively capture the influence of interactions.<n>In addition, we introduce an ICE-based correlation value to quantify how interactions modify between inputs and the output.
Score: 0.29316801942271303
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Explainable machine learning techniques have gained increasing attention in engineering applications, especially in aerospace design and analysis, where understanding how input variables influence data-driven models is essential. Partial Dependence Plots (PDPs) are widely used for interpreting black-box models by showing the average effect of an input variable on the prediction. However, their global sensitivity metric can be misleading when strong interactions are present, as averaging tends to obscure interaction effects. To address this limitation, we propose a global sensitivity metric based on Individual Conditional Expectation (ICE) curves. The method computes the expected feature importance across ICE curves, along with their standard deviation, to more effectively capture the influence of interactions. We provide a mathematical proof demonstrating that the PDP-based sensitivity is a lower bound of the proposed ICE-based metric under truncated orthogonal polynomial expansion. In addition, we introduce an ICE-based correlation value to quantify how interactions modify the relationship between inputs and the output. Comparative evaluations were performed on three cases: a 5-variable analytical function, a 5-variable wind-turbine fatigue problem, and a 9-variable airfoil aerodynamics case, where ICE-based sensitivity was benchmarked against PDP, SHapley Additive exPlanations (SHAP), and Sobol' indices. The results show that ICE-based feature importance provides richer insights than the traditional PDP-based approach, while visual interpretations from PDP, ICE, and SHAP complement one another by offering multiple perspectives.

Related papers

Explainability of Complex AI Models with Correlation Impact Ratio [10.61008729196936]
Complex AI systems make better predictions but often lack transparency, limiting trustworthiness, interpretability, and safe deployment.<n>We introduce ExCIR (Explainability through Correlation Impact Ratio), a theoretically grounded, simple, and reliable metric for explaining the contribution of input features to model outputs.<n>We demonstrate that ExCIR captures dependencies arising from correlated features through a lightweight single pass formulation.
arXiv Detail & Related papers (2026-01-10T21:56:24Z)
Smart Sensor Placement: A Correlation-Aware Attribution Framework (CAAF) for Real-world Data Modeling [11.354527723215568]
Optimal sensor placement (OSP) is critical for efficient, accurate monitoring, control, and inference in real-world systems.<n>We propose a machine-learning-based feature attribution framework to identify OSP for the prediction of quantities of interest.
arXiv Detail & Related papers (2025-10-26T03:50:16Z)
Disentangled Feature Importance [0.0]
We introduce emphDisentangled Feature Importance (DFI), a nonparametric generalization of the classical $R2$ decomposition via optimal transport.<n>DFI correlated features into independent latent variables using a transport map, eliminating correlation distortion.<n>DFI provides a principled decomposition of importance scores that sum to the total predictive variability for latent additive models.
arXiv Detail & Related papers (2025-06-30T20:54:48Z)
Graph-based Complexity for Causal Effect by Empirical Plug-in [56.14597641617531]
This paper focuses on the computational complexity of computing empirical plug-in estimates for causal effect queries. We show that computation can be done efficiently, potentially in time linear in the data size, depending on the estimand's hypergraph.
arXiv Detail & Related papers (2024-11-15T07:42:01Z)
Variable Importance in High-Dimensional Settings Requires Grouping [19.095605415846187]
Conditional Permutation Importance (CPI) bypasses PI's limitations in such cases. Grouping variables statistically via clustering or some prior knowledge gains some power back. We show that the approach extended with stacking controls the type-I error even with highly-correlated groups.
arXiv Detail & Related papers (2023-12-18T00:21:47Z)
Forecasting Auxiliary Energy Consumption for Electric Heavy-Duty Vehicles [6.375656754994484]
Energy consumption prediction is crucial for optimizing the operation of electric commercial heavy-duty vehicles. In this paper, we demonstrate a potential solution by training multiple regression models on subsets of data. Experiments on both synthetic and real-world datasets show that such splitting of a complex problem into simpler ones yields better regression performance and interpretability.
arXiv Detail & Related papers (2023-11-27T16:52:25Z)
Data-Driven Influence Functions for Optimization-Based Causal Inference [105.5385525290466]
We study a constructive algorithm that approximates Gateaux derivatives for statistical functionals by finite differencing. We study the case where probability distributions are not known a priori but need to be estimated from data.
arXiv Detail & Related papers (2022-08-29T16:16:22Z)
Bringing a Ruler Into the Black Box: Uncovering Feature Impact from Individual Conditional Expectation Plots [0.0]
We introduce a model-agnostic, performance-agnostic feature impact metric drawn out from ICE plots. We also introduce an in-distribution variant of ICE feature impact to vary the influence of out-of-distribution points. We demonstrate ICE feature impact's utility in several tasks using real-world data.
arXiv Detail & Related papers (2021-09-06T20:26:29Z)
Identifiable Energy-based Representations: An Application to Estimating Heterogeneous Causal Effects [83.66276516095665]
Conditional average treatment effects (CATEs) allow us to understand the effect heterogeneity across a large population of individuals. Typical CATE learners assume all confounding variables are measured in order for the CATE to be identifiable. We propose an energy-based model (EBM) that learns a low-dimensional representation of the variables by employing a noise contrastive loss function.
arXiv Detail & Related papers (2021-08-06T10:39:49Z)
SGCN:Sparse Graph Convolution Network for Pedestrian Trajectory Prediction [64.16212996247943]
We present a Sparse Graph Convolution Network(SGCN) for pedestrian trajectory prediction. Specifically, the SGCN explicitly models the sparse directed interaction with a sparse directed spatial graph to capture adaptive interaction pedestrians. visualizations indicate that our method can capture adaptive interactions between pedestrians and their effective motion tendencies.
arXiv Detail & Related papers (2021-04-04T03:17:42Z)
Latent Causal Invariant Model [128.7508609492542]
Current supervised learning can learn spurious correlation during the data-fitting process. We propose a Latent Causal Invariance Model (LaCIM) which pursues causal prediction.
arXiv Detail & Related papers (2020-11-04T10:00:27Z)
Repulsive Mixture Models of Exponential Family PCA for Clustering [127.90219303669006]
The mixture extension of exponential family principal component analysis ( EPCA) was designed to encode much more structural information about data distribution than the traditional EPCA. The traditional mixture of local EPCAs has the problem of model redundancy, i.e., overlaps among mixing components, which may cause ambiguity for data clustering. In this paper, a repulsiveness-encouraging prior is introduced among mixing components and a diversified EPCA mixture (DEPCAM) model is developed in the Bayesian framework.
arXiv Detail & Related papers (2020-04-07T04:07:29Z)

This list is automatically generated from the titles and abstracts of the papers in this site.