A Novel Hybrid Feature Importance and Feature Interaction Detection
Framework for Predictive Optimization in Industry 4.0 Applications
- URL: http://arxiv.org/abs/2403.02368v1
- Date: Mon, 4 Mar 2024 13:22:53 GMT
- Title: A Novel Hybrid Feature Importance and Feature Interaction Detection
Framework for Predictive Optimization in Industry 4.0 Applications
- Authors: Zhipeng Ma, Bo Nørregaard Jørgensen, Zheng Grace Ma
- Abstract summary: This paper proposes a novel hybrid framework that combines a feature importance detector, local interpretable model-agnostic explanations (LIME), with a feature interaction detector, neural interaction detection (NID).
The experimental outcomes show an increase of up to 9.56% in the R2 score and a reduction of up to 24.05% in the root mean square error.
- Score: 1.0870564199697297
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Advanced machine learning algorithms are increasingly utilized to provide
data-based prediction and decision-making support in Industry 4.0. However, the
prediction accuracy achieved by the existing models is insufficient to warrant
practical implementation in real-world applications. This is because not all
features present in real-world datasets possess a direct relevance to the
predictive analysis being conducted. Consequently, the careful incorporation of
select features has the potential to yield a substantial positive impact on the
outcome. To address the research gap, this paper proposes a novel hybrid
framework that combines a feature importance detector, local interpretable
model-agnostic explanations (LIME), with a feature interaction detector,
neural interaction detection (NID), to improve prediction accuracy. By applying
the proposed framework, unnecessary features can be eliminated, and
interactions are encoded to generate a more conducive dataset for predictive
purposes. Subsequently, the proposed model is deployed to refine the prediction
of electricity consumption in foundry processing. The experimental outcomes
show an increase of up to 9.56% in the R2 score and a reduction of up to
24.05% in the root mean square error.
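The two metrics quoted in the abstract can be made concrete with a small sketch. The Python below computes the R2 score and the root mean square error for a baseline and a refined set of predictions, then reports the relative change in each. It assumes the reported percentages are relative changes against a baseline model, which the abstract does not state; the helper names and all numbers are hypothetical and are not taken from the paper.

```python
import math


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def rmse(y_true, y_pred):
    """Root mean square error of the predictions."""
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def relative_change_pct(baseline, refined):
    """Signed relative change from baseline to refined, in percent."""
    return 100.0 * (refined - baseline) / baseline


# Hypothetical toy values, NOT data from the paper.
y_true = [10.0, 12.0, 14.0, 16.0]
baseline_pred = [11.0, 11.0, 15.0, 15.0]   # e.g. a model trained on all raw features
refined_pred = [10.5, 11.5, 14.5, 15.5]    # e.g. the same model after feature refinement

r2_gain = relative_change_pct(r2_score(y_true, baseline_pred),
                              r2_score(y_true, refined_pred))
rmse_drop = relative_change_pct(rmse(y_true, baseline_pred),
                                rmse(y_true, refined_pred))

print(f"R2 change:   {r2_gain:+.2f}%")
print(f"RMSE change: {rmse_drop:+.2f}%")
```

A positive R2 change together with a negative RMSE change corresponds to the kind of improvement the abstract describes; whether the paper measures the change this way would need to be checked against its full text.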
Related papers
- Enhancing Variable Importance in Random Forests: A Novel Application of Global Sensitivity Analysis [0.9954382983583578]
The present work provides an application of Global Sensitivity Analysis to supervised machine learning methods such as Random Forests.
Global Sensitivity Analysis is primarily used in mathematical modelling to investigate the effect of the uncertainties of the input variables on the output.
A simulation study shows that our proposal can be used to explore what advances can be achieved either in terms of efficiency, explanatory ability, or simply by way of confirming existing results.
arXiv Detail & Related papers (2024-07-19T10:45:36Z)
- JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds [79.00975648564483]
Trajectory forecasting models, employed in fields such as robotics, autonomous vehicles, and navigation, face challenges in real-world scenarios.
This dataset provides comprehensive data, including the locations of all agents, scene images, and point clouds, all from the robot's perspective.
The objective is to predict the future positions of agents relative to the robot using raw sensory input data.
arXiv Detail & Related papers (2023-11-05T18:59:31Z)
- Toward Robust Uncertainty Estimation with Random Activation Functions [3.0586855806896045]
We propose a novel approach for uncertainty quantification via ensembles, called Random Activation Functions (RAFs) Ensemble.
RAFs Ensemble outperforms state-of-the-art ensemble uncertainty quantification methods on both synthetic and real-world datasets.
arXiv Detail & Related papers (2023-02-28T13:17:56Z)
- Prediction-Powered Inference [68.97619568620709]
Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system.
The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients.
Prediction-powered inference could enable researchers to draw valid and more data-efficient conclusions using machine learning.
arXiv Detail & Related papers (2023-01-23T18:59:28Z)
- Dual Accuracy-Quality-Driven Neural Network for Prediction Interval Generation [0.0]
We present a method to learn prediction intervals for regression-based neural networks automatically.
Our main contribution is the design of a novel loss function for the PI-generation network.
Experiments using a synthetic dataset, eight benchmark datasets, and a real-world crop yield prediction dataset showed that our method was able to maintain a nominal probability coverage.
arXiv Detail & Related papers (2022-12-13T05:03:16Z)
- HyperImpute: Generalized Iterative Imputation with Automatic Model Selection [77.86861638371926]
We propose a generalized iterative imputation framework for adaptively and automatically configuring column-wise models.
We provide a concrete implementation with out-of-the-box learners, simulators, and interfaces.
arXiv Detail & Related papers (2022-06-15T19:10:35Z)
- Preference Enhanced Social Influence Modeling for Network-Aware Cascade Prediction [59.221668173521884]
We propose a novel framework to promote cascade size prediction by enhancing the user preference modeling.
Our end-to-end method makes the user activating process of information diffusion more adaptive and accurate.
arXiv Detail & Related papers (2022-04-18T09:25:06Z)
- Hybrid Predictive Coding: Inferring, Fast and Slow [62.997667081978825]
We propose a hybrid predictive coding network that combines both iterative and amortized inference in a principled manner.
We demonstrate that our model is inherently sensitive to its uncertainty and adaptively balances iterative and amortized inference to obtain accurate beliefs using minimum computational expense.
arXiv Detail & Related papers (2022-04-05T12:52:45Z)
- Masked Transformer for Neighhourhood-aware Click-Through Rate Prediction [74.52904110197004]
We propose Neighbor-Interaction based CTR prediction, which puts this task into a Heterogeneous Information Network (HIN) setting.
In order to enhance the representation of the local neighbourhood, we consider four types of topological interaction among the nodes.
We conduct comprehensive experiments on two real-world datasets, and the results show that our proposed method significantly outperforms state-of-the-art CTR models.
arXiv Detail & Related papers (2022-01-25T12:44:23Z)
- Approximate Bayesian Optimisation for Neural Networks [6.921210544516486]
A body of work has been done to automate machine learning algorithms and to highlight the importance of model choice.
The need to address analytical tractability and computational feasibility in an idealistic fashion makes it possible to ensure efficiency and applicability.
arXiv Detail & Related papers (2021-08-27T19:03:32Z)
- Detecting Beneficial Feature Interactions for Recommender Systems [15.599904548629537]
Feature interactions are essential for achieving high accuracy in recommender systems.
We propose a graph neural network approach to effectively model them, together with a novel technique to automatically detect those feature interactions.
Our proposed model is proved to be effective through the information bottleneck principle and statistical interaction theory.
arXiv Detail & Related papers (2020-08-02T06:08:23Z)
This list is automatically generated from the titles and abstracts of the papers on this site.