Improving Prediction Performance and Model Interpretability through
Attention Mechanisms from Basic and Applied Research Perspectives
- URL: http://arxiv.org/abs/2303.14116v1
- Date: Fri, 24 Mar 2023 16:24:08 GMT
- Title: Improving Prediction Performance and Model Interpretability through
Attention Mechanisms from Basic and Applied Research Perspectives
- Authors: Shunsuke Kitada
- Abstract summary: This bulletin is based on the summary of the author's dissertation.
Deep learning models have much higher prediction performance than traditional machine learning models.
The specific prediction process is still difficult to interpret and/or explain.
- Score: 3.553493344868414
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the dramatic advances in deep learning technology, machine learning
research is focusing on improving the interpretability of model predictions as
well as prediction performance in both basic and applied research. While deep
learning models have much higher prediction performance than traditional
machine learning models, the specific prediction process is still difficult to
interpret and/or explain. This is known as the black-boxing of machine learning
models and is recognized as a particularly important problem in a wide range of
research fields, including manufacturing, commerce, robotics, and other
industries where the use of such technology has become commonplace, as well as
the medical field, where mistakes are not tolerated. This bulletin is based on
the summary of the author's dissertation. The research summarized in the
dissertation focuses on the attention mechanism, which has been the focus of
much attention in recent years, and discusses its potential for both basic
research in terms of improving prediction performance and interpretability, and
applied research in terms of evaluating it for real-world applications using
large data sets beyond the laboratory environment. The dissertation also
concludes with a summary of the implications of these findings for subsequent
research and future prospects in the field.
Related papers
- Human Action Anticipation: A Survey [86.415721659234]
The literature on behavior prediction spans various tasks, including action anticipation, activity forecasting, intent prediction, goal prediction, and so on.
Our survey aims to tie together this fragmented literature, covering recent technical innovations as well as the development of new large-scale datasets for model training and evaluation.
arXiv Detail & Related papers (2024-10-17T21:37:40Z) - "Forgetting" in Machine Learning and Beyond: A Survey [0.0]
This survey focuses on the benefits of forgetting and its applications across various machine learning sub-fields.
The paper discusses current challenges, future directions, and ethical considerations regarding the integration of forgetting mechanisms into machine learning models.
arXiv Detail & Related papers (2024-05-31T05:10:30Z) - Enhancing Generative Class Incremental Learning Performance with Model Forgetting Approach [50.36650300087987]
This study presents a novel approach to Generative Class Incremental Learning (GCIL) by introducing the forgetting mechanism.
We have found that integrating the forgetting mechanisms significantly enhances the models' performance in acquiring new knowledge.
arXiv Detail & Related papers (2024-03-27T05:10:38Z) - Deep learning models for price forecasting of financial time series: A
review of recent advancements: 2020-2022 [6.05458608266581]
Deep learning models are replacing traditional statistical and machine learning models for price forecasting tasks.
This review delves deeply into deep learning-based forecasting models, presenting information on model architectures, practical applications, and their respective advantages and disadvantages.
The present contribution also includes potential directions for future research, such as examining the effectiveness of deep learning models with complex structures for price forecasting.
arXiv Detail & Related papers (2023-04-21T03:46:09Z) - Statistical Foundation Behind Machine Learning and Its Impact on
Computer Vision [8.974457198386414]
This paper revisits the principle of uniform convergence in statistical learning, discusses how it acts as the foundation behind machine learning, and attempts to gain a better understanding of the essential problem that current deep learning algorithms are solving.
Using computer vision as an example domain in machine learning, the discussion shows that recent research trends in leveraging increasingly large-scale data to perform pre-training for representation learning are largely to reduce the discrepancy between a practically tractable empirical loss and its ultimately desired but intractable expected loss.
arXiv Detail & Related papers (2022-09-06T17:59:04Z) - Individual Explanations in Machine Learning Models: A Survey for
Practitioners [69.02688684221265]
The use of sophisticated statistical models that influence decisions in domains of high societal relevance is on the rise.
Many governments, institutions, and companies are reluctant to their adoption as their output is often difficult to explain in human-interpretable ways.
Recently, the academic literature has proposed a substantial amount of methods for providing interpretable explanations to machine learning models.
arXiv Detail & Related papers (2021-04-09T01:46:34Z) - Forethought and Hindsight in Credit Assignment [62.05690959741223]
We work to understand the gains and peculiarities of planning employed as forethought via forward models or as hindsight operating with backward models.
We investigate the best use of models in planning, primarily focusing on the selection of states in which predictions should be (re)-evaluated.
arXiv Detail & Related papers (2020-10-26T16:00:47Z) - Using satellite imagery to understand and promote sustainable
development [87.72561825617062]
We synthesize the growing literature that uses satellite imagery to understand sustainable development outcomes.
We quantify the paucity of ground data on key human-related outcomes and the growing abundance and resolution of satellite imagery.
We review recent machine learning approaches to model-building in the context of scarce and noisy training data.
arXiv Detail & Related papers (2020-09-23T05:20:00Z) - Causality Learning: A New Perspective for Interpretable Machine Learning [15.556963808865918]
interpretable machine learning is currently a mainstream topic in the research community.
This paper provides an overview of causal analysis with the fundamental background and key concepts, and then summarizes most recent causal approaches for interpretable machine learning.
arXiv Detail & Related papers (2020-06-27T13:01:28Z) - Predictive modeling approaches in laser-based material processing [59.04160452043105]
This study aims to automate and forecast the effect of laser processing on material structures.
The focus is centred on the performance of representative statistical and machine learning algorithms.
Results can set the basis for a systematic methodology towards reducing material design, testing and production cost.
arXiv Detail & Related papers (2020-06-13T17:28:52Z) - Introduction to Rare-Event Predictive Modeling for Inferential
Statisticians -- A Hands-On Application in the Prediction of Breakthrough
Patents [0.0]
We introduce a machine learning (ML) approach to quantitative analysis geared towards optimizing the predictive performance.
We discuss the potential synergies between the two fields against the backdrop of this, at first glance, target-incompatibility.
We are providing a hands-on predictive modeling introduction for a quantitative social science audience while aiming at demystifying computer science jargon.
arXiv Detail & Related papers (2020-03-30T13:06:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.