How to Learn from Others: Transfer Machine Learning with Additive
Regression Models to Improve Sales Forecasting
- URL: http://arxiv.org/abs/2005.10698v1
- Date: Fri, 15 May 2020 15:44:37 GMT
- Title: How to Learn from Others: Transfer Machine Learning with Additive
Regression Models to Improve Sales Forecasting
- Authors: Robin Hirt, Niklas K\"uhl, Yusuf Peker, Gerhard Satzger
- Abstract summary: We propose a transfer machine learning approach based on additive regression models.
We evaluate the approach on a rich, multi-year dataset of multiple restaurant branches.
The results show the potential of the approach to exploit the collectively available analytical knowledge.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In a variety of business situations, the introduction or improvement of
machine learning approaches is impaired as these cannot draw on existing
analytical models. However, in many cases similar problems may have already
been solved elsewhere-but the accumulated analytical knowledge cannot be tapped
to solve a new problem, e.g., because of privacy barriers. For the particular
purpose of sales forecasting for similar entities, we propose a transfer
machine learning approach based on additive regression models that lets new
entities benefit from models of existing entities. We evaluate the approach on
a rich, multi-year dataset of multiple restaurant branches. We differentiate
the options to simply transfer models from one branch to another ("zero shot")
or to transfer and adapt them. We analyze feasibility and performance against
several forecasting benchmarks. The results show the potential of the approach
to exploit the collectively available analytical knowledge. Thus, we contribute
an approach that is generalizable beyond sales forecasting and the specific use
case in particular. In addition, we demonstrate its feasibility for a typical
use case as well as the potential for improving forecasting quality. These
results should inform academia, as they help to leverage knowledge across
various entities, and have immediate practical application in industry.
Related papers
- Predictive Churn with the Set of Good Models [64.05949860750235]
We study the effect of conflicting predictions over the set of near-optimal machine learning models.
We present theoretical results on the expected churn between models within the Rashomon set.
We show how our approach can be used to better anticipate, reduce, and avoid churn in consumer-facing applications.
arXiv Detail & Related papers (2024-02-12T16:15:25Z) - Pre-trained Recommender Systems: A Causal Debiasing Perspective [19.712997823535066]
We develop a generic recommender that captures universal interaction patterns by training on generic user-item interaction data extracted from different domains.
Our empirical studies show that the proposed model could significantly improve the recommendation performance in zero- and few-shot learning settings.
arXiv Detail & Related papers (2023-10-30T03:37:32Z) - Fantastic Gains and Where to Find Them: On the Existence and Prospect of
General Knowledge Transfer between Any Pretrained Model [74.62272538148245]
We show that for arbitrary pairings of pretrained models, one model extracts significant data context unavailable in the other.
We investigate if it is possible to transfer such "complementary" knowledge from one model to another without performance degradation.
arXiv Detail & Related papers (2023-10-26T17:59:46Z) - MinT: Boosting Generalization in Mathematical Reasoning via Multi-View
Fine-Tuning [53.90744622542961]
Reasoning in mathematical domains remains a significant challenge for small language models (LMs)
We introduce a new method that exploits existing mathematical problem datasets with diverse annotation styles.
Experimental results show that our strategy enables a LLaMA-7B model to outperform prior approaches.
arXiv Detail & Related papers (2023-07-16T05:41:53Z) - Parameterized Neural Networks for Finance [0.0]
We discuss and analyze a neural network architecture, that enables learning a model class for a set of different data samples.
We apply the approach to one of the standard problems asset managers and banks are facing: the calibration of spread curves.
arXiv Detail & Related papers (2023-04-18T10:18:28Z) - Entity Aware Modelling: A Survey [22.32009539611539]
Recent machine learning advances have led to new state-of-the-art response prediction models.
Models built at a population level often lead to sub-optimal performance in many personalized prediction settings.
In personalized prediction, the goal is to incorporate inherent characteristics of different entities to improve prediction performance.
arXiv Detail & Related papers (2023-02-16T16:33:33Z) - Synthetic Model Combination: An Instance-wise Approach to Unsupervised
Ensemble Learning [92.89846887298852]
Consider making a prediction over new test data without any opportunity to learn from a training set of labelled data.
Give access to a set of expert models and their predictions alongside some limited information about the dataset used to train them.
arXiv Detail & Related papers (2022-10-11T10:20:31Z) - Autoencoder Attractors for Uncertainty Estimation [13.618797548020462]
We propose a novel approach for uncertainty estimation based on autoencoder models.
We evaluate our approach on several dataset combinations as well as on an industrial application for occupant classification in the vehicle interior.
arXiv Detail & Related papers (2022-04-01T12:10:06Z) - Beyond Explaining: Opportunities and Challenges of XAI-Based Model
Improvement [75.00655434905417]
Explainable Artificial Intelligence (XAI) is an emerging research field bringing transparency to highly complex machine learning (ML) models.
This paper offers a comprehensive overview over techniques that apply XAI practically for improving various properties of ML models.
We show empirically through experiments on toy and realistic settings how explanations can help improve properties such as model generalization ability or reasoning.
arXiv Detail & Related papers (2022-03-15T15:44:28Z) - Individual Explanations in Machine Learning Models: A Survey for
Practitioners [69.02688684221265]
The use of sophisticated statistical models that influence decisions in domains of high societal relevance is on the rise.
Many governments, institutions, and companies are reluctant to their adoption as their output is often difficult to explain in human-interpretable ways.
Recently, the academic literature has proposed a substantial amount of methods for providing interpretable explanations to machine learning models.
arXiv Detail & Related papers (2021-04-09T01:46:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.