A Generalized Flow for B2B Sales Predictive Modeling: An Azure Machine
Learning Approach
- URL: http://arxiv.org/abs/2002.01441v2
- Date: Fri, 3 Jul 2020 01:00:22 GMT
- Title: A Generalized Flow for B2B Sales Predictive Modeling: An Azure Machine
Learning Approach
- Authors: Alireza Rezazadeh
- Abstract summary: Predicting the outcome of business to business (B2B) sales is a core part of successful business management.
We propose a thorough data-driven Machine Learning (ML) workflow on a cloud-based computing platform: Microsoft Azure ML.
Our results implied that decision-making based on the ML predictions is more accurate and brings a higher monetary value.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predicting the outcome of sales opportunities is a core part of successful
business management. Conventionally, making this prediction has relied mostly
on subjective human evaluations in the process of sales decision making. In
this paper, we addressed the problem of forecasting the outcome of business to
business (B2B) sales by proposing a thorough data-driven Machine Learning (ML)
workflow on a cloud-based computing platform: Microsoft Azure Machine Learning
Service (Azure ML). This workflow consists of two pipelines: (1) An ML pipeline
to train probabilistic predictive models on the historical sales opportunities
data. In this pipeline, data is enriched with an extensive feature enhancement
step and then used to train an ensemble of ML classification models in
parallel. (2) A prediction pipeline to utilize the trained ML model and infer
the likelihood of winning new sales opportunities along with calculating
optimal decision boundaries. The effectiveness of the proposed workflow was
evaluated on a real sales dataset of a major global B2B consulting firm. Our
results implied that decision-making based on the ML predictions is more
accurate and brings a higher monetary value.
Related papers
- Scaling Laws for Predicting Downstream Performance in LLMs [75.28559015477137]
This work focuses on the pre-training loss as a more-efficient metric for performance estimation.
We extend the power law analytical function to predict domain-specific pre-training loss based on FLOPs across data sources.
We employ a two-layer neural network to model the non-linear relationship between multiple domain-specific loss and downstream performance.
arXiv Detail & Related papers (2024-10-11T04:57:48Z) - A Probabilistic Perspective on Unlearning and Alignment for Large Language Models [48.96686419141881]
We introduce the first formal probabilistic evaluation framework in Large Language Models (LLMs)
We derive novel metrics with high-probability guarantees concerning the output distribution of a model.
Our metrics are application-independent and allow practitioners to make more reliable estimates about model capabilities before deployment.
arXiv Detail & Related papers (2024-10-04T15:44:23Z) - F-FOMAML: GNN-Enhanced Meta-Learning for Peak Period Demand Forecasting with Proxy Data [65.6499834212641]
We formulate the demand prediction as a meta-learning problem and develop the Feature-based First-Order Model-Agnostic Meta-Learning (F-FOMAML) algorithm.
By considering domain similarities through task-specific metadata, our model improved generalization, where the excess risk decreases as the number of training tasks increases.
Compared to existing state-of-the-art models, our method demonstrates a notable improvement in demand prediction accuracy, reducing the Mean Absolute Error by 26.24% on an internal vending machine dataset and by 1.04% on the publicly accessible JD.com dataset.
arXiv Detail & Related papers (2024-06-23T21:28:50Z) - Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models [51.3422222472898]
We document the capability of large language models (LLMs) like ChatGPT to predict stock price movements using news headlines.
We develop a theoretical model incorporating information capacity constraints, underreaction, limits-to-arbitrage, and LLMs.
arXiv Detail & Related papers (2023-04-15T19:22:37Z) - Data Debugging with Shapley Importance over End-to-End Machine Learning
Pipelines [27.461398584509755]
DataScope is the first system that efficiently computes Shapley values of training examples over an end-to-end machine learning pipeline.
Our results show that DataScope is up to four orders of magnitude faster than state-of-the-art Monte Carlo-based methods.
arXiv Detail & Related papers (2022-04-23T19:29:23Z) - A Survey on Large-scale Machine Learning [67.6997613600942]
Machine learning can provide deep insights into data, allowing machines to make high-quality predictions.
Most sophisticated machine learning approaches suffer from huge time costs when operating on large-scale data.
Large-scale Machine Learning aims to learn patterns from big data with comparable performance efficiently.
arXiv Detail & Related papers (2020-08-10T06:07:52Z) - VAE-LIME: Deep Generative Model Based Approach for Local Data-Driven
Model Interpretability Applied to the Ironmaking Industry [70.10343492784465]
It is necessary to expose to the process engineer, not solely the model predictions, but also their interpretability.
Model-agnostic local interpretability solutions based on LIME have recently emerged to improve the original method.
We present in this paper a novel approach, VAE-LIME, for local interpretability of data-driven models forecasting the temperature of the hot metal produced by a blast furnace.
arXiv Detail & Related papers (2020-07-15T07:07:07Z) - How Much Can A Retailer Sell? Sales Forecasting on Tmall [20.51263243320733]
We study the case of retailers' sales forecasting on Tmall|the world's leading online B2C platform.
We design two mechanisms for sales forecasting, i.e., seasonality extraction and distribution transformation.
arXiv Detail & Related papers (2020-02-27T06:41:00Z) - Profit-oriented sales forecasting: a comparison of forecasting
techniques from a business perspective [3.613072342189595]
This paper compares a large array of techniques for 35 times series that consist of both industry data from the Coca-Cola Company and publicly available datasets.
It introduces a novel and completely automated profit-driven approach that takes into account the expected profit that a technique can create during both the model building and evaluation process.
arXiv Detail & Related papers (2020-02-03T14:50:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.