A Twin Neural Model for Uplift
- URL: http://arxiv.org/abs/2105.05146v1
- Date: Tue, 11 May 2021 16:02:39 GMT
- Title: A Twin Neural Model for Uplift
- Authors: Mouloud Belbahri, Olivier Gandouet, Alejandro Murua and Vahid Partovi
Nia
- Abstract summary: Uplift is a particular case of conditional treatment effect modeling.
We propose a new loss function defined by leveraging a connection with the Bayesian interpretation of the relative risk.
We show our proposed method is competitive with the state-of-the-art in simulation setting and on real data from large scale randomized experiments.
- Score: 59.38563723706796
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Uplift is a particular case of conditional treatment effect modeling. Such
models deal with cause-and-effect inference for a specific factor, such as a
marketing intervention or a medical treatment. In practice, these models are
built on individual data from randomized clinical trials where the goal is to
partition the participants into heterogeneous groups depending on the uplift.
Most existing approaches are adaptations of random forests for the uplift case.
Several split criteria have been proposed in the literature, all relying on
maximizing heterogeneity. However, in practice, these approaches are prone to
overfitting. In this work, we bring a new vision to uplift modeling. We propose
a new loss function defined by leveraging a connection with the Bayesian
interpretation of the relative risk. Our solution is developed for a specific
twin neural network architecture allowing to jointly optimize the marginal
probabilities of success for treated and control individuals. We show that this
model is a generalization of the uplift logistic interaction model. We modify
the stochastic gradient descent algorithm to allow for structured sparse
solutions. This helps training our uplift models to a great extent. We show our
proposed method is competitive with the state-of-the-art in simulation setting
and on real data from large scale randomized experiments.
Related papers
- Bridging the inference gap in Mutimodal Variational Autoencoders [6.246098300155483]
Multimodal Variational Autoencoders offer versatile and scalable methods for generating unobserved modalities from observed ones.
Recent models using mixturesof-experts aggregation suffer from theoretically grounded limitations that restrict their generation quality on complex datasets.
We propose a novel interpretable model able to learn both joint and conditional distributions without introducing mixture aggregation.
arXiv Detail & Related papers (2025-02-06T10:43:55Z) - GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation [1.0808810256442274]
We learn semantically comprehensive yet compact latent representations of the (image, mask) space.
We show that our approach can effectively synthesise unseen high-quality paired segmentation data of remarkable semantic coherence.
arXiv Detail & Related papers (2025-01-18T16:40:53Z) - Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis [55.959002385347645]
Scaling by training on large datasets has been shown to enhance the quality and fidelity of image generation and manipulation with diffusion models.
Latent Drifting enables diffusion models to be conditioned for medical images fitted for the complex task of counterfactual image generation.
Our results demonstrate significant performance gains in various scenarios when combined with different fine-tuning schemes.
arXiv Detail & Related papers (2024-12-30T01:59:34Z) - Protein Design with Guided Discrete Diffusion [67.06148688398677]
A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling.
We propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models.
NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods.
arXiv Detail & Related papers (2023-05-31T16:31:24Z) - Deep Variational Lesion-Deficit Mapping [0.3914676152740142]
We introduce a comprehensive framework for lesion-deficit model comparison.
We show that our model outperforms established methods by a substantial margin across all simulation scenarios.
Our analysis justifies the widespread adoption of this approach.
arXiv Detail & Related papers (2023-05-27T13:49:35Z) - Bayesian Additive Main Effects and Multiplicative Interaction Models
using Tensor Regression for Multi-environmental Trials [0.0]
We propose a Bayesian tensor regression model to accommodate the effect of multiple factors on phenotype prediction.
We adopt a set of prior distributions that resolve identifiability issues that may arise between the parameters in the model.
We explore the applicability of our model by analysing real-world data related to wheat production across Ireland from 2010 to 2019.
arXiv Detail & Related papers (2023-01-09T19:54:50Z) - Estimation of Bivariate Structural Causal Models by Variational Gaussian
Process Regression Under Likelihoods Parametrised by Normalising Flows [74.85071867225533]
Causal mechanisms can be described by structural causal models.
One major drawback of state-of-the-art artificial intelligence is its lack of explainability.
arXiv Detail & Related papers (2021-09-06T14:52:58Z) - Deep Variational Models for Collaborative Filtering-based Recommender
Systems [63.995130144110156]
Deep learning provides accurate collaborative filtering models to improve recommender system results.
Our proposed models apply the variational concept to injectity in the latent space of the deep architecture.
Results show the superiority of the proposed approach in scenarios where the variational enrichment exceeds the injected noise effect.
arXiv Detail & Related papers (2021-07-27T08:59:39Z) - Adapting Neural Networks for Uplift Models [0.0]
Uplift is estimated using either i) conditional mean regression or ii) transformed outcome regression.
Most existing approaches are adaptations of classification and regression trees for the uplift case.
Here we propose a new method using neural networks.
arXiv Detail & Related papers (2020-10-30T18:42:56Z) - Robust Finite Mixture Regression for Heterogeneous Targets [70.19798470463378]
We propose an FMR model that finds sample clusters and jointly models multiple incomplete mixed-type targets simultaneously.
We provide non-asymptotic oracle performance bounds for our model under a high-dimensional learning framework.
The results show that our model can achieve state-of-the-art performance.
arXiv Detail & Related papers (2020-10-12T03:27:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.