Neural Unbalanced Optimal Transport via Cycle-Consistent Semi-Couplings
- URL: http://arxiv.org/abs/2209.15621v1
- Date: Fri, 30 Sep 2022 17:48:04 GMT
- Title: Neural Unbalanced Optimal Transport via Cycle-Consistent Semi-Couplings
- Authors: Frederike L\"ubeck, Charlotte Bunne, Gabriele Gut, Jacobo Sarabia del
Castillo, Lucas Pelkmans, David Alvarez-Melis
- Abstract summary: We introduce NubOT, a neural unbalanced OT formulation that relies on the formalism of semi-couplings to account for creation and destruction of mass.
We apply our method to the challenging task of forecasting heterogeneous responses of multiple cancer cell lines to various drugs.
- Score: 10.47175870857988
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Comparing unpaired samples of a distribution or population taken at different
points in time is a fundamental task in many application domains where
measuring populations is destructive and cannot be done repeatedly on the same
sample, such as in single-cell biology. Optimal transport (OT) can solve this
challenge by learning an optimal coupling of samples across distributions from
unpaired data. However, the usual formulation of OT assumes conservation of
mass, which is violated in unbalanced scenarios in which the population size
changes (e.g., cell proliferation or death) between measurements. In this work,
we introduce NubOT, a neural unbalanced OT formulation that relies on the
formalism of semi-couplings to account for creation and destruction of mass. To
estimate such semi-couplings and generalize out-of-sample, we derive an
efficient parameterization based on neural optimal transport maps and propose a
novel algorithmic scheme through a cycle-consistent training procedure. We
apply our method to the challenging task of forecasting heterogeneous responses
of multiple cancer cell lines to various drugs, where we observe that by
accurately modeling cell proliferation and death, our method yields notable
improvements over previous neural optimal transport methods.
Related papers
- Distributed Nonparametric Estimation: from Sparse to Dense Samples per Terminal [9.766173684831324]
We characterize the minimax optimal rates for all regimes, and identify phase transitions of the optimal rates as the samples per terminal vary from sparse to dense.
This fully solves the problem left open by previous works, whose scopes are limited to regimes with either dense samples or a single sample per terminal.
The optimal rates are immediate for various special cases such as density estimation, Gaussian, binary, Poisson and heteroskedastic regression models.
arXiv Detail & Related papers (2025-01-14T06:41:55Z) - Stochastic gradient descent estimation of generalized matrix factorization models with application to single-cell RNA sequencing data [41.94295877935867]
Single-cell RNA sequencing allows the quantitation of gene expression at the individual cell level.
Dimensionality reduction is a common preprocessing step to simplify the visualization, clustering, and phenotypic characterization of samples.
We present a generalized matrix factorization model assuming a general exponential dispersion family distribution.
We propose a scalable adaptive descent algorithm that allows us to estimate the model efficiently.
arXiv Detail & Related papers (2024-12-29T16:02:15Z) - Adaptive Sampling to Reduce Epistemic Uncertainty Using Prediction Interval-Generation Neural Networks [0.0]
This paper presents an adaptive sampling approach designed to reduce epistemic uncertainty in predictive models.
Our primary contribution is the development of a metric that estimates potential epistemic uncertainty.
A batch sampling strategy based on Gaussian processes (GPs) is also proposed.
We test our approach on three unidimensional synthetic problems and a multi-dimensional dataset based on an agricultural field for selecting experimental fertilizer rates.
arXiv Detail & Related papers (2024-12-13T21:21:47Z) - Dynamical Measure Transport and Neural PDE Solvers for Sampling [77.38204731939273]
We tackle the task of sampling from a probability density as transporting a tractable density function to the target.
We employ physics-informed neural networks (PINNs) to approximate the respective partial differential equations (PDEs) solutions.
PINNs allow for simulation- and discretization-free optimization and can be trained very efficiently.
arXiv Detail & Related papers (2024-07-10T17:39:50Z) - Multi-CATE: Multi-Accurate Conditional Average Treatment Effect Estimation Robust to Unknown Covariate Shifts [12.289361708127876]
We use methodology for learning multi-accurate predictors to post-process CATE T-learners.
We show how this approach can combine (large) confounded observational and (smaller) randomized datasets.
arXiv Detail & Related papers (2024-05-28T14:12:25Z) - Estimating Unknown Population Sizes Using the Hypergeometric Distribution [1.03590082373586]
We tackle the challenge of estimating discrete distributions when both the total population size and the sizes of its constituent categories are unknown.
We develop our approach to account for a data generating process where the ground-truth is a mixture of distributions conditional on a continuous latent variable.
Empirical data simulation demonstrates that our method outperforms other likelihood functions used to model count data.
arXiv Detail & Related papers (2024-02-22T01:53:56Z) - Learning to Re-weight Examples with Optimal Transport for Imbalanced
Classification [74.62203971625173]
Imbalanced data pose challenges for deep learning based classification models.
One of the most widely-used approaches for tackling imbalanced data is re-weighting.
We propose a novel re-weighting method based on optimal transport (OT) from a distributional point of view.
arXiv Detail & Related papers (2022-08-05T01:23:54Z) - Manifold Interpolating Optimal-Transport Flows for Trajectory Inference [64.94020639760026]
We present a method called Manifold Interpolating Optimal-Transport Flow (MIOFlow)
MIOFlow learns, continuous population dynamics from static snapshot samples taken at sporadic timepoints.
We evaluate our method on simulated data with bifurcations and merges, as well as scRNA-seq data from embryoid body differentiation, and acute myeloid leukemia treatment.
arXiv Detail & Related papers (2022-06-29T22:19:03Z) - Sampling-free Variational Inference for Neural Networks with
Multiplicative Activation Noise [51.080620762639434]
We propose a more efficient parameterization of the posterior approximation for sampling-free variational inference.
Our approach yields competitive results for standard regression problems and scales well to large-scale image classification tasks.
arXiv Detail & Related papers (2021-03-15T16:16:18Z) - Tracking disease outbreaks from sparse data with Bayesian inference [55.82986443159948]
The COVID-19 pandemic provides new motivation for estimating the empirical rate of transmission during an outbreak.
Standard methods struggle to accommodate the partial observability and sparse data common at finer scales.
We propose a Bayesian framework which accommodates partial observability in a principled manner.
arXiv Detail & Related papers (2020-09-12T20:37:33Z) - Improving Maximum Likelihood Training for Text Generation with Density
Ratio Estimation [51.091890311312085]
We propose a new training scheme for auto-regressive sequence generative models, which is effective and stable when operating at large sample space encountered in text generation.
Our method stably outperforms Maximum Likelihood Estimation and other state-of-the-art sequence generative models in terms of both quality and diversity.
arXiv Detail & Related papers (2020-07-12T15:31:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.