Nonlinear Distribution Regression for Remote Sensing Applications
- URL: http://arxiv.org/abs/2012.06377v1
- Date: Mon, 7 Dec 2020 22:04:43 GMT
- Title: Nonlinear Distribution Regression for Remote Sensing Applications
- Authors: Jose E. Adsuara, Adri\'an P\'erez-Suay, Jordi Mu\~noz-Mar\'i, Anna
Mateo-Sanchis, Maria Piles, Gustau Camps-Valls
- Abstract summary: In many remote sensing applications one wants to estimate variables or parameters of interest from observations.
Standard algorithms such as neural networks, random forests or Gaussian processes are readily available to relate to the two.
This paper introduces a nonlinear (kernel-based) method for distribution regression that solves the previous problems without making any assumption on the statistics of the grouped data.
- Score: 6.664736150040092
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In many remote sensing applications one wants to estimate variables or
parameters of interest from observations. When the target variable is available
at a resolution that matches the remote sensing observations, standard
algorithms such as neural networks, random forests or Gaussian processes are
readily available to relate the two. However, we often encounter situations
where the target variable is only available at the group level, i.e.
collectively associated to a number of remotely sensed observations. This
problem setting is known in statistics and machine learning as {\em multiple
instance learning} or {\em distribution regression}. This paper introduces a
nonlinear (kernel-based) method for distribution regression that solves the
previous problems without making any assumption on the statistics of the
grouped data. The presented formulation considers distribution embeddings in
reproducing kernel Hilbert spaces, and performs standard least squares
regression with the empirical means therein. A flexible version to deal with
multisource data of different dimensionality and sample sizes is also presented
and evaluated. It allows working with the native spatial resolution of each
sensor, avoiding the need of match-up procedures. Noting the large
computational cost of the approach, we introduce an efficient version via
random Fourier features to cope with millions of points and groups.
Related papers
- Decomposing Gaussians with Unknown Covariance [3.734088413551237]
We present a general algorithm that encompasses all previous decomposition approaches for Gaussian data as special cases.
It yields a new and more flexible alternative to sample splitting when $n>1$.
We apply these decompositions to the tasks of model selection and post-selection inference in settings where alternative strategies are unavailable.
arXiv Detail & Related papers (2024-09-17T18:56:08Z) - Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise [51.87307904567702]
Quantile regression is a leading approach for obtaining such intervals via the empirical estimation of quantiles in the distribution of outputs.
We propose Relaxed Quantile Regression (RQR), a direct alternative to quantile regression based interval construction that removes this arbitrary constraint.
We demonstrate that this added flexibility results in intervals with an improvement in desirable qualities.
arXiv Detail & Related papers (2024-06-05T13:36:38Z) - Estimating Unknown Population Sizes Using the Hypergeometric Distribution [1.03590082373586]
We tackle the challenge of estimating discrete distributions when both the total population size and the sizes of its constituent categories are unknown.
We develop our approach to account for a data generating process where the ground-truth is a mixture of distributions conditional on a continuous latent variable.
Empirical data simulation demonstrates that our method outperforms other likelihood functions used to model count data.
arXiv Detail & Related papers (2024-02-22T01:53:56Z) - Compound Batch Normalization for Long-tailed Image Classification [77.42829178064807]
We propose a compound batch normalization method based on a Gaussian mixture.
It can model the feature space more comprehensively and reduce the dominance of head classes.
The proposed method outperforms existing methods on long-tailed image classification.
arXiv Detail & Related papers (2022-12-02T07:31:39Z) - Learning versus Refutation in Noninteractive Local Differential Privacy [133.80204506727526]
We study two basic statistical tasks in non-interactive local differential privacy (LDP): learning and refutation.
Our main result is a complete characterization of the sample complexity of PAC learning for non-interactive LDP protocols.
arXiv Detail & Related papers (2022-10-26T03:19:24Z) - Unsupervised Anomaly and Change Detection with Multivariate
Gaussianization [8.508880949780893]
Anomaly detection is a challenging problem given the high-dimensionality of the data.
We propose an unsupervised method for detecting anomalies and changes in remote sensing images.
We show the efficiency of the method in experiments involving both anomaly detection and change detection.
arXiv Detail & Related papers (2022-04-12T10:52:33Z) - Equivariance Allows Handling Multiple Nuisance Variables When Analyzing
Pooled Neuroimaging Datasets [53.34152466646884]
In this paper, we show how bringing recent results on equivariant representation learning instantiated on structured spaces together with simple use of classical results on causal inference provides an effective practical solution.
We demonstrate how our model allows dealing with more than one nuisance variable under some assumptions and can enable analysis of pooled scientific datasets in scenarios that would otherwise entail removing a large portion of the samples.
arXiv Detail & Related papers (2022-03-29T04:54:06Z) - Random Forest Weighted Local Fréchet Regression with Random Objects [18.128663071848923]
We propose a novel random forest weighted local Fr'echet regression paradigm.
Our first method uses these weights as the local average to solve the conditional Fr'echet mean.
Second method performs local linear Fr'echet regression, both significantly improving existing Fr'echet regression methods.
arXiv Detail & Related papers (2022-02-10T09:10:59Z) - Unrolling Particles: Unsupervised Learning of Sampling Distributions [102.72972137287728]
Particle filtering is used to compute good nonlinear estimates of complex systems.
We show in simulations that the resulting particle filter yields good estimates in a wide range of scenarios.
arXiv Detail & Related papers (2021-10-06T16:58:34Z) - Flexible Model Aggregation for Quantile Regression [92.63075261170302]
Quantile regression is a fundamental problem in statistical learning motivated by a need to quantify uncertainty in predictions.
We investigate methods for aggregating any number of conditional quantile models.
All of the models we consider in this paper can be fit using modern deep learning toolkits.
arXiv Detail & Related papers (2021-02-26T23:21:16Z) - An Embedded Model Estimator for Non-Stationary Random Functions using
Multiple Secondary Variables [0.0]
This paper introduces the method and shows that it has consistency results that are similar in nature to those applying to geostatistical modelling and to Quantile Random Forests.
The algorithm works by estimating a conditional distribution for the target variable at each target location.
arXiv Detail & Related papers (2020-11-09T00:14:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.