Reconstruction of Sentinel-2 Time Series Using Robust Gaussian Mixture
Models -- Application to the Detection of Anomalous Crop Development in wheat
and rapeseed crops
- URL: http://arxiv.org/abs/2110.11780v1
- Date: Fri, 22 Oct 2021 13:35:54 GMT
- Title: Reconstruction of Sentinel-2 Time Series Using Robust Gaussian Mixture
Models -- Application to the Detection of Anomalous Crop Development in wheat
and rapeseed crops
- Authors: Florian Mouret, Mohanad Albughdadi, Sylvie Duthoit, Denis Kouam\'e,
Guillaume Rieu, Jean-Yves Tourneret
- Abstract summary: Missing data is a recurrent problem in remote sensing, mainly due to cloud coverage for multispectral images and acquisition problems.
This paper proposes a Gaussian Mixture Model (GMM) for the reconstruction of parcel-level features extracted from multispectral images.
Additional features extracted from Synthetic Aperture Radar (SAR) images using Sentinel-1 data are also used to provide complementary information.
- Score: 14.849671539394272
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Missing data is a recurrent problem in remote sensing, mainly due to cloud
coverage for multispectral images and acquisition problems. This can be a
critical issue for crop monitoring, especially for applications relying on
machine learning techniques, which generally assume that the feature matrix
does not have missing values. This paper proposes a Gaussian Mixture Model
(GMM) for the reconstruction of parcel-level features extracted from
multispectral images. A robust version of the GMM is also investigated, since
datasets can be contaminated by inaccurate samples or features (e.g., wrong
crop type reported, inaccurate boundaries, undetected clouds, etc). Additional
features extracted from Synthetic Aperture Radar (SAR) images using Sentinel-1
data are also used to provide complementary information and improve the
imputations. The robust GMM investigated in this work assigns reduced weights
to the outliers during the estimation of the GMM parameters, which improves the
final reconstruction. These weights are computed at each step of an
Expectation-Maximization (EM) algorithm by using outlier scores provided by the
isolation forest algorithm. Experimental validation is conducted on rapeseed
and wheat parcels located in the Beauce region (France). Overall, we show that
the GMM imputation method outperforms other reconstruction strategies. A mean
absolute error (MAE) of 0.013 (resp. 0.019) is obtained for the imputation of
the median Normalized Difference Index (NDVI) of the rapeseed (resp. wheat)
parcels. Other indicators (e.g., Normalized Difference Water Index) and
statistics (for instance the interquartile range, which captures heterogeneity
among the parcel indicator) are reconstructed at the same time with good
accuracy. In a dataset contaminated by irrelevant samples, using the robust GMM
is recommended since the standard GMM imputation can lead to inaccurate imputed
values.
Related papers
- Sparse Variational Contaminated Noise Gaussian Process Regression with Applications in Geomagnetic Perturbations Forecasting [4.675221539472143]
We propose a scalable inference algorithm for fitting sparse Gaussian process regression models with contaminated normal noise on large datasets.
We show that our approach yields shorter prediction intervals for similar coverage and accuracy when compared to an artificial dense neural network baseline.
arXiv Detail & Related papers (2024-02-27T15:08:57Z) - Overlap-guided Gaussian Mixture Models for Point Cloud Registration [61.250516170418784]
Probabilistic 3D point cloud registration methods have shown competitive performance in overcoming noise, outliers, and density variations.
This paper proposes a novel overlap-guided probabilistic registration approach that computes the optimal transformation from matched Gaussian Mixture Model (GMM) parameters.
arXiv Detail & Related papers (2022-10-17T08:02:33Z) - Missing Data Imputation and Acquisition with Deep Hierarchical Models
and Hamiltonian Monte Carlo [2.666288135543677]
We present HH-VAEM, a Hierarchical VAE model for mixed-type incomplete data.
Our experiments show that HH-VAEM outperforms existing baselines in the tasks of missing data imputation, supervised learning and outlier identification.
We also present a sampling-based approach for efficiently computing the information gain when missing features are to be acquired with HH-VAEM.
arXiv Detail & Related papers (2022-02-09T17:50:52Z) - Continual Learning with Fully Probabilistic Models [70.3497683558609]
We present an approach for continual learning based on fully probabilistic (or generative) models of machine learning.
We propose a pseudo-rehearsal approach using a Gaussian Mixture Model (GMM) instance for both generator and classifier functionalities.
We show that GMR achieves state-of-the-art performance on common class-incremental learning problems at very competitive time and memory complexity.
arXiv Detail & Related papers (2021-04-19T12:26:26Z) - Overcoming Catastrophic Forgetting with Gaussian Mixture Replay [79.0660895390689]
We present a rehearsal-based approach for continual learning (CL) based on Gaussian Mixture Models (GMM)
We mitigate catastrophic forgetting (CF) by generating samples from previous tasks and merging them with current training data.
We evaluate GMR on multiple image datasets, which are divided into class-disjoint sub-tasks.
arXiv Detail & Related papers (2021-04-19T11:41:34Z) - Shaping Deep Feature Space towards Gaussian Mixture for Visual
Classification [74.48695037007306]
We propose a Gaussian mixture (GM) loss function for deep neural networks for visual classification.
With a classification margin and a likelihood regularization, the GM loss facilitates both high classification performance and accurate modeling of the feature distribution.
The proposed model can be implemented easily and efficiently without using extra trainable parameters.
arXiv Detail & Related papers (2020-11-18T03:32:27Z) - Statistical control for spatio-temporal MEG/EEG source imaging with
desparsified multi-task Lasso [102.84915019938413]
Non-invasive techniques like magnetoencephalography (MEG) or electroencephalography (EEG) offer promise of non-invasive techniques.
The problem of source localization, or source imaging, poses however a high-dimensional statistical inference challenge.
We propose an ensemble of desparsified multi-task Lasso (ecd-MTLasso) to deal with this problem.
arXiv Detail & Related papers (2020-09-29T21:17:16Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z) - Handling missing data in model-based clustering [0.0]
We propose two methods to fit Gaussian mixtures in the presence of missing data.
Both methods use a variant of the Monte Carlo Expectation-Maximisation algorithm for data augmentation.
We show that the proposed methods outperform the multiple imputation approach, both in terms of clusters identification and density estimation.
arXiv Detail & Related papers (2020-06-04T15:36:31Z) - Effective Learning of a GMRF Mixture Model [8.336315962271396]
We propose restricting the GMM to a Gaussian Markov Random Field Mixture Model (GMRF-MM)
When the sparsity pattern of each matrix is known, we propose an efficient optimization method for the Maximum Likelihood Estimate (MLE) of that matrix.
We show that our "debiasing" approach outperforms GLASSO in both the single-GMRF and the GMRF-MM cases.
arXiv Detail & Related papers (2020-05-18T19:00:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.