A hypothesis-driven method based on machine learning for neuroimaging
data analysis
- URL: http://arxiv.org/abs/2202.04397v1
- Date: Wed, 9 Feb 2022 11:13:02 GMT
- Title: A hypothesis-driven method based on machine learning for neuroimaging
data analysis
- Authors: JM Gorriz, R. Martin-Clemente, C.G. Puntonet, A. Ortiz, J. Ramirez and
J. Suckling
- Abstract summary: Machine learning approaches for discrimination of spatial patterns of brain images have limited their operation to feature extraction and linear classification tasks.
We show that the estimation of the conventional General linear Model (GLM) has been connected to an univariate classification task.
We derive a refined statistical test with the GLM based on the parameters obtained by a linear Support Vector Regression (SVR) in the emphinverse problem (SVR-iGLM)
Using real data from a multisite initiative the proposed MLE-based inference demonstrates statistical power and the control of false positives, outperforming the regular G
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: There remains an open question about the usefulness and the interpretation of
Machine learning (MLE) approaches for discrimination of spatial patterns of
brain images between samples or activation states. In the last few decades,
these approaches have limited their operation to feature extraction and linear
classification tasks for between-group inference. In this context, statistical
inference is assessed by randomly permuting image labels or by the use of
random effect models that consider between-subject variability. These
multivariate MLE-based statistical pipelines, whilst potentially more effective
for detecting activations than hypotheses-driven methods, have lost their
mathematical elegance, ease of interpretation, and spatial localization of the
ubiquitous General linear Model (GLM). Recently, the estimation of the
conventional GLM has been demonstrated to be connected to an univariate
classification task when the design matrix is expressed as a binary indicator
matrix. In this paper we explore the complete connection between the univariate
GLM and MLE \emph{regressions}. To this purpose we derive a refined statistical
test with the GLM based on the parameters obtained by a linear Support Vector
Regression (SVR) in the \emph{inverse} problem (SVR-iGLM). Subsequently, random
field theory (RFT) is employed for assessing statistical significance following
a conventional GLM benchmark. Experimental results demonstrate how parameter
estimations derived from each model (mainly GLM and SVR) result in different
experimental design estimates that are significantly related to the predefined
functional task. Moreover, using real data from a multisite initiative the
proposed MLE-based inference demonstrates statistical power and the control of
false positives, outperforming the regular GLM.
Related papers
- Interpetable Target-Feature Aggregation for Multi-Task Learning based on Bias-Variance Analysis [53.38518232934096]
Multi-task learning (MTL) is a powerful machine learning paradigm designed to leverage shared knowledge across tasks to improve generalization and performance.
We propose an MTL approach at the intersection between task clustering and feature transformation based on a two-phase iterative aggregation of targets and features.
In both phases, a key aspect is to preserve the interpretability of the reduced targets and features through the aggregation with the mean, which is motivated by applications to Earth science.
arXiv Detail & Related papers (2024-06-12T08:30:16Z) - Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning [50.84938730450622]
We propose a trajectory-based method TV score, which uses trajectory volatility for OOD detection in mathematical reasoning.
Our method outperforms all traditional algorithms on GLMs under mathematical reasoning scenarios.
Our method can be extended to more applications with high-density features in output spaces, such as multiple-choice questions.
arXiv Detail & Related papers (2024-05-22T22:22:25Z) - Statistical Agnostic Regression: a machine learning method to validate regression models [0.0]
We introduce Statistical Agnostic Regression (SAR) for evaluating the statistical significance of machine learning (ML)-based linear regression models.
We define a threshold that ensures there is sufficient evidence, with a probability of at least $1-eta$, to conclude the existence of a linear relationship in the population between the explanatory (feature) and the response (label) variables.
arXiv Detail & Related papers (2024-02-23T09:19:26Z) - Weakly supervised covariance matrices alignment through Stiefel matrices
estimation for MEG applications [64.20396555814513]
This paper introduces a novel domain adaptation technique for time series data, called Mixing model Stiefel Adaptation (MSA)
We exploit abundant unlabeled data in the target domain to ensure effective prediction by establishing pairwise correspondence with equivalent signal variances between domains.
MSA outperforms recent methods in brain-age regression with task variations using magnetoencephalography (MEG) signals from the Cam-CAN dataset.
arXiv Detail & Related papers (2024-01-24T19:04:49Z) - Simulation-based inference using surjective sequential neural likelihood
estimation [50.24983453990065]
Surjective Sequential Neural Likelihood estimation is a novel method for simulation-based inference.
By embedding the data in a low-dimensional space, SSNL solves several issues previous likelihood-based methods had when applied to high-dimensional data sets.
arXiv Detail & Related papers (2023-08-02T10:02:38Z) - Unsupervised representation learning with recognition-parametrised
probabilistic models [12.865596223775649]
We introduce a new approach to probabilistic unsupervised learning based on the recognition-parametrised model ( RPM)
Under the key assumption that observations are conditionally independent given latents, the RPM combines parametric prior observation-conditioned latent distributions with non-parametric observationfactors.
The RPM provides a powerful framework to discover meaningful latent structure underlying observational data, a function critical to both animal and artificial intelligence.
arXiv Detail & Related papers (2022-09-13T00:33:21Z) - MAML is a Noisy Contrastive Learner [72.04430033118426]
Model-agnostic meta-learning (MAML) is one of the most popular and widely-adopted meta-learning algorithms nowadays.
We provide a new perspective to the working mechanism of MAML and discover that: MAML is analogous to a meta-learner using a supervised contrastive objective function.
We propose a simple but effective technique, zeroing trick, to alleviate such interference.
arXiv Detail & Related papers (2021-06-29T12:52:26Z) - A connection between the pattern classification problem and the General
Linear Model for statistical inference [0.2320417845168326]
Both approaches, i.e. GLM and LRM, apply to different domains, the observation and the label domains.
We derive a statistical test based on a more refined predictive algorithm.
The MLE-based inference employs a residual score and includes the upper bound to compute a better estimation of the actual (real) error.
arXiv Detail & Related papers (2020-12-16T12:26:26Z) - Statistical control for spatio-temporal MEG/EEG source imaging with
desparsified multi-task Lasso [102.84915019938413]
Non-invasive techniques like magnetoencephalography (MEG) or electroencephalography (EEG) offer promise of non-invasive techniques.
The problem of source localization, or source imaging, poses however a high-dimensional statistical inference challenge.
We propose an ensemble of desparsified multi-task Lasso (ecd-MTLasso) to deal with this problem.
arXiv Detail & Related papers (2020-09-29T21:17:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.