RAxSS: Retrieval-Augmented Sparse Sampling for Explainable Variable-Length Medical Time Series Classification
- URL: http://arxiv.org/abs/2510.02936v1
- Date: Fri, 03 Oct 2025 12:26:19 GMT
- Title: RAxSS: Retrieval-Augmented Sparse Sampling for Explainable Variable-Length Medical Time Series Classification
- Authors: Aydin Javadov, Samir Garibov, Tobias Hoesli, Qiyang Sun, Florian von Wangenheim, Joseph Ollier, Björn W. Schuller,
- Abstract summary: We generalize sparse sampling framework for retrieval-informed classification.<n>We aggregate predictions by within-channel similarity and aggregate them in probability space, convex series-level scores and an explicit evidence trail for explainability.<n>Our method achieves competitive iEEG classification performance and provides practitioners with greater transparency and explainability.
- Score: 38.39105016249814
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Medical time series analysis is challenging due to data sparsity, noise, and highly variable recording lengths. Prior work has shown that stochastic sparse sampling effectively handles variable-length signals, while retrieval-augmented approaches improve explainability and robustness to noise and weak temporal correlations. In this study, we generalize the stochastic sparse sampling framework for retrieval-informed classification. Specifically, we weight window predictions by within-channel similarity and aggregate them in probability space, yielding convex series-level scores and an explicit evidence trail for explainability. Our method achieves competitive iEEG classification performance and provides practitioners with greater transparency and explainability. We evaluate our method in iEEG recordings collected in four medical centers, demonstrating its potential for reliable and explainable clinical variable-length time series classification.
Related papers
- Combating Noisy Labels through Fostering Self- and Neighbor-Consistency [120.4394402099635]
Label noise is pervasive in various real-world scenarios, posing challenges in supervised deep learning.<n>We propose a noise-robust method named Jo-SNC (textbfJoint sample selection and model regularization based on textbfSelf- and textbfNeighbor-textbfConsistency)<n>We design a self-adaptive, data-driven thresholding scheme to adjust per-class selection thresholds.
arXiv Detail & Related papers (2026-01-19T07:55:29Z) - Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification [0.6015898117103067]
We propose a standardized framework for automatic tuberculosis (TB) detection from cough audio and routinely collected clinical data using machine learning.<n>Existing studies vary substantially in datasets, cohort definitions, feature representations, model families, validation protocols, and reported metrics.<n>We establish a strong, well-documented baseline for TB prediction using cough recordings and accompanying clinical metadata from a recently compiled dataset from several countries.
arXiv Detail & Related papers (2026-01-12T20:04:05Z) - Spatial Reasoning with Denoising Models [49.83744014336816]
We introduce a framework to perform reasoning over sets of continuous variables via denoising generative models.<n>For the first time, that order of generation can successfully be predicted by the denoising network itself.<n>Using these findings, we can increase the accuracy of specific reasoning tasks from 1% to >50%.
arXiv Detail & Related papers (2025-02-28T14:08:30Z) - Stochastic Sparse Sampling: A Framework for Variable-Length Medical Time Series Classification [9.474649136535705]
We propose $textbfS$tochastic $textbfS$parse $textbfS$ampling (SSS), a novel VTSC framework developed for medical time series.
SSS manages variable-length sequences by sparsely sampling fixed windows to compute local predictions, which are then aggregated and calibrated to form a global prediction.
We evaluate our method on the Epilepsy iEEG Multicenter dataset, a heterogeneous collection of intracranial electroencephalography (iEEG) recordings obtained from four independent medical centers.
arXiv Detail & Related papers (2024-10-08T22:48:39Z) - Disentangled Noisy Correspondence Learning [56.06801962154915]
Cross-modal retrieval is crucial in understanding latent correspondences across modalities.
DisNCL is a novel information-theoretic framework for feature Disentanglement in Noisy Correspondence Learning.
arXiv Detail & Related papers (2024-08-10T09:49:55Z) - FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms [60.195642571004804]
We introduce FlowSDF, an image-guided conditional flow matching framework, to represent an implicit distribution of segmentation masks.<n>Our framework enables accurate sampling of segmentation masks and the computation of relevant statistical measures.
arXiv Detail & Related papers (2024-05-28T11:47:12Z) - Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation [63.180725016463974]
Cross-modal retrieval relies on well-matched large-scale datasets that are laborious in practice.
We introduce a novel noisy correspondence learning framework, namely textbfSelf-textbfReinforcing textbfErrors textbfMitigation (SREM)
arXiv Detail & Related papers (2023-12-27T09:03:43Z) - Learning Beyond Similarities: Incorporating Dissimilarities between
Positive Pairs in Self-Supervised Time Series Learning [4.2807943283312095]
This paper pioneers an SSL approach that transcends mere similarities by integrating dissimilarities among positive pairs.
The framework is applied to electrocardiogram (ECG) signals, leading to a notable enhancement of +10% in the detection accuracy of Atrial Fibrillation (AFib) across diverse subjects.
arXiv Detail & Related papers (2023-09-14T08:49:35Z) - Accounting For Informative Sampling When Learning to Forecast Treatment
Outcomes Over Time [66.08455276899578]
We show that informative sampling can prohibit accurate estimation of treatment outcomes if not properly accounted for.
We present a general framework for learning treatment outcomes in the presence of informative sampling using inverse intensity-weighting.
We propose a novel method, TESAR-CDE, that instantiates this framework using Neural CDEs.
arXiv Detail & Related papers (2023-06-07T08:51:06Z) - ShapeWordNet: An Interpretable Shapelet Neural Network for Physiological
Signal Classification [16.82411861562806]
We propose a more effective and interpretable scheme tailored for the physiological signal classification task.
We exploit the time series shapelets to extract prominent local patterns and perform interpretable sequence discretization.
We name our method as ShapeWordNet and conduct extensive experiments on three real-world datasets to investigate its effectiveness.
arXiv Detail & Related papers (2023-02-10T02:30:31Z) - Undersampling and Cumulative Class Re-decision Methods to Improve
Detection of Agitation in People with Dementia [16.949993123698345]
Agitation is one of the most prevalent symptoms in people with dementia (PwD)
In a previous study, we collected multimodal wearable sensor data from 17 participants for 600 days and developed machine learning models for detecting agitation in one-minute windows.
In this paper, we first implemented different undersampling methods to eliminate the imbalance problem, and came to the conclusion that only 20% of normal behaviour data were adequate to train a competitive agitation detection model.
arXiv Detail & Related papers (2023-02-07T03:14:00Z) - Segmentation-Consistent Probabilistic Lesion Counting [3.6513059119482145]
Lesion counts are important indicators of disease severity, patient prognosis, and treatment efficacy, yet counting as a task in medical imaging is often overlooked in favor of segmentation.
This work introduces a novel continuously differentiable function that maps lesion segmentation predictions to lesion count probability distributions in a consistent manner.
arXiv Detail & Related papers (2022-04-11T17:26:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.