Human-like visual computing advances explainability and few-shot learning in deep neural networks for complex physiological data
- URL: http://arxiv.org/abs/2512.22349v1
- Date: Fri, 26 Dec 2025 19:19:59 GMT
- Title: Human-like visual computing advances explainability and few-shot learning in deep neural networks for complex physiological data
- Authors: Alaa Alahmadi, Mohamed Hasan
- Abstract summary: We show that a perception-informed pseudo-colouring technique can improve both explainability and few-shot learning in deep neural networks. We focus on acquired, drug-induced long QT syndrome (LQTS) as a challenging case study. By encoding clinically salient temporal features, such as QT-interval duration, into structured colour representations, models learn discriminative and interpretable features from as few as one or five training examples.
- Score: 0.34376560669160394
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine vision models, particularly deep neural networks, are increasingly applied to physiological signal interpretation, including electrocardiography (ECG), yet they typically require large training datasets and offer limited insight into the causal features underlying their predictions. This lack of data efficiency and interpretability constrains their clinical reliability and alignment with human reasoning. Here, we show that a perception-informed pseudo-colouring technique, previously demonstrated to enhance human ECG interpretation, can improve both explainability and few-shot learning in deep neural networks analysing complex physiological data. We focus on acquired, drug-induced long QT syndrome (LQTS) as a challenging case study characterised by heterogeneous signal morphology, variable heart rate, and scarce positive cases associated with life-threatening arrhythmias such as torsades de pointes. This setting provides a stringent test of model generalisation under extreme data scarcity. By encoding clinically salient temporal features, such as QT-interval duration, into structured colour representations, models learn discriminative and interpretable features from as few as one or five training examples. Using prototypical networks and a ResNet-18 architecture, we evaluate one-shot and few-shot learning on ECG images derived from single cardiac cycles and full 10-second rhythms. Explainability analyses show that pseudo-colouring guides attention toward clinically meaningful ECG features while suppressing irrelevant signal components. Aggregating multiple cardiac cycles further improves performance, mirroring human perceptual averaging across heartbeats. Together, these findings demonstrate that human-like perceptual encoding can bridge data efficiency, explainability, and causal reasoning in medical machine intelligence.
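The one-shot and few-shot classification step mentioned in the abstract can be illustrated with a minimal sketch of the prototypical-network decision rule: each class prototype is the mean of its support embeddings, and a query is assigned to the nearest prototype. This is not the authors' implementation. The 2-D vectors stand in for embeddings that a backbone such as ResNet-18 would produce, and the function names and the labels "normal" and "long_qt" are hypothetical.

```python
from collections import defaultdict


def class_prototypes(support_embeddings, support_labels):
    """Average the support embeddings of each class into one prototype."""
    grouped = defaultdict(list)
    for embedding, label in zip(support_embeddings, support_labels):
        grouped[label].append(embedding)
    # zip(*embeddings) iterates over dimensions, so each prototype is a per-dimension mean.
    return {
        label: [sum(column) / len(embeddings) for column in zip(*embeddings)]
        for label, embeddings in grouped.items()
    }


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def classify(query_embedding, prototypes):
    """Return the label whose prototype is closest to the query embedding."""
    return min(
        prototypes,
        key=lambda label: squared_distance(query_embedding, prototypes[label]),
    )


# Toy two-shot example; the embeddings and labels are illustrative assumptions.
support = [[0.0, 0.0], [0.2, 0.0], [1.0, 1.0], [1.0, 0.8]]
labels = ["normal", "normal", "long_qt", "long_qt"]
prototypes = class_prototypes(support, labels)
prediction = classify([0.9, 1.0], prototypes)
```

With a single support example per class, the prototype is that example's embedding, which corresponds to the one-shot setting.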
Related papers
- BEAT-Net: Injecting Biomimetic Spatio-Temporal Priors for Interpretable ECG Classification [1.3909285316906435]
BEAT-Net is a Biomimetic ECG Analysis with Tokenization framework. It decomposes cardiac physiology through specialized encoders that extract local beat morphology. It exhibits exceptional data efficiency, recovering fully supervised performance using only 30 to 35 percent of annotated data.
arXiv Detail & Related papers (2026-01-12T08:37:47Z)
- AICRN: Attention-Integrated Convolutional Residual Network for Interpretable Electrocardiogram Analysis [0.4077139177290857]
This work proposes a novel deep learning architecture called the attention-integrated convolutional residual network (AICRN) to regress key ECG parameters. Our architecture is specially designed with spatial and channel attention-related mechanisms to address the type and spatial location of the ECG features for regression. The designed system addresses traditional analysis challenges, such as loss of focus due to human errors, and facilitates the fast and easy detection of cardiac events.
arXiv Detail & Related papers (2025-08-16T21:10:45Z)
- ArrhythmiaVision: Resource-Conscious Deep Learning Models with Visual Explanations for ECG Arrhythmia Classification [0.0]
We propose ArrhythmiNet V1 and V2, optimized for efficient, real-time arrhythmia classification on edge devices. Inspired by MobileNet's depthwise separable convolutional design, these models maintain memory footprints of just 302.18 KB and 157.76 KB, respectively. Our findings demonstrate the feasibility of combining interpretability, predictive accuracy, and computational efficiency in practical, wearable, and embedded ECG monitoring systems.
arXiv Detail & Related papers (2025-04-30T18:22:45Z)
- Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks [59.38765771221084]
We present a physiologically inspired speech recognition architecture compatible and scalable with deep learning frameworks.
We show end-to-end gradient descent training leads to the emergence of neural oscillations in the central spiking neural network.
Our findings highlight the crucial inhibitory role of feedback mechanisms, such as spike frequency adaptation and recurrent connections, in regulating and synchronising neural activity to improve recognition performance.
arXiv Detail & Related papers (2024-04-22T09:40:07Z)
- GeoECG: Data Augmentation via Wasserstein Geodesic Perturbation for Robust Electrocardiogram Prediction [20.8603653664403]
We propose a physiologically-inspired data augmentation method to improve performance and increase the robustness of heart disease detection based on ECG signals.
We obtain augmented samples by perturbing the data distribution towards other classes along the geodesic in Wasserstein space.
Learning from 12-lead ECG signals, our model is able to distinguish five categories of cardiac conditions.
arXiv Detail & Related papers (2022-08-02T03:14:13Z)
- Data-driven emergence of convolutional structure in neural networks [83.4920717252233]
We show how fully-connected neural networks solving a discrimination task can learn a convolutional structure directly from their inputs.
By carefully designing data models, we show that the emergence of this pattern is triggered by the non-Gaussian, higher-order local structure of the inputs.
arXiv Detail & Related papers (2022-02-01T17:11:13Z)
- A complex network approach to time series analysis with application in diagnosis of neuromuscular disorders [1.9659095632676098]
This paper proposes a new approach to network development named GraphTS to overcome the limited accuracy of existing methods.
For this purpose, EMG signals are pre-processed and mapped to a complex network by a standard visibility graph algorithm.
The resulting networks can differentiate between healthy and patient samples.
arXiv Detail & Related papers (2021-08-16T06:44:48Z)
- Functional Magnetic Resonance Imaging data augmentation through conditional ICA [44.483210864902304]
We introduce Conditional Independent Components Analysis (Conditional ICA): a fast functional Magnetic Resonance Imaging (fMRI) data augmentation technique.
We show that Conditional ICA is successful at synthesizing data indistinguishable from observations, and that it yields gains in classification accuracy in brain decoding problems.
arXiv Detail & Related papers (2021-07-11T22:36:14Z)
- EEG-based Cross-Subject Driver Drowsiness Recognition with an Interpretable Convolutional Neural Network [0.0]
We develop a novel convolutional neural network combined with an interpretation technique that allows sample-wise analysis of important features for classification.
Results show that the model achieves an average accuracy of 78.35% on 11 subjects for leave-one-out cross-subject recognition.
arXiv Detail & Related papers (2021-05-30T14:47:20Z)
- Uncovering the structure of clinical EEG signals with self-supervised learning [64.4754948595556]
Supervised learning paradigms are often limited by the amount of labeled data that is available.
This phenomenon is particularly problematic in clinically relevant data, such as electroencephalography (EEG).
By extracting information from unlabeled data, it might be possible to reach competitive performance with deep neural networks.
arXiv Detail & Related papers (2020-07-31T14:34:47Z)
- Video-based Remote Physiological Measurement via Cross-verified Feature Disentangling [121.50704279659253]
We propose a cross-verified feature disentangling strategy to disentangle the physiological features with non-physiological representations.
We then use the distilled physiological features for robust multi-task physiological measurements.
The disentangled features are finally used for the joint prediction of multiple physiological signals, such as average HR values and rPPG signals.
arXiv Detail & Related papers (2020-07-16T09:39:17Z)
- Learning Dynamic and Personalized Comorbidity Networks from Event Data using Deep Diffusion Processes [102.02672176520382]
Comorbid diseases co-occur and progress via complex temporal patterns that vary among individuals.
In electronic health records we can observe the different diseases a patient has, but can only infer the temporal relationship between each co-morbid condition.
We develop deep diffusion processes to model "dynamic comorbidity networks".
arXiv Detail & Related papers (2020-01-08T15:47:08Z)