Affordable EEG, Actionable Insights: An Open Dataset and Evaluation Framework for Epilepsy Patient Stratification
- URL: http://arxiv.org/abs/2511.01879v1
- Date: Wed, 22 Oct 2025 15:25:05 GMT
- Title: Affordable EEG, Actionable Insights: An Open Dataset and Evaluation Framework for Epilepsy Patient Stratification
- Authors: HM Shadman Tabib, Md. Hasnaen Adil, Ayesha Rahman, Ahmmad Nur Swapnil, Maoyejatun Hasana, Ahmed Hossain Chowdhury, A. B. M. Alim Al Islam,
- Abstract summary: We present NEUROSKY-EPI, the first open dataset of single-channel, consumer-grade EEG for epilepsy.<n>To explore its utility, we introduce EmbedCluster, a patient-stratification pipeline.<n>Results show that low-cost, single-channel data can support meaningful stratification.
- Score: 2.879398564096746
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Access to clinical multi-channel EEG remains limited in many regions worldwide. We present NEUROSKY-EPI, the first open dataset of single-channel, consumer-grade EEG for epilepsy, collected in a South Asian clinical setting along with rich contextual metadata. To explore its utility, we introduce EmbedCluster, a patient-stratification pipeline that transfers representations from EEGNet models trained on clinical data and enriches them with contextual autoencoder embeddings, followed by unsupervised clustering of patients based on EEG patterns. Results show that low-cost, single-channel data can support meaningful stratification. Beyond algorithmic performance, we emphasize human-centered concerns such as deployability in resource-constrained environments, interpretability for non-specialists, and safeguards for privacy, inclusivity, and bias. By releasing the dataset and code, we aim to catalyze interdisciplinary research across health technology, human-computer interaction, and machine learning, advancing the goal of affordable and actionable EEG-based epilepsy care.
Related papers
- Omni-iEEG: A Large-Scale, Comprehensive iEEG Dataset and Benchmark for Epilepsy Research [4.4167069169736655]
$textbf Omni-iEEG is a large-scale, pre-surgical iEEG resource comprising $textbf302 patients$ and $textbf178 hours$ of high-resolution recordings.<n>It includes harmonized clinical metadata such as seizure onset zones, resections, and surgical outcomes, validated by board-certified epileptologists.<n>It defines clinically meaningful tasks with unified evaluation metrics grounded in clinical priors, enabling systematic evaluation of models in clinically relevant settings.
arXiv Detail & Related papers (2026-02-17T22:55:11Z) - LookAroundNet: Extending Temporal Context with Transformers for Clinically Viable EEG Seizure Detection [0.0]
LookAroundNet is a transformer-based seizure detector that uses a wider temporal window of EEG data to model seizure activity.<n>We evaluate the proposed method on multiple EEG datasets spanning diverse clinical environments.
arXiv Detail & Related papers (2026-01-09T18:52:24Z) - Integrating Genomics into Multimodal EHR Foundation Models [56.31910745104141]
This paper introduces an innovative EHR foundation model that integrates Polygenic Risk Scores (PRS) as a foundational data modality.<n>The framework aims to learn complex relationships between clinical data and genetic predispositions.<n>This approach is pivotal for unlocking new insights into disease prediction, proactive health management, risk stratification, and personalized treatment strategies.
arXiv Detail & Related papers (2025-10-24T15:56:40Z) - Mitigating Clinician Information Overload: Generative AI for Integrated EHR and RPM Data Analysis [0.523377539745706]
We present a comprehensive overview of the capabilities, requirements and applications of Generative Artificial Intelligence (GenAI)<n>We first provide some background on the forms and sources of patient data, namely real-time Remote Patient Monitoring ( RPM) streams and traditional Electronic Health Records ( EHRs)<n>These applications can enhance navigation of longitudinal patient data and provide actionable clinical decision support through natural language dialogue.
arXiv Detail & Related papers (2025-08-26T17:10:21Z) - A Comprehensive Review of Datasets for Clinical Mental Health AI Systems [55.67299586253951]
We present the first comprehensive survey of clinical mental health datasets relevant to the training and development of AI-powered clinical assistants.<n>Our survey identifies critical gaps such as a lack of longitudinal data, limited cultural and linguistic representation, inconsistent collection and annotation standards, and a lack of modalities in synthetic data.
arXiv Detail & Related papers (2025-08-13T13:42:35Z) - Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models [70.64969663547703]
AdaCVD is an adaptable CVD risk prediction framework built on large language models extensively fine-tuned on over half a million participants from the UK Biobank.<n>It addresses key clinical challenges across three dimensions: it flexibly incorporates comprehensive yet variable patient information; it seamlessly integrates both structured data and unstructured text; and it rapidly adapts to new patient populations using minimal additional data.
arXiv Detail & Related papers (2025-05-30T14:42:02Z) - Explainable AI for Classifying UTI Risk Groups Using a Real-World Linked EHR and Pathology Lab Dataset [0.47517735516852333]
We leverage a linked EHR dataset to characterize urinary tract infections (UTIs)<n>We introduce a UTI risk estimation framework informed by clinical expertise to estimate UTI risk across individual patient timelines.<n>Our findings reveal differences in clinical and demographic predictors across risk groups.
arXiv Detail & Related papers (2024-11-26T18:10:51Z) - Polar-Net: A Clinical-Friendly Model for Alzheimer's Disease Detection
in OCTA Images [53.235117594102675]
Optical Coherence Tomography Angiography is a promising tool for detecting Alzheimer's disease (AD) by imaging the retinal microvasculature.
We propose a novel deep-learning framework called Polar-Net to provide interpretable results and leverage clinical prior knowledge.
We show that Polar-Net outperforms existing state-of-the-art methods and provides more valuable pathological evidence for the association between retinal vascular changes and AD.
arXiv Detail & Related papers (2023-11-10T11:49:49Z) - fMRI from EEG is only Deep Learning away: the use of interpretable DL to
unravel EEG-fMRI relationships [68.8204255655161]
We present an interpretable domain grounded solution to recover the activity of several subcortical regions from multichannel EEG data.
We recover individual spatial and time-frequency patterns of scalp EEG predictive of the hemodynamic signal in the subcortical nuclei.
arXiv Detail & Related papers (2022-10-23T15:11:37Z) - Uncovering the structure of clinical EEG signals with self-supervised
learning [64.4754948595556]
Supervised learning paradigms are often limited by the amount of labeled data that is available.
This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG)
By extracting information from unlabeled data, it might be possible to reach competitive performance with deep neural networks.
arXiv Detail & Related papers (2020-07-31T14:34:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.