BIOT: Cross-data Biosignal Learning in the Wild
- URL: http://arxiv.org/abs/2305.10351v1
- Date: Wed, 10 May 2023 19:26:58 GMT
- Title: BIOT: Cross-data Biosignal Learning in the Wild
- Authors: Chaoqi Yang, M. Brandon Westover, Jimeng Sun
- Abstract summary: Current deep learning models for biosignals are typically specialized for specific datasets and clinical settings.
method model is versatile and applicable to various biosignal learning settings across different datasets.
- Score: 36.22753628246332
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Biological signals, such as electroencephalograms (EEG), play a crucial role
in numerous clinical applications, exhibiting diverse data formats and quality
profiles. Current deep learning models for biosignals are typically specialized
for specific datasets and clinical settings, limiting their broader
applicability. Motivated by the success of large language models in text
processing, we explore the development of foundational models that are trained
from multiple data sources and can be fine-tuned on different downstream
biosignal tasks.
To overcome the unique challenges associated with biosignals of various
formats, such as mismatched channels, variable sample lengths, and prevalent
missing values, we propose a Biosignal Transformer (\method). The proposed
\method model can enable cross-data learning with mismatched channels, variable
lengths, and missing values by tokenizing diverse biosignals into unified
"biosignal sentences". Specifically, we tokenize each channel into fixed-length
segments containing local signal features, flattening them to form consistent
"sentences". Channel embeddings and {\em relative} position embeddings are
added to preserve spatio-temporal features.
The \method model is versatile and applicable to various biosignal learning
settings across different datasets, including joint pre-training for larger
models. Comprehensive evaluations on EEG, electrocardiogram (ECG), and human
activity sensory signals demonstrate that \method outperforms robust baselines
in common settings and facilitates learning across multiple datasets with
different formats. Use CHB-MIT seizure detection task as an example, our
vanilla \method model shows 3\% improvement over baselines in balanced
accuracy, and the pre-trained \method models (optimized from other data
sources) can further bring up to 4\% improvements.
Related papers
- Nested Deep Learning Model Towards A Foundation Model for Brain Signal Data [12.98830356097411]
This paper introduces a novel Nested Deep Learning (NDL) framework designed to overcome limitations of current methods.
NDL applies a weighted combination of signals across all channels, ensuring adaptability to different channel setups.
The results show that NDL improves prediction accuracy, supports cross-modality data integration, and can be fine-tuned for various neurophysiological applications.
arXiv Detail & Related papers (2024-10-04T07:06:02Z) - Multi-Source and Test-Time Domain Adaptation on Multivariate Signals using Spatio-Temporal Monge Alignment [59.75420353684495]
Machine learning applications on signals such as computer vision or biomedical data often face challenges due to the variability that exists across hardware devices or session recordings.
In this work, we propose Spatio-Temporal Monge Alignment (STMA) to mitigate these variabilities.
We show that STMA leads to significant and consistent performance gains between datasets acquired with very different settings.
arXiv Detail & Related papers (2024-07-19T13:33:38Z) - Physics-informed and Unsupervised Riemannian Domain Adaptation for Machine Learning on Heterogeneous EEG Datasets [53.367212596352324]
We propose an unsupervised approach leveraging EEG signal physics.
We map EEG channels to fixed positions using field, source-free domain adaptation.
Our method demonstrates robust performance in brain-computer interface (BCI) tasks and potential biomarker applications.
arXiv Detail & Related papers (2024-03-07T16:17:33Z) - Joint Alignment of Multivariate Quasi-Periodic Functional Data Using
Deep Learning [0.0]
We present a novel method for joint alignment of multivariate quasi-periodic functions using deep neural networks.
Our proposed neural network uses a special activation of the output that builds on the unit simplex transformation.
We demonstrate our method on two simulated datasets and one real example, comprising data from 12-lead 10s electrocardiogram recordings.
arXiv Detail & Related papers (2023-11-14T10:09:40Z) - Multi-view self-supervised learning for multivariate variable-channel
time series [1.094320514634939]
We propose learning one encoder to operate on all input channels individually.
We then use a message passing neural network to extract a single representation across channels.
We show that our method, combined with the TS2Vec loss, outperforms all other methods in most settings.
arXiv Detail & Related papers (2023-07-13T19:03:06Z) - Decision Forest Based EMG Signal Classification with Low Volume Dataset
Augmented with Random Variance Gaussian Noise [51.76329821186873]
We produce a model that can classify six different hand gestures with a limited number of samples that generalizes well to a wider audience.
We appeal to a set of more elementary methods such as the use of random bounds on a signal, but desire to show the power these methods can carry in an online setting.
arXiv Detail & Related papers (2022-06-29T23:22:18Z) - Learning Signal-Agnostic Manifolds of Neural Fields [50.066449953522685]
We leverage neural fields to capture the underlying structure in image, shape, audio and cross-modal audiovisual domains.
We show that by walking across the underlying manifold of GEM, we may generate new samples in our signal domains.
arXiv Detail & Related papers (2021-11-11T18:57:40Z) - Discriminative Singular Spectrum Classifier with Applications on
Bioacoustic Signal Recognition [67.4171845020675]
We present a bioacoustic signal classifier equipped with a discriminative mechanism to extract useful features for analysis and classification efficiently.
Unlike current bioacoustic recognition methods, which are task-oriented, the proposed model relies on transforming the input signals into vector subspaces.
The validity of the proposed method is verified using three challenging bioacoustic datasets containing anuran, bee, and mosquito species.
arXiv Detail & Related papers (2021-03-18T11:01:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.