Tree-Based Learning on Amperometric Time Series Data Demonstrates High
Accuracy for Classification
- URL: http://arxiv.org/abs/2302.02650v1
- Date: Mon, 6 Feb 2023 09:44:53 GMT
- Title: Tree-Based Learning on Amperometric Time Series Data Demonstrates High
Accuracy for Classification
- Authors: Jeyashree Krishnan, Zeyu Lian, Pieter E. Oomen, Xiulan He, Soodabeh
Majdi, Andreas Schuppert, Andrew Ewing
- Abstract summary: We present a universal method for the classification with respect to diverse amperometric datasets using data-driven approaches in computational science.
We demonstrate a very high prediction accuracy (greater than or equal to 95%)
This is one of the first studies that propose a scheme for machine learning, and in particular, supervised learning on full amperometry time series data.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Elucidating exocytosis processes provide insights into cellular
neurotransmission mechanisms, and may have potential in neurodegenerative
diseases research. Amperometry is an established electrochemical method for the
detection of neurotransmitters released from and stored inside cells. An
important aspect of the amperometry method is the sub-millisecond temporal
resolution of the current recordings which leads to several hundreds of
gigabytes of high-quality data. In this study, we present a universal method
for the classification with respect to diverse amperometric datasets using
data-driven approaches in computational science. We demonstrate a very high
prediction accuracy (greater than or equal to 95%). This includes an end-to-end
systematic machine learning workflow for amperometric time series datasets
consisting of pre-processing; feature extraction; model identification;
training and testing; followed by feature importance evaluation - all
implemented. We tested the method on heterogeneous amperometric time series
datasets generated using different experimental approaches, chemical
stimulations, electrode types, and varying recording times. We identified a
certain overarching set of common features across these datasets which enables
accurate predictions. Further, we showed that information relevant for the
classification of amperometric traces are neither in the spiky segments alone,
nor can it be retrieved from just the temporal structure of spikes. In fact,
the transients between spikes and the trace baselines carry essential
information for a successful classification, thereby strongly demonstrating
that an effective feature representation of amperometric time series requires
the full time series. To our knowledge, this is one of the first studies that
propose a scheme for machine learning, and in particular, supervised learning
on full amperometry time series data.
Related papers
- Stacked ensemble\-based mutagenicity prediction model using multiple modalities with graph attention network [0.9736758288065405]
Mutagenicity is a concern due to its association with genetic mutations which can result in a variety of negative consequences.
In this work, we introduce a novel stacked ensemble based mutagenicity prediction model.
arXiv Detail & Related papers (2024-09-03T09:14:21Z) - Fast and Functional Structured Data Generators Rooted in
Out-of-Equilibrium Physics [62.997667081978825]
We address the challenge of using energy-based models to produce high-quality, label-specific data in structured datasets.
Traditional training methods encounter difficulties due to inefficient Markov chain Monte Carlo mixing.
We use a novel training algorithm that exploits non-equilibrium effects.
arXiv Detail & Related papers (2023-07-13T15:08:44Z) - Multivariate Time Series Classification: A Deep Learning Approach [1.0742675209112622]
This paper investigates different methods and various neural network architectures applicable in the time series classification domain.
Data is obtained from a fleet of gas sensors that measure and track quantities such as oxygen and sound.
With the help of this data, we can detect events such as occupancy in a specific environment.
arXiv Detail & Related papers (2023-07-05T12:50:48Z) - Machine Learning for Flow Cytometry Data Analysis [0.0]
Flow cytometers can rapidly analyse tens of thousands of cells at the same time while also measuring multiple parameters from a single cell.
Researchers need to be able to distinguish interesting-looking cell populations manually in multi-dimensional data collected from millions of cells.
Three representative automated clustering algorithms are selected to be applied, compared and evaluated by completely and partially automated gating.
arXiv Detail & Related papers (2023-03-16T00:43:46Z) - Time Associated Meta Learning for Clinical Prediction [78.99422473394029]
We propose a novel time associated meta learning (TAML) method to make effective predictions at multiple future time points.
To address the sparsity problem after task splitting, TAML employs a temporal information sharing strategy to augment the number of positive samples.
We demonstrate the effectiveness of TAML on multiple clinical datasets, where it consistently outperforms a range of strong baselines.
arXiv Detail & Related papers (2023-03-05T03:54:54Z) - DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG
Signals [62.997667081978825]
We develop a novel statistical point process model-called driven temporal point processes (DriPP)
We derive a fast and principled expectation-maximization (EM) algorithm to estimate the parameters of this model.
Results on standard MEG datasets demonstrate that our methodology reveals event-related neural responses.
arXiv Detail & Related papers (2021-12-08T13:07:21Z) - Deep Metric Learning with Locality Sensitive Angular Loss for
Self-Correcting Source Separation of Neural Spiking Signals [77.34726150561087]
We propose a methodology based on deep metric learning to address the need for automated post-hoc cleaning and robust separation filters.
We validate this method with an artificially corrupted label set based on source-separated high-density surface electromyography recordings.
This approach enables a neural network to learn to accurately decode neurophysiological time series using any imperfect method of labelling the signal.
arXiv Detail & Related papers (2021-10-13T21:51:56Z) - Benchmarking Deep Learning Interpretability in Time Series Predictions [41.13847656750174]
Saliency methods are used extensively to highlight the importance of input features in model predictions.
We set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures.
arXiv Detail & Related papers (2020-10-26T22:07:53Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.