Semi-unsupervised Learning for Time Series Classification
- URL: http://arxiv.org/abs/2207.03119v2
- Date: Fri, 8 Jul 2022 07:30:51 GMT
- Title: Semi-unsupervised Learning for Time Series Classification
- Authors: Padraig Davidson and Michael Steininger and Andr\'e Huhn and Anna
Krause and Andreas Hotho
- Abstract summary: Time series are ubiquitous and inherently hard to analyze and ultimately to label or cluster.
We present SuSL4TS, a deep generative Gaussian mixture model for semi-unsupervised learning to classify time series data.
- Score: 1.8811803364757567
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Time series are ubiquitous and therefore inherently hard to analyze and
ultimately to label or cluster. With the rise of the Internet of Things (IoT)
and its smart devices, data is collected in large amounts any given second. The
collected data is rich in information, as one can detect accidents (e.g. cars)
in real time, or assess injury/sickness over a given time span (e.g. health
devices). Due to its chaotic nature and massive amounts of datapoints,
timeseries are hard to label manually. Furthermore new classes within the data
could emerge over time (contrary to e.g. handwritten digits), which would
require relabeling the data. In this paper we present SuSL4TS, a deep
generative Gaussian mixture model for semi-unsupervised learning, to classify
time series data. With our approach we can alleviate manual labeling steps,
since we can detect sparsely labeled classes (semi-supervised) and identify
emerging classes hidden in the data (unsupervised). We demonstrate the efficacy
of our approach with established time series classification datasets from
different domains.
Related papers
- From Categories to Classifiers: Name-Only Continual Learning by Exploring the Web [118.67589717634281]
Continual learning often relies on the availability of extensive annotated datasets, an assumption that is unrealistically time-consuming and costly in practice.
We explore a novel paradigm termed name-only continual learning where time and cost constraints prohibit manual annotation.
Our proposed solution leverages the expansive and ever-evolving internet to query and download uncurated webly-supervised data for image classification.
arXiv Detail & Related papers (2023-11-19T10:43:43Z) - Time-Varying Propensity Score to Bridge the Gap between the Past and Present [104.46387765330142]
We introduce a time-varying propensity score that can detect gradual shifts in the distribution of data.
We demonstrate different ways of implementing it and evaluate it on a variety of problems.
arXiv Detail & Related papers (2022-10-04T07:21:49Z) - HAR-GCNN: Deep Graph CNNs for Human Activity Recognition From Highly
Unlabeled Mobile Sensor Data [61.79595926825511]
Acquiring balanced datasets containing accurate activity labels requires humans to correctly annotate and potentially interfere with the subjects' normal activities in real-time.
We propose HAR-GCCN, a deep graph CNN model that leverages the correlation between chronologically adjacent sensor measurements to predict the correct labels for unclassified activities.
Har-GCCN shows superior performance relative to previously used baseline methods, improving classification accuracy by about 25% and up to 68% on different datasets.
arXiv Detail & Related papers (2022-03-07T01:23:46Z) - AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning [69.47585818994959]
We evaluate a big data processing pipeline to auto-generate labels for remote sensing data.
We utilize the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas.
arXiv Detail & Related papers (2022-01-31T20:02:22Z) - Diverse Complexity Measures for Dataset Curation in Self-driving [80.55417232642124]
We propose a new data selection method that exploits a diverse set of criteria that quantize interestingness of traffic scenes.
Our experiments show that the proposed curation pipeline is able to select datasets that lead to better generalization and higher performance.
arXiv Detail & Related papers (2021-01-16T23:45:02Z) - Time Series Data Imputation: A Survey on Deep Learning Approaches [4.4458738910060775]
Time series data imputation is a well-studied problem with different categories of methods.
Time series methods based on deep learning have made progress with the usage of models like RNN.
We will review and discuss their model architectures, their pros and cons as well as their effects to show the development of the time series imputation methods.
arXiv Detail & Related papers (2020-11-23T11:57:27Z) - Deep learning for time series classification [2.0305676256390934]
Time series analysis allows us to visualize and understand the evolution of a process over time.
Time series classification consists of constructing algorithms dedicated to automatically label time series data.
Deep learning has emerged as one of the most effective methods for tackling the supervised classification task.
arXiv Detail & Related papers (2020-10-01T17:38:40Z) - Temporal Calibrated Regularization for Robust Noisy Label Learning [60.90967240168525]
Deep neural networks (DNNs) exhibit great success on many tasks with the help of large-scale well annotated datasets.
However, labeling large-scale data can be very costly and error-prone so that it is difficult to guarantee the annotation quality.
We propose a Temporal Calibrated Regularization (TCR) in which we utilize the original labels and the predictions in the previous epoch together.
arXiv Detail & Related papers (2020-07-01T04:48:49Z) - Deep Categorization with Semi-Supervised Self-Organizing Maps [0.0]
This article presents a semi-supervised model, called Batch Semi-Supervised Self-Organizing Map (Batch SS-SOM)
The results show that Batch SS-SOM is a good option for semi-supervised classification and clustering.
It performs well in terms of accuracy and clustering error, even with a small number of labeled samples.
arXiv Detail & Related papers (2020-06-17T22:00:04Z) - Clustering Time Series Data through Autoencoder-based Deep Learning
Models [1.0499611180329802]
This paper introduces a two-stage method for clustering time series data.
First, a technique is introduced to utilize the characteristics of given time series data in order to create labels.
Second, an autoencoder-based deep learning model is built to learn and model both known and hidden features of time series data.
arXiv Detail & Related papers (2020-04-11T18:51:13Z) - Time Series Data Augmentation for Deep Learning: A Survey [35.2161833151567]
We systematically review different data augmentation methods for time series data.
We empirically compare different data augmentation methods for different tasks including time series classification, anomaly detection, and forecasting.
arXiv Detail & Related papers (2020-02-27T23:38:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.