Fast, Accurate and Interpretable Time Series Classification Through
Randomization
- URL: http://arxiv.org/abs/2105.14876v1
- Date: Mon, 31 May 2021 10:59:11 GMT
- Title: Fast, Accurate and Interpretable Time Series Classification Through
Randomization
- Authors: Nestor Cabello, Elham Naghizade, Jianzhong Qi, Lars Kulik
- Abstract summary: Time series classification (TSC) aims to predict the class label of a given time series.
We propose a novel TSC method - the Randomized-Supervised Time Series Forest (r-STSF)
r-STSF is highly efficient, achieves state-of-the-art classification accuracy and enables interpretability.
- Score: 20.638480955703102
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Time series classification (TSC) aims to predict the class label of a given
time series, which is critical to a rich set of application areas such as
economics and medicine. State-of-the-art TSC methods have mostly focused on
classification accuracy and efficiency, without considering the
interpretability of their classifications, which is an important property
required by modern applications such as appliance modeling and legislation such
as the European General Data Protection Regulation. To address this gap, we
propose a novel TSC method - the Randomized-Supervised Time Series Forest
(r-STSF). r-STSF is highly efficient, achieves state-of-the-art classification
accuracy and enables interpretability. r-STSF takes an efficient interval-based
approach to classify time series according to aggregate values of
discriminatory sub-series (intervals). To achieve state-of-the-art accuracy,
r-STSF builds an ensemble of randomized trees using the discriminatory
sub-series. It uses four time series representations, nine aggregation
functions and a supervised binary-inspired search combined with a feature
ranking metric to identify highly discriminatory sub-series. The discriminatory
sub-series enable interpretable classifications. Experiments on extensive
datasets show that r-STSF achieves state-of-the-art accuracy while being orders
of magnitude faster than most existing TSC methods. It is the only classifier
from the state-of-the-art group that enables interpretability. Our findings
also highlight that r-STSF is the best TSC method when classifying complex time
series datasets.
Related papers
- Exploring Hierarchical Classification Performance for Time Series Data:
Dissimilarity Measures and Classifier Comparisons [0.0]
This study investigates the comparative performance of hierarchical classification (HC) and flat classification (FC) methodologies in time series data analysis.
Dissimilarity measures, including Jensen-Shannon Distance (JSD), Task Similarity Distance (TSD), and Based Distance (CBD) are leveraged.
arXiv Detail & Related papers (2024-02-07T21:46:26Z) - TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling [67.02157180089573]
Time series pre-training has recently garnered wide attention for its potential to reduce labeling expenses and benefit various downstream tasks.
This paper proposes TimeSiam as a simple but effective self-supervised pre-training framework for Time series based on Siamese networks.
arXiv Detail & Related papers (2024-02-04T13:10:51Z) - Shapelet-based Model-agnostic Counterfactual Local Explanations for Time
Series Classification [5.866975269666861]
We propose a model-agnostic instance-based post-hoc explainability method for time series classification.
The proposed algorithm, namely Time-CF, leverages shapelets and TimeGAN to provide counterfactual explanations for arbitrary time series classifiers.
arXiv Detail & Related papers (2024-02-02T11:57:53Z) - Convolutional and Deep Learning based techniques for Time Series Ordinal Classification [7.047582157120573]
Time Series Ordinal Classification (TSOC) is the field covering this gap, yet unexplored in the literature.
This paper presents a first benchmarking of TSOC methodologies, exploiting the ordering of the target labels to boost the performance of current TSC state-of-the-art.
arXiv Detail & Related papers (2023-06-16T11:57:11Z) - ProTeCt: Prompt Tuning for Taxonomic Open Set Classification [59.59442518849203]
Few-shot adaptation methods do not fare well in the taxonomic open set (TOS) setting.
We propose a prompt tuning technique that calibrates the hierarchical consistency of model predictions.
A new Prompt Tuning for Hierarchical Consistency (ProTeCt) technique is then proposed to calibrate classification across label set granularities.
arXiv Detail & Related papers (2023-06-04T02:55:25Z) - Parametric Classification for Generalized Category Discovery: A Baseline
Study [70.73212959385387]
Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples.
We investigate the failure of parametric classifiers, verify the effectiveness of previous design choices when high-quality supervision is available, and identify unreliable pseudo-labels as a key problem.
We propose a simple yet effective parametric classification method that benefits from entropy regularisation, achieves state-of-the-art performance on multiple GCD benchmarks and shows strong robustness to unknown class numbers.
arXiv Detail & Related papers (2022-11-21T18:47:11Z) - Early Time-Series Classification Algorithms: An Empirical Comparison [59.82930053437851]
Early Time-Series Classification (ETSC) is the task of predicting the class of incoming time-series by observing as few measurements as possible.
We evaluate six existing ETSC algorithms on publicly available data, as well as on two newly introduced datasets.
arXiv Detail & Related papers (2022-03-03T10:43:56Z) - Classification of multivariate weakly-labelled time-series with
attention [0.0]
Weakly labelled time-series are time-series containing noise and significant redundancies.
This paper proposes an approach of exploiting context relevance of subsequences to improve classification accuracy.
arXiv Detail & Related papers (2021-02-16T16:05:38Z) - Binary Classification from Multiple Unlabeled Datasets via Surrogate Set
Classification [94.55805516167369]
We propose a new approach for binary classification from m U-sets for $mge2$.
Our key idea is to consider an auxiliary classification task called surrogate set classification (SSC)
arXiv Detail & Related papers (2021-02-01T07:36:38Z) - The Canonical Interval Forest (CIF) Classifier for Time Series
Classification [0.0]
Time series forest (TSF) is one of the most well known interval methods.
We propose combining TSF and catch22 to form a new classifier, the Canonical Interval Forest (CIF)
We demonstrate a large and significant improvement in accuracy over both TSF and catch22, and show it to be on par with top performers from other algorithmic classes.
arXiv Detail & Related papers (2020-08-20T19:26:24Z) - Benchmarking Multivariate Time Series Classification Algorithms [69.12151492736524]
Time Series Classification (TSC) involved building predictive models for a discrete target variable from ordered, real valued, attributes.
Over recent years, a new set of TSC algorithms have been developed which have made significant improvement over the previous state of the art.
We review recently proposed bespoke MTSC algorithms based on deep learning, shapelets and bag of words approaches.
arXiv Detail & Related papers (2020-07-26T15:56:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.