Clustering acoustic emission data streams with sequentially appearing
clusters using mixture models
- URL: http://arxiv.org/abs/2108.11211v1
- Date: Wed, 25 Aug 2021 13:01:06 GMT
- Title: Clustering acoustic emission data streams with sequentially appearing
clusters using mixture models
- Authors: Emmanuel Ramasso, Thierry Den{\o}e ux, Ga\"el Chevallier
- Abstract summary: We develop a new clustering method to handle the specificities of unlabeled acoustic emission (AE) data.
The method, called GMMSEQ, is experimentally validated to characterize the loosening phenomenon in bolted structure under vibrations.
In view of developing an open acoustic emission initiative, the datasets and the codes are made available to reproduce the research of this paper.
- Score: 6.166295570030645
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The interpretation of unlabeled acoustic emission (AE) data classically
relies on general-purpose clustering methods. While several external criteria
have been used in the past to select the hyperparameters of those algorithms,
few studies have paid attention to the development of dedicated objective
functions in clustering methods able to cope with the specificities of AE data.
We investigate how to explicitly represent clusters onsets in mixture models in
general, and in Gaussian Mixture Models (GMM) in particular. By modifying the
internal criterion of such models, we propose the first clustering method able
to provide, through parameters estimated by an expectation-maximization
procedure, information about when clusters occur (onsets), how they grow
(kinetics) and their level of activation through time. This new objective
function accommodates continuous timestamps of AE signals and, thus, their
order of occurrence. The method, called GMMSEQ, is experimentally validated to
characterize the loosening phenomenon in bolted structure under vibrations. A
comparison with three standard clustering methods on raw streaming data from
five experimental campaigns shows that GMMSEQ not only provides useful
qualitative information about the timeline of clusters, but also shows better
performance in terms of cluster characterization. In view of developing an open
acoustic emission initiative and according to the FAIR principles, the datasets
and the codes are made available to reproduce the research of this paper.
Related papers
- GCC: Generative Calibration Clustering [55.44944397168619]
We propose a novel Generative Clustering (GCC) method to incorporate feature learning and augmentation into clustering procedure.
First, we develop a discrimirative feature alignment mechanism to discover intrinsic relationship across real and generated samples.
Second, we design a self-supervised metric learning to generate more reliable cluster assignment.
arXiv Detail & Related papers (2024-04-14T01:51:11Z) - Robust and Automatic Data Clustering: Dirichlet Process meets
Median-of-Means [18.3248037914529]
We present an efficient and automatic clustering technique by integrating the principles of model-based and centroid-based methodologies.
Statistical guarantees on the upper bound of clustering error suggest the advantages of our proposed method over existing state-of-the-art clustering algorithms.
arXiv Detail & Related papers (2023-11-26T19:01:15Z) - Unified Multi-View Orthonormal Non-Negative Graph Based Clustering
Framework [74.25493157757943]
We formulate a novel clustering model, which exploits the non-negative feature property and incorporates the multi-view information into a unified joint learning framework.
We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features.
arXiv Detail & Related papers (2022-11-03T08:18:27Z) - Time Series Clustering with an EM algorithm for Mixtures of Linear
Gaussian State Space Models [0.0]
We propose a novel model-based time series clustering method with mixtures of linear Gaussian state space models.
The proposed method uses a new expectation-maximization algorithm for the mixture model to estimate the model parameters.
Experiments on a simulated dataset demonstrate the effectiveness of the method in clustering, parameter estimation, and model selection.
arXiv Detail & Related papers (2022-08-25T07:41:23Z) - K-ARMA Models for Clustering Time Series Data [4.345882429229813]
We present an approach to clustering time series data using a model-based generalization of the K-Means algorithm.
We show how the clustering algorithm can be made robust to outliers using a least-absolute deviations criteria.
We perform experiments on real data which show that our method is competitive with other existing methods for similar time series clustering tasks.
arXiv Detail & Related papers (2022-06-30T18:16:11Z) - Correlation Clustering Reconstruction in Semi-Adversarial Models [70.11015369368272]
Correlation Clustering is an important clustering problem with many applications.
We study the reconstruction version of this problem in which one is seeking to reconstruct a latent clustering corrupted by random noise and adversarial modifications.
arXiv Detail & Related papers (2021-08-10T14:46:17Z) - Deep Conditional Gaussian Mixture Model for Constrained Clustering [7.070883800886882]
Constrained clustering can leverage prior information on a growing amount of only partially labeled data.
We propose a novel framework for constrained clustering that is intuitive, interpretable, and can be trained efficiently in the framework of gradient variational inference.
arXiv Detail & Related papers (2021-06-11T13:38:09Z) - Scalable Hierarchical Agglomerative Clustering [65.66407726145619]
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z) - Kernel learning approaches for summarising and combining posterior
similarity matrices [68.8204255655161]
We build upon the notion of the posterior similarity matrix (PSM) in order to suggest new approaches for summarising the output of MCMC algorithms for Bayesian clustering models.
A key contribution of our work is the observation that PSMs are positive semi-definite, and hence can be used to define probabilistically-motivated kernel matrices.
arXiv Detail & Related papers (2020-09-27T14:16:14Z) - Progressive Cluster Purification for Unsupervised Feature Learning [48.87365358296371]
In unsupervised feature learning, sample specificity based methods ignore the inter-class information.
We propose a novel clustering based method, which excludes class inconsistent samples during progressive cluster formation.
Our approach, referred to as Progressive Cluster Purification (PCP), implements progressive clustering by gradually reducing the number of clusters during training.
arXiv Detail & Related papers (2020-07-06T08:11:03Z) - Autoencoder-based time series clustering with energy applications [0.0]
Time series clustering is a challenging task due to the specific nature of the data.
In this paper we investigate the combination of a convolutional autoencoder and a k-medoids algorithm to perfom time series clustering.
arXiv Detail & Related papers (2020-02-10T10:04:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.