Snapshot Spectral Clustering -- a costless approach to deep clustering
ensembles generation
- URL: http://arxiv.org/abs/2307.08591v1
- Date: Mon, 17 Jul 2023 16:01:22 GMT
- Title: Snapshot Spectral Clustering -- a costless approach to deep clustering
ensembles generation
- Authors: Adam Pir\'og, Halina Kwa\'snicka
- Abstract summary: This paper proposes a novel deep clustering ensemble method - Snapshot Spectral Clustering.
It is designed to maximize the gain from combining multiple data views while minimizing the computational costs of creating the ensemble.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Despite tremendous advancements in Artificial Intelligence, learning from
large sets of data in an unsupervised manner remains a significant challenge.
Classical clustering algorithms often fail to discover complex dependencies in
large datasets, especially considering sparse, high-dimensional spaces.
However, deep learning techniques proved to be successful when dealing with
large quantities of data, efficiently reducing their dimensionality without
losing track of underlying information. Several interesting advancements have
already been made to combine deep learning and clustering. Still, the idea of
enhancing the clustering results by combining multiple views of the data
generated by deep neural networks appears to be insufficiently explored yet.
This paper aims to investigate this direction and bridge the gap between deep
neural networks, clustering techniques and ensemble learning methods. To
achieve this goal, we propose a novel deep clustering ensemble method -
Snapshot Spectral Clustering, designed to maximize the gain from combining
multiple data views while minimizing the computational costs of creating the
ensemble. Comparative analysis and experiments described in this paper prove
the proposed concept, while the conducted hyperparameter study provides a
valuable intuition to follow when selecting proper values.
Related papers
- GCC: Generative Calibration Clustering [55.44944397168619]
We propose a novel Generative Clustering (GCC) method to incorporate feature learning and augmentation into clustering procedure.
First, we develop a discrimirative feature alignment mechanism to discover intrinsic relationship across real and generated samples.
Second, we design a self-supervised metric learning to generate more reliable cluster assignment.
arXiv Detail & Related papers (2024-04-14T01:51:11Z) - CDIMC-net: Cognitive Deep Incomplete Multi-view Clustering Network [53.72046586512026]
We propose a novel incomplete multi-view clustering network, called Cognitive Deep Incomplete Multi-view Clustering Network (CDIMC-net)
It captures the high-level features and local structure of each view by incorporating the view-specific deep encoders and graph embedding strategy into a framework.
Based on the human cognition, i.e., learning from easy to hard, it introduces a self-paced strategy to select the most confident samples for model training.
arXiv Detail & Related papers (2024-03-28T15:45:03Z) - Deep Clustering Using the Soft Silhouette Score: Towards Compact and
Well-Separated Clusters [0.0]
We propose soft silhoutte, a probabilistic formulation of the silhouette coefficient.
We introduce an autoencoder-based deep learning architecture that is suitable for optimizing the soft silhouette objective function.
The proposed deep clustering method has been tested and compared with several well-studied deep clustering methods on various benchmark datasets.
arXiv Detail & Related papers (2024-02-01T14:02:06Z) - Hard Regularization to Prevent Deep Online Clustering Collapse without
Data Augmentation [65.268245109828]
Online deep clustering refers to the joint use of a feature extraction network and a clustering model to assign cluster labels to each new data point or batch as it is processed.
While faster and more versatile than offline methods, online clustering can easily reach the collapsed solution where the encoder maps all inputs to the same point and all are put into a single cluster.
We propose a method that does not require data augmentation, and that, differently from existing methods, regularizes the hard assignments.
arXiv Detail & Related papers (2023-03-29T08:23:26Z) - Deep Clustering: A Comprehensive Survey [53.387957674512585]
Clustering analysis plays an indispensable role in machine learning and data mining.
Deep clustering, which can learn clustering-friendly representations using deep neural networks, has been broadly applied in a wide range of clustering tasks.
Existing surveys for deep clustering mainly focus on the single-view fields and the network architectures, ignoring the complex application scenarios of clustering.
arXiv Detail & Related papers (2022-10-09T02:31:32Z) - A Deep Dive into Deep Cluster [0.2578242050187029]
DeepCluster is a simple and scalable unsupervised pretraining of visual representations.
We show that DeepCluster convergence and performance depend on the interplay between the quality of the randomly filters of the convolutional layer and the selected number of clusters.
arXiv Detail & Related papers (2022-07-24T22:55:09Z) - A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and
Future Directions [48.97008907275482]
Clustering is a fundamental machine learning task which has been widely studied in the literature.
Deep Clustering, i.e., jointly optimizing the representation learning and clustering, has been proposed and hence attracted growing attention in the community.
We summarize the essential components of deep clustering and categorize existing methods by the ways they design interactions between deep representation learning and clustering.
arXiv Detail & Related papers (2022-06-15T15:05:13Z) - DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep
Neural Networks [53.88811980967342]
This paper presents a Deep Clustering via Ensembles (DeepCluE) approach.
It bridges the gap between deep clustering and ensemble clustering by harnessing the power of multiple layers in deep neural networks.
Experimental results on six image datasets confirm the advantages of DeepCluE over the state-of-the-art deep clustering approaches.
arXiv Detail & Related papers (2022-06-01T09:51:38Z) - Large-Scale Hyperspectral Image Clustering Using Contrastive Learning [18.473767002905433]
We present a scalable deep online clustering model, named Spectral-Spatial Contrastive Clustering (SSCC)
We exploit a symmetric twin neural network comprised of a projection head with a dimensionality of the cluster number to conduct dual contrastive learning from a spectral-spatial augmentation pool.
The resulting approach is trained in an end-to-end fashion by batch-wise optimization, making it robust in large-scale data and resulting in good generalization ability for unseen data.
arXiv Detail & Related papers (2021-11-15T17:50:06Z) - Decorrelating Adversarial Nets for Clustering Mobile Network Data [0.7034976835586089]
A subset of deep learning, deep clustering could be a valuable tool for many network automation use-cases.
Most state-of-the-art clustering algorithms target image datasets, which makes them hard to apply to mobile network data.
We propose a new algorithm, DANCE, intended to be a reliable deep clustering method which also performs well when applied to network automation use-cases.
arXiv Detail & Related papers (2021-03-11T15:26:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.