DeepVAT: A Self-Supervised Technique for Cluster Assessment in Image
Datasets
- URL: http://arxiv.org/abs/2306.00011v2
- Date: Mon, 31 Jul 2023 15:36:39 GMT
- Title: DeepVAT: A Self-Supervised Technique for Cluster Assessment in Image
Datasets
- Authors: Alokendu Mazumder, Tirthajit Baruah, Akash Kumar Singh, Pagadla
Krishna Murthy, Vishwajeet Pattanaik, Punit Rathore
- Abstract summary: Estimating the number of clusters and cluster structures in unlabeled, complex, and high-dimensional datasets (like images) is challenging for traditional clustering algorithms.
We propose a deep-learning-based framework that enables the assessment of cluster structure in complex image datasets.
- Score: 1.911666854588103
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Estimating the number of clusters and cluster structures in unlabeled,
complex, and high-dimensional datasets (like images) is challenging for
traditional clustering algorithms. In recent years, a matrix reordering-based
algorithm called Visual Assessment of Tendency (VAT), and its variants have
attracted many researchers from various domains to estimate the number of
clusters and inherent cluster structure present in the data. However, these
algorithms face significant challenges when dealing with image data as they
fail to effectively capture the crucial features inherent in images. To
overcome these limitations, we propose a deep-learning-based framework that
enables the assessment of cluster structure in complex image datasets. Our
approach utilizes a self-supervised deep neural network to generate
representative embeddings for the data. These embeddings are then reduced to
2-dimension using t-distributed Stochastic Neighbour Embedding (t-SNE) and
inputted into VAT based algorithms to estimate the underlying cluster
structure. Importantly, our framework does not rely on any prior knowledge of
the number of clusters. Our proposed approach demonstrates superior performance
compared to state-of-the-art VAT family algorithms and two other deep
clustering algorithms on four benchmark image datasets, namely MNIST, FMNIST,
CIFAR-10, and INTEL.
Related papers
- Unfolding ADMM for Enhanced Subspace Clustering of Hyperspectral Images [43.152314090830174]
We introduce an innovative clustering architecture for hyperspectral images (HSI) by unfolding an iterative solver based on the Alternating Direction Method of Multipliers (ADMM) for sparse subspace clustering.
Our approach captures well the structural characteristics of HSI data by employing the K nearest neighbors algorithm as part of a structure preservation module.
arXiv Detail & Related papers (2024-04-10T15:51:46Z) - Reinforcement Graph Clustering with Unknown Cluster Number [91.4861135742095]
We propose a new deep graph clustering method termed Reinforcement Graph Clustering.
In our proposed method, cluster number determination and unsupervised representation learning are unified into a uniform framework.
In order to conduct feedback actions, the clustering-oriented reward function is proposed to enhance the cohesion of the same clusters and separate the different clusters.
arXiv Detail & Related papers (2023-08-13T18:12:28Z) - Multi-View Clustering via Semi-non-negative Tensor Factorization [120.87318230985653]
We develop a novel multi-view clustering based on semi-non-negative tensor factorization (Semi-NTF)
Our model directly considers the between-view relationship and exploits the between-view complementary information.
In addition, we provide an optimization algorithm for the proposed method and prove mathematically that the algorithm always converges to the stationary KKT point.
arXiv Detail & Related papers (2023-03-29T14:54:19Z) - DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep
Neural Networks [53.88811980967342]
This paper presents a Deep Clustering via Ensembles (DeepCluE) approach.
It bridges the gap between deep clustering and ensemble clustering by harnessing the power of multiple layers in deep neural networks.
Experimental results on six image datasets confirm the advantages of DeepCluE over the state-of-the-art deep clustering approaches.
arXiv Detail & Related papers (2022-06-01T09:51:38Z) - Very Compact Clusters with Structural Regularization via Similarity and
Connectivity [3.779514860341336]
We propose an end-to-end deep clustering algorithm, i.e., Very Compact Clusters (VCC) for the general datasets.
Our proposed approach achieves better clustering performance over most of the state-of-the-art clustering methods.
arXiv Detail & Related papers (2021-06-09T23:22:03Z) - DAC: Deep Autoencoder-based Clustering, a General Deep Learning
Framework of Representation Learning [0.0]
We propose DAC, Deep Autoencoder-based Clustering, a data-driven framework to learn clustering representations using deep neuron networks.
Experiment results show that our approach could effectively boost performance of the KMeans clustering algorithm on a variety of datasets.
arXiv Detail & Related papers (2021-02-15T11:31:00Z) - Scattering Transform Based Image Clustering using Projection onto
Orthogonal Complement [2.0305676256390934]
We introduce Projected-Scattering Spectral Clustering (PSSC), a state-of-the-art, stable, and fast algorithm for image clustering.
PSSC includes a novel method to exploit the geometric structure of the scattering transform of small images.
Our experiments demonstrate that PSSC obtains the best results among all shallow clustering algorithms.
arXiv Detail & Related papers (2020-11-23T17:59:03Z) - Scalable Hierarchical Agglomerative Clustering [65.66407726145619]
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z) - Unsupervised Multi-view Clustering by Squeezing Hybrid Knowledge from
Cross View and Each View [68.88732535086338]
This paper proposes a new multi-view clustering method, low-rank subspace multi-view clustering based on adaptive graph regularization.
Experimental results for five widely used multi-view benchmarks show that our proposed algorithm surpasses other state-of-the-art methods by a clear margin.
arXiv Detail & Related papers (2020-08-23T08:25:06Z) - Improving k-Means Clustering Performance with Disentangled Internal
Representations [0.0]
We propose a simpler approach of optimizing the entanglement of the learned latent code representation of an autoencoder.
Using our proposed approach, the test clustering accuracy was 96.2% on the MNIST dataset, 85.6% on the Fashion-MNIST dataset, and 79.2% on the EMNIST Balanced dataset, outperforming our baseline models.
arXiv Detail & Related papers (2020-06-05T11:32:34Z) - Learning to Cluster Faces via Confidence and Connectivity Estimation [136.5291151775236]
We propose a fully learnable clustering framework without requiring a large number of overlapped subgraphs.
Our method significantly improves clustering accuracy and thus performance of the recognition models trained on top, yet it is an order of magnitude more efficient than existing supervised methods.
arXiv Detail & Related papers (2020-04-01T13:39:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.