ConiVAT: Cluster Tendency Assessment and Clustering with Partial
Background Knowledge
- URL: http://arxiv.org/abs/2008.09570v2
- Date: Mon, 28 Sep 2020 17:21:09 GMT
- Title: ConiVAT: Cluster Tendency Assessment and Clustering with Partial
Background Knowledge
- Authors: Punit Rathore, James C. Bezdek, Paolo Santi, Carlo Ratti
- Abstract summary: ConiVAT is a constraint-based version of iVAT that makes use of background knowledge in the form of constraints.
We demonstrate ConiVAT approach to visual assessment and single linkage clustering on nine datasets.
- Score: 11.600065064765325
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The VAT method is a visual technique for determining the potential cluster
structure and the possible number of clusters in numerical data. Its improved
version, iVAT, uses a path-based distance transform to improve the
effectiveness of VAT for "tough" cases. Both VAT and iVAT have also been used
in conjunction with a single-linkage(SL) hierarchical clustering algorithm.
However, they are sensitive to noise and bridge points between clusters in the
dataset, and consequently, the corresponding VAT/iVAT images are often
in-conclusive for such cases. In this paper, we propose a constraint-based
version of iVAT, which we call ConiVAT, that makes use of background knowledge
in the form of constraints, to improve VAT/iVAT for challenging and complex
datasets. ConiVAT uses the input constraints to learn the underlying similarity
metric and builds a minimum transitive dissimilarity matrix, before applying
VAT to it. We demonstrate ConiVAT approach to visual assessment and single
linkage clustering on nine datasets to show that, it improves the quality of
iVAT images for complex datasets, and it also overcomes the limitation of SL
clustering with VAT/iVAT due to "noisy" bridges between clusters. Extensive
experiment results on nine datasets suggest that ConiVAT outperforms the other
three semi-supervised clustering algorithms in terms of improved clustering
accuracy.
Related papers
- A3S: A General Active Clustering Method with Pairwise Constraints [66.74627463101837]
A3S features strategic active clustering adjustment on the initial cluster result, which is obtained by an adaptive clustering algorithm.
In extensive experiments across diverse real-world datasets, A3S achieves desired results with significantly fewer human queries.
arXiv Detail & Related papers (2024-07-14T13:37:03Z) - DeepVAT: A Self-Supervised Technique for Cluster Assessment in Image
Datasets [1.911666854588103]
Estimating the number of clusters and cluster structures in unlabeled, complex, and high-dimensional datasets (like images) is challenging for traditional clustering algorithms.
We propose a deep-learning-based framework that enables the assessment of cluster structure in complex image datasets.
arXiv Detail & Related papers (2023-05-29T22:32:39Z) - Multi-View Clustering via Semi-non-negative Tensor Factorization [120.87318230985653]
We develop a novel multi-view clustering based on semi-non-negative tensor factorization (Semi-NTF)
Our model directly considers the between-view relationship and exploits the between-view complementary information.
In addition, we provide an optimization algorithm for the proposed method and prove mathematically that the algorithm always converges to the stationary KKT point.
arXiv Detail & Related papers (2023-03-29T14:54:19Z) - CADIS: Handling Cluster-skewed Non-IID Data in Federated Learning with
Clustered Aggregation and Knowledge DIStilled Regularization [3.3711670942444014]
Federated learning enables edge devices to train a global model collaboratively without exposing their data.
We tackle a new type of Non-IID data, called cluster-skewed non-IID, discovered in actual data sets.
We propose an aggregation scheme that guarantees equality between clusters.
arXiv Detail & Related papers (2023-02-21T02:53:37Z) - Weakly Supervised Video Anomaly Detection Based on Cross-Batch
Clustering Guidance [39.43891080713327]
Weakly supervised video anomaly detection (WSVAD) is a challenging task since only video-level labels are available for training.
We propose a novel WSVAD method based on cross-batch clustering guidance.
arXiv Detail & Related papers (2022-12-16T14:38:30Z) - Graph Contrastive Clustering [131.67881457114316]
We propose a novel graph contrastive learning framework, which is then applied to the clustering task and we come up with the Graph Constrastive Clustering(GCC) method.
Specifically, on the one hand, the graph Laplacian based contrastive loss is proposed to learn more discriminative and clustering-friendly features.
On the other hand, a novel graph-based contrastive learning strategy is proposed to learn more compact clustering assignments.
arXiv Detail & Related papers (2021-04-03T15:32:49Z) - Contrastive Clustering [57.71729650297379]
We propose Contrastive Clustering (CC) which explicitly performs the instance- and cluster-level contrastive learning.
In particular, CC achieves an NMI of 0.705 (0.431) on the CIFAR-10 (CIFAR-100) dataset, which is an up to 19% (39%) performance improvement compared with the best baseline.
arXiv Detail & Related papers (2020-09-21T08:54:40Z) - iCVI-ARTMAP: Accelerating and improving clustering using adaptive
resonance theory predictive mapping and incremental cluster validity indices [1.160208922584163]
iCVI-ARTMAP uses incremental cluster validity indices (iCVIs) to perform unsupervised learning.
It can achieve running times up to two orders of magnitude shorter than when using batch CVI computations.
arXiv Detail & Related papers (2020-08-22T19:37:01Z) - Pairwise Similarity Knowledge Transfer for Weakly Supervised Object
Localization [53.99850033746663]
We study the problem of learning localization model on target classes with weakly supervised image labels.
In this work, we argue that learning only an objectness function is a weak form of knowledge transfer.
Experiments on the COCO and ILSVRC 2013 detection datasets show that the performance of the localization model improves significantly with the inclusion of pairwise similarity function.
arXiv Detail & Related papers (2020-03-18T17:53:33Z) - New advances in enumerative biclustering algorithms with online
partitioning [80.22629846165306]
This paper further extends RIn-Close_CVC, a biclustering algorithm capable of performing an efficient, complete, correct and non-redundant enumeration of maximal biclusters with constant values on columns in numerical datasets.
The improved algorithm is called RIn-Close_CVC3, keeps those attractive properties of RIn-Close_CVC, and is characterized by: a drastic reduction in memory usage; a consistent gain in runtime.
arXiv Detail & Related papers (2020-03-07T14:54:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.