Self-supervised Multi-view Clustering in Computer Vision: A Survey
- URL: http://arxiv.org/abs/2309.09473v1
- Date: Mon, 18 Sep 2023 04:11:18 GMT
- Title: Self-supervised Multi-view Clustering in Computer Vision: A Survey
- Authors: Jiatai Wang, Zhiwei Xu, Xuewen Yang, Hailong Li, Bo Li, Xuying Meng
- Abstract summary: Multi-view clustering (MVC) has had significant implications in cross-modal representation learning and data-driven decision-making.
This paper explores the reasons and advantages of the emergence of self-supervised MVC.
- Score: 14.432997752719473
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-view clustering (MVC) has had significant implications in cross-modal
representation learning and data-driven decision-making in recent years. It
accomplishes this by leveraging the consistency and complementary information
among multiple views to cluster samples into distinct groups. However, as
contrastive learning continues to evolve within the field of computer vision,
self-supervised learning has also made substantial research progress and is
progressively becoming dominant in MVC methods. It guides the clustering
process by designing proxy tasks to mine the representation of image and video
data itself as supervisory information. Despite the rapid development of
self-supervised MVC, there has yet to be a comprehensive survey to analyze and
summarize the current state of research progress. Therefore, this paper
explores the reasons and advantages of the emergence of self-supervised MVC and
discusses the internal connections and classifications of common datasets, data
issues, representation learning methods, and self-supervised learning methods.
This paper does not only introduce the mechanisms for each category of methods
but also gives a few examples of how these techniques are used. In the end,
some open problems are pointed out for further investigation and development.
Related papers
- SLRL: Structured Latent Representation Learning for Multi-view Clustering [24.333292079699554]
Multi-View Clustering (MVC) aims to exploit the inherent consistency and complementarity among different views to improve clustering outcomes.
Despite extensive research in MVC, most existing methods focus predominantly on harnessing complementary information across views to enhance clustering effectiveness.
We introduce a novel framework, termed Structured Latent Representation Learning based Multi-View Clustering method.
arXiv Detail & Related papers (2024-07-11T09:43:57Z) - CDIMC-net: Cognitive Deep Incomplete Multi-view Clustering Network [53.72046586512026]
We propose a novel incomplete multi-view clustering network, called Cognitive Deep Incomplete Multi-view Clustering Network (CDIMC-net)
It captures the high-level features and local structure of each view by incorporating the view-specific deep encoders and graph embedding strategy into a framework.
Based on the human cognition, i.e., learning from easy to hard, it introduces a self-paced strategy to select the most confident samples for model training.
arXiv Detail & Related papers (2024-03-28T15:45:03Z) - Neural Clustering based Visual Representation Learning [61.72646814537163]
Clustering is one of the most classic approaches in machine learning and data analysis.
We propose feature extraction with clustering (FEC), which views feature extraction as a process of selecting representatives from data.
FEC alternates between grouping pixels into individual clusters to abstract representatives and updating the deep features of pixels with current representatives.
arXiv Detail & Related papers (2024-03-26T06:04:50Z) - A Probabilistic Model behind Self-Supervised Learning [53.64989127914936]
In self-supervised learning (SSL), representations are learned via an auxiliary task without annotated labels.
We present a generative latent variable model for self-supervised learning.
We show that several families of discriminative SSL induce a comparable distribution over representations.
arXiv Detail & Related papers (2024-02-02T13:31:17Z) - Masked Modeling for Self-supervised Representation Learning on Vision
and Beyond [69.64364187449773]
Masked modeling has emerged as a distinctive approach that involves predicting parts of the original data that are proportionally masked during training.
We elaborate on the details of techniques within masked modeling, including diverse masking strategies, recovering targets, network architectures, and more.
We conclude by discussing the limitations of current techniques and point out several potential avenues for advancing masked modeling research.
arXiv Detail & Related papers (2023-12-31T12:03:21Z) - Multi-View Class Incremental Learning [57.14644913531313]
Multi-view learning (MVL) has gained great success in integrating information from multiple perspectives of a dataset to improve downstream task performance.
This paper investigates a novel paradigm called multi-view class incremental learning (MVCIL), where a single model incrementally classifies new classes from a continual stream of views.
arXiv Detail & Related papers (2023-06-16T08:13:41Z) - Generative Partial Multi-View Clustering [133.36721417531734]
We propose a generative partial multi-view clustering model, named as GP-MVC, to address the incomplete multi-view problem.
First, multi-view encoder networks are trained to learn common low-dimensional representations, followed by a clustering layer to capture the consistent cluster structure across multiple views.
Second, view-specific generative adversarial networks are developed to generate the missing data of one view conditioning on the shared representation given by other views.
arXiv Detail & Related papers (2020-03-29T17:48:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.