Incomplete Multi-View Multi-Label Learning via Label-Guided Masked View-
and Category-Aware Transformers
- URL: http://arxiv.org/abs/2303.07180v1
- Date: Mon, 13 Mar 2023 15:22:50 GMT
- Title: Incomplete Multi-View Multi-Label Learning via Label-Guided Masked View-
and Category-Aware Transformers
- Authors: Chengliang Liu, Jie Wen, Xiaoling Luo, Yong Xu
- Abstract summary: We propose a general multi-view multi-label learning framework named label-guided masked view- and category-aware transformers.
Considering the imbalance of expressive power among views, an adaptively weighted view fusion module is proposed to obtain view-consistent embedding features.
- Score: 19.720564730308993
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As we all know, multi-view data is more expressive than single-view data and
multi-label annotation enjoys richer supervision information than single-label,
which makes multi-view multi-label learning widely applicable for various
pattern recognition tasks. In this complex representation learning problem,
three main challenges can be characterized as follows: i) How to learn
consistent representations of samples across all views? ii) How to exploit and
utilize category correlations of multi-label to guide inference? iii) How to
avoid the negative impact resulting from the incompleteness of views or labels?
To cope with these problems, we propose a general multi-view multi-label
learning framework named label-guided masked view- and category-aware
transformers in this paper. First, we design two transformer-style based
modules for cross-view features aggregation and multi-label classification,
respectively. The former aggregates information from different views in the
process of extracting view-specific features, and the latter learns subcategory
embedding to improve classification performance. Second, considering the
imbalance of expressive power among views, an adaptively weighted view fusion
module is proposed to obtain view-consistent embedding features. Third, we
impose a label manifold constraint in sample-level representation learning to
maximize the utilization of supervised information. Last but not least, all the
modules are designed under the premise of incomplete views and labels, which
makes our method adaptable to arbitrary multi-view and multi-label data.
Extensive experiments on five datasets confirm that our method has clear
advantages over other state-of-the-art methods.
Related papers
- Multi-Label Knowledge Distillation [86.03990467785312]
We propose a novel multi-label knowledge distillation method.
On one hand, it exploits the informative semantic knowledge from the logits by dividing the multi-label learning problem into a set of binary classification problems.
On the other hand, it enhances the distinctiveness of the learned feature representations by leveraging the structural information of label-wise embeddings.
arXiv Detail & Related papers (2023-08-12T03:19:08Z) - Reliable Representations Learning for Incomplete Multi-View Partial Multi-Label Classification [78.15629210659516]
In this paper, we propose an incomplete multi-view partial multi-label classification network named RANK.
We break through the view-level weights inherent in existing methods and propose a quality-aware sub-network to dynamically assign quality scores to each view of each sample.
Our model is not only able to handle complete multi-view multi-label datasets, but also works on datasets with missing instances and labels.
arXiv Detail & Related papers (2023-03-30T03:09:25Z) - DICNet: Deep Instance-Level Contrastive Network for Double Incomplete
Multi-View Multi-Label Classification [20.892833511657166]
Multi-view multi-label data in the real world is commonly incomplete due to the uncertain factors of data collection and manual annotation.
We propose a deep instance-level contrastive network, namely DICNet, to deal with the double incomplete multi-view multi-label classification problem.
Our DICNet is adept in capturing consistent discriminative representations of multi-view multi-label data and avoiding the negative effects of missing views and missing labels.
arXiv Detail & Related papers (2023-03-15T04:24:01Z) - Cross-view Graph Contrastive Representation Learning on Partially
Aligned Multi-view Data [52.491074276133325]
Multi-view representation learning has developed rapidly over the past decades and has been applied in many fields.
We propose a new cross-view graph contrastive learning framework, which integrates multi-view information to align data and learn latent representations.
Experiments conducted on several real datasets demonstrate the effectiveness of the proposed method on the clustering and classification tasks.
arXiv Detail & Related papers (2022-11-08T09:19:32Z) - TSK Fuzzy System Towards Few Labeled Incomplete Multi-View Data
Classification [24.01191516774655]
A transductive semi-supervised incomplete multi-view TSK fuzzy system modeling method (SSIMV_TSK) is proposed to address these challenges.
The proposed method integrates missing view imputation, pseudo label learning of unlabeled data, and fuzzy system modeling into a single process to yield a model with interpretable fuzzy rules.
Experimental results on real datasets show that the proposed method significantly outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2021-10-08T11:41:06Z) - Unsupervised Person Re-Identification with Multi-Label Learning Guided
Self-Paced Clustering [48.31017226618255]
Unsupervised person re-identification (Re-ID) has drawn increasing research attention recently.
In this paper, we address the unsupervised person Re-ID with a conceptually novel yet simple framework, termed as Multi-label Learning guided self-paced Clustering (MLC)
MLC mainly learns discriminative features with three crucial modules, namely a multi-scale network, a multi-label learning module, and a self-paced clustering module.
arXiv Detail & Related papers (2021-03-08T07:30:13Z) - Embedded Deep Bilinear Interactive Information and Selective Fusion for
Multi-view Learning [70.67092105994598]
We propose a novel multi-view learning framework to make the multi-view classification better aimed at the above-mentioned two aspects.
In particular, we train different deep neural networks to learn various intra-view representations.
Experiments on six publicly available datasets demonstrate the effectiveness of the proposed method.
arXiv Detail & Related papers (2020-07-13T01:13:23Z) - A Concise yet Effective model for Non-Aligned Incomplete Multi-view and
Missing Multi-label Learning [29.827794317616497]
Learning from multi-view multi-label data inevitably confronts three challenges: missing labels, incomplete views, and non-aligned views.
Existing methods mainly concern the first two and commonly need multiple assumptions to attack them.
In this paper, we aim at meeting these under the least assumption by building a concise yet effective model with just one hyper- parameter.
arXiv Detail & Related papers (2020-05-03T03:38:24Z) - Automatically Discovering and Learning New Visual Categories with
Ranking Statistics [145.89790963544314]
We tackle the problem of discovering novel classes in an image collection given labelled examples of other classes.
We learn a general-purpose clustering model and use the latter to identify the new classes in the unlabelled data.
We evaluate our approach on standard classification benchmarks and outperform current methods for novel category discovery by a significant margin.
arXiv Detail & Related papers (2020-02-13T18:53:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.