Beyond Correlation: Causal Multi-View Unsupervised Feature Selection Learning
- URL: http://arxiv.org/abs/2509.13763v1
- Date: Wed, 17 Sep 2025 07:22:22 GMT
- Title: Beyond Correlation: Causal Multi-View Unsupervised Feature Selection Learning
- Authors: Zongxin Shen, Yanyong Huang, Bin Wang, Jinyuan Chang, Shiyu Liu, Tianrui Li,
- Abstract summary: Multi-view unsupervised feature selection (MUFS) has recently received increasing attention for its promising ability in dimensionality reduction on unlabeled data.<n>Existing MUFS methods typically select discriminative features by capturing correlations between features and clustering labels.<n>We introduce a novel structural causal model, which reveals that existing methods may select irrelevant features because they overlook spurious correlations caused by confounders.
- Score: 14.848818417654316
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-view unsupervised feature selection (MUFS) has recently received increasing attention for its promising ability in dimensionality reduction on multi-view unlabeled data. Existing MUFS methods typically select discriminative features by capturing correlations between features and clustering labels. However, an important yet underexplored question remains: \textit{Are such correlations sufficiently reliable to guide feature selection?} In this paper, we analyze MUFS from a causal perspective by introducing a novel structural causal model, which reveals that existing methods may select irrelevant features because they overlook spurious correlations caused by confounders. Building on this causal perspective, we propose a novel MUFS method called CAusal multi-view Unsupervised feature Selection leArning (CAUSA). Specifically, we first employ a generalized unsupervised spectral regression model that identifies informative features by capturing dependencies between features and consensus clustering labels. We then introduce a causal regularization module that can adaptively separate confounders from multi-view data and simultaneously learn view-shared sample weights to balance confounder distributions, thereby mitigating spurious correlations. Thereafter, integrating both into a unified learning framework enables CAUSA to select causally informative features. Comprehensive experiments demonstrate that CAUSA outperforms several state-of-the-art methods. To our knowledge, this is the first in-depth study of causal multi-view feature selection in the unsupervised setting.
Related papers
- Cross-view Joint Learning for Mixed-Missing Multi-view Unsupervised Feature Selection [24.037106656954666]
We propose CLIM-FS, a novel IMUFS method designed to address the mixed-missing problem.<n>We integrate the imputation of both missing views and variables into a feature selection model based on nonnegative matrix factorization.<n>We fully leverage consensus cluster structure and cross-view local geometrical structure to enhance the synergistic learning process.
arXiv Detail & Related papers (2025-11-15T15:34:52Z) - Generalized Deep Multi-view Clustering via Causal Learning with Partially Aligned Cross-view Correspondence [72.41989962665285]
Multi-view clustering (MVC) aims to explore the common clustering structure across multiple views.<n>However, real-world scenarios often present a challenge as only partial data is consistently aligned across different views.<n>We design a causal multi-view clustering network, termed CauMVC, to tackle this problem.
arXiv Detail & Related papers (2025-09-19T14:31:40Z) - TRUST-FS: Tensorized Reliable Unsupervised Multi-View Feature Selection for Incomplete Data [15.2618846897032]
Multi-view unsupervised feature selection (MUFS) has attracted increasing research interest in recent years.<n>Existing methods for incomplete multi-view data are limited to handling missing views and are unable to address the more general scenario of missing variables.<n>We propose a novel MUFS method for incomplete multi-view data termed Reliable Unview Feature Selection ( TRUSTFS)<n>TrustFS simultaneously performs feature selection, missing imputation and view weight learning within a unified factorization framework.
arXiv Detail & Related papers (2025-09-16T15:54:15Z) - Causally-Aware Unsupervised Feature Selection Learning [15.20376149047008]
Unsupervised feature selection (UFS) has recently gained attention for its effectiveness in processing unlabeled high-dimensional data.<n>Previous graph-based methods fail to account for the differing impacts of non-causal and causal features in constructing the similarity graph.<n>A novel UFS method, called Causally-Aware UnSupErvised Feature Selection learning (CAUSE-FS), is proposed.
arXiv Detail & Related papers (2024-10-16T04:41:38Z) - Regularized Contrastive Partial Multi-view Outlier Detection [76.77036536484114]
We propose a novel method named Regularized Contrastive Partial Multi-view Outlier Detection (RCPMOD)
In this framework, we utilize contrastive learning to learn view-consistent information and distinguish outliers by the degree of consistency.
Experimental results on four benchmark datasets demonstrate that our proposed approach could outperform state-of-the-art competitors.
arXiv Detail & Related papers (2024-08-02T14:34:27Z) - Adaptive Collaborative Correlation Learning-based Semi-Supervised Multi-Label Feature Selection [25.195711274756334]
We propose an Adaptive Collaborative Correlation lEarning-based Semi-Supervised Multi-label Feature Selection (Access-MFS) method to address these issues.<n> Specifically, a generalized regression model equipped with an extended uncorrelated constraint is introduced to select discriminative yet irrelevant features.<n>The correlation instance and label correlation are integrated into the proposed regression model to adaptively learn both the sample similarity graph and the label similarity graph.
arXiv Detail & Related papers (2024-06-18T01:47:38Z) - Mitigating Shortcut Learning with Diffusion Counterfactuals and Diverse Ensembles [104.60508550106618]
We propose DiffDiv, an ensemble diversification framework exploiting Diffusion Probabilistic Models (DPMs)<n>We show that DPMs can generate images with novel feature combinations, even when trained on samples displaying correlated input features.<n>We show that DPM-guided diversification is sufficient to remove dependence on shortcut cues, without a need for additional supervised signals.
arXiv Detail & Related papers (2023-11-23T15:47:33Z) - Causal Feature Selection via Transfer Entropy [59.999594949050596]
Causal discovery aims to identify causal relationships between features with observational data.
We introduce a new causal feature selection approach that relies on the forward and backward feature selection procedures.
We provide theoretical guarantees on the regression and classification errors for both the exact and the finite-sample cases.
arXiv Detail & Related papers (2023-10-17T08:04:45Z) - PatchMix Augmentation to Identify Causal Features in Few-shot Learning [55.64873998196191]
Few-shot learning aims to transfer knowledge learned from base with sufficient categories labelled data to novel categories with scarce known information.
We propose a novel data augmentation strategy dubbed as PatchMix that can break this spurious dependency.
We show that such an augmentation mechanism, different from existing ones, is able to identify the causal features.
arXiv Detail & Related papers (2022-11-29T08:41:29Z) - Bayesian Sparse Factor Analysis with Kernelized Observations [67.60224656603823]
Multi-view problems can be faced with latent variable models.
High-dimensionality and non-linear issues are traditionally handled by kernel methods.
We propose merging both approaches into single model.
arXiv Detail & Related papers (2020-06-01T14:25:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.