Fairness-aware Multi-view Clustering
- URL: http://arxiv.org/abs/2302.05788v1
- Date: Sat, 11 Feb 2023 21:36:42 GMT
- Title: Fairness-aware Multi-view Clustering
- Authors: Lecheng Zheng, Yada Zhu, Jingrui He
- Abstract summary: We propose a fairness-aware multi-view clustering method named FairMVC.
It incorporates the group fairness constraint into the soft membership assignment for each cluster to ensure that the fraction of different groups in each cluster is approximately identical to the entire data set.
We also propose novel regularizers to handle heterogeneous data in complex scenarios with missing data or noisy features.
- Score: 41.479310583848246
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the era of big data, we are often facing the challenge of data
heterogeneity and the lack of label information simultaneously. In the
financial domain (e.g., fraud detection), the heterogeneous data may include
not only numerical data (e.g., total debt and yearly income), but also text and
images (e.g., financial statement and invoice images). At the same time, the
label information (e.g., fraud transactions) may be missing for building
predictive models. To address these challenges, many state-of-the-art
multi-view clustering methods have been proposed and achieved outstanding
performance. However, these methods typically do not take into consideration
the fairness aspect and are likely to generate biased results using sensitive
information such as race and gender. Therefore, in this paper, we propose a
fairness-aware multi-view clustering method named FairMVC. It incorporates the
group fairness constraint into the soft membership assignment for each cluster
to ensure that the fraction of different groups in each cluster is
approximately identical to the entire data set. Meanwhile, we adopt the idea of
both contrastive learning and non-contrastive learning and propose novel
regularizers to handle heterogeneous data in complex scenarios with missing
data or noisy features. Experimental results on real-world data sets
demonstrate the effectiveness and efficiency of the proposed framework. We also
derive insights regarding the relative performance of the proposed regularizers
in various scenarios.
Related papers
- Evidential Deep Partial Multi-View Classification With Discount Fusion [24.139495744683128]
We propose a novel framework called Evidential Deep Partial Multi-View Classification (EDP-MVC)
We use K-means imputation to address missing views, creating a complete set of multi-view data.
The potential conflicts and uncertainties within this imputed data can affect the reliability of downstream inferences.
arXiv Detail & Related papers (2024-08-23T14:50:49Z) - Tackling Diverse Minorities in Imbalanced Classification [80.78227787608714]
Imbalanced datasets are commonly observed in various real-world applications, presenting significant challenges in training classifiers.
We propose generating synthetic samples iteratively by mixing data samples from both minority and majority classes.
We demonstrate the effectiveness of our proposed framework through extensive experiments conducted on seven publicly available benchmark datasets.
arXiv Detail & Related papers (2023-08-28T18:48:34Z) - Rethinking Data Heterogeneity in Federated Learning: Introducing a New
Notion and Standard Benchmarks [65.34113135080105]
We show that not only the issue of data heterogeneity in current setups is not necessarily a problem but also in fact it can be beneficial for the FL participants.
Our observations are intuitive.
Our code is available at https://github.com/MMorafah/FL-SC-NIID.
arXiv Detail & Related papers (2022-09-30T17:15:19Z) - Fair Federated Learning for Heterogeneous Face Data [10.707311210901548]
We consider the problem of achieving fair classification in Federated Learning (FL) under data heterogeneity.
Most of the approaches proposed for fair classification require diverse data that represent the different demographic groups involved.
In contrast, it is common for each client to own data that represents only a single demographic group.
arXiv Detail & Related papers (2021-09-06T10:44:16Z) - MultiFair: Multi-Group Fairness in Machine Learning [52.24956510371455]
We study multi-group fairness in machine learning (MultiFair)
We propose a generic end-to-end algorithmic framework to solve it.
Our proposed framework is generalizable to many different settings.
arXiv Detail & Related papers (2021-05-24T02:30:22Z) - Error-Robust Multi-View Clustering: Progress, Challenges and
Opportunities [67.54503077766171]
Since label information is often expensive to acquire, multi-view clustering has gained growing interest.
Error-robust multi-view clustering approaches with explicit error removal formulation can be structured into five broad research categories.
This survey summarizes and reviews recent advances in error-robust clustering for multi-view data.
arXiv Detail & Related papers (2021-05-07T04:03:02Z) - Joint Representation Learning and Novel Category Discovery on Single-
and Multi-modal Data [16.138075558585516]
We present a generic, end-to-end framework to jointly learn a reliable representation and assign clusters to unlabelled data.
We employ Winner-Take-All (WTA) hashing algorithm on the shared representation space to generate pairwise pseudo labels for unlabelled data.
We thoroughly evaluate our framework on large-scale multi-modal video benchmarks Kinetics-400 and VGG-Sound, and image benchmarks CIFAR10, CIFAR100 and ImageNet.
arXiv Detail & Related papers (2021-04-26T15:56:16Z) - Learning to Generate Fair Clusters from Demonstrations [27.423983748614198]
We show how to identify the intended fairness constraint for a problem based on limited demonstrations from an expert.
We present an algorithm to identify the fairness metric from demonstrations and generate clusters using existing off-the-shelf clustering techniques.
We investigate how to generate interpretable solutions using our approach.
arXiv Detail & Related papers (2021-02-08T03:09:33Z) - Causal Feature Selection for Algorithmic Fairness [61.767399505764736]
We consider fairness in the integration component of data management.
We propose an approach to identify a sub-collection of features that ensure the fairness of the dataset.
arXiv Detail & Related papers (2020-06-10T20:20:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.