Protecting Individual Interests across Clusters: Spectral Clustering
with Guarantees
- URL: http://arxiv.org/abs/2105.03714v1
- Date: Sat, 8 May 2021 15:03:25 GMT
- Title: Protecting Individual Interests across Clusters: Spectral Clustering
with Guarantees
- Authors: Shubham Gupta and Ambedkar Dukkipati
- Abstract summary: We propose an individual fairness criterion for clustering a graph $mathcalG$ that requires each cluster to contain an adequate number of members connected to the individual.
We devise a spectral clustering algorithm to find fair clusters under a given representation graph.
- Score: 20.350342151402963
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Studies related to fairness in machine learning have recently gained traction
due to its ever-expanding role in high-stakes decision making. For example, it
may be desirable to ensure that all clusters discovered by an algorithm have
high gender diversity. Previously, these problems have been studied under a
setting where sensitive attributes, with respect to which fairness conditions
impose diversity across clusters, are assumed to be observable; hence,
protected groups are readily available. Most often, this may not be true, and
diversity or individual interests can manifest as an intrinsic or latent
feature of a social network. For example, depending on latent sensitive
attributes, individuals interact with each other and represent each other's
interests, resulting in a network, which we refer to as a representation graph.
Motivated by this, we propose an individual fairness criterion for clustering a
graph $\mathcal{G}$ that requires each cluster to contain an adequate number of
members connected to the individual under a representation graph $\mathcal{R}$.
We devise a spectral clustering algorithm to find fair clusters under a given
representation graph. We further propose a variant of the stochastic block
model and establish our algorithm's weak consistency under this model. Finally,
we present experimental results to corroborate our theoretical findings.
Related papers
- Quality check of a sample partition using multinomial distribution [0.0]
We advocate a novel measure for the purpose of checking the quality of a cluster partition for a sample into several distinct classes.
We apply the multinomial distribution to the distances of data members, clustered in a group, from their respective cluster representatives.
arXiv Detail & Related papers (2024-04-11T14:14:58Z) - Reinforcement Graph Clustering with Unknown Cluster Number [91.4861135742095]
We propose a new deep graph clustering method termed Reinforcement Graph Clustering.
In our proposed method, cluster number determination and unsupervised representation learning are unified into a uniform framework.
In order to conduct feedback actions, the clustering-oriented reward function is proposed to enhance the cohesion of the same clusters and separate the different clusters.
arXiv Detail & Related papers (2023-08-13T18:12:28Z) - Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model [79.46465138631592]
We devise an efficient algorithm that recovers clusters using the observed labels.
We present Instance-Adaptive Clustering (IAC), the first algorithm whose performance matches these lower bounds both in expectation and with high probability.
arXiv Detail & Related papers (2023-06-18T08:46:06Z) - Fair Correlation Clustering in Forests [8.810926150873991]
A clustering is said to be fair, if each cluster has the same distribution of manifestations of a sensitive attribute as the whole input set.
This is motivated by various applications where the objects to be clustered have sensitive attributes that should not be over- or underrepresented.
We consider restricted graph classes which allow us to characterize the distributions of sensitive attributes for which this form of fairness is tractable.
arXiv Detail & Related papers (2023-02-22T11:27:06Z) - Fairness via Adversarial Attribute Neighbourhood Robust Learning [49.93775302674591]
We propose a principled underlineRobust underlineAdversarial underlineAttribute underlineNeighbourhood (RAAN) loss to debias the classification head.
arXiv Detail & Related papers (2022-10-12T23:39:28Z) - Learning Statistical Representation with Joint Deep Embedded Clustering [2.1267423178232407]
StatDEC is an unsupervised framework for joint statistical representation learning and clustering.
Our experiments show that using these representations, one can considerably improve results on imbalanced image clustering across a variety of image datasets.
arXiv Detail & Related papers (2021-09-11T09:26:52Z) - You Never Cluster Alone [150.94921340034688]
We extend the mainstream contrastive learning paradigm to a cluster-level scheme, where all the data subjected to the same cluster contribute to a unified representation.
We define a set of categorical variables as clustering assignment confidence, which links the instance-level learning track with the cluster-level one.
By reparametrizing the assignment variables, TCC is trained end-to-end, requiring no alternating steps.
arXiv Detail & Related papers (2021-06-03T14:59:59Z) - Deep Fair Discriminative Clustering [24.237000220172906]
We study a general notion of group-level fairness for binary and multi-state protected status variables (PSVs)
We propose a refinement learning algorithm to combine the clustering goal with the fairness objective to learn fair clusters adaptively.
Our framework shows promising results for novel clustering tasks including flexible fairness constraints, multi-state PSVs and predictive clustering.
arXiv Detail & Related papers (2021-05-28T23:50:48Z) - MultiFair: Multi-Group Fairness in Machine Learning [52.24956510371455]
We study multi-group fairness in machine learning (MultiFair)
We propose a generic end-to-end algorithmic framework to solve it.
Our proposed framework is generalizable to many different settings.
arXiv Detail & Related papers (2021-05-24T02:30:22Z) - Graph Contrastive Clustering [131.67881457114316]
We propose a novel graph contrastive learning framework, which is then applied to the clustering task and we come up with the Graph Constrastive Clustering(GCC) method.
Specifically, on the one hand, the graph Laplacian based contrastive loss is proposed to learn more discriminative and clustering-friendly features.
On the other hand, a novel graph-based contrastive learning strategy is proposed to learn more compact clustering assignments.
arXiv Detail & Related papers (2021-04-03T15:32:49Z) - A Notion of Individual Fairness for Clustering [22.288902523866867]
A common distinction in fair machine learning, in particular in fair classification, is between group fairness and individual fairness.
In this paper, we propose a natural notion of individual fairness for clustering. Our notion asks that every data point, on average, is closer to the points in its own cluster than to the points in any other cluster.
arXiv Detail & Related papers (2020-06-08T21:41:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.