Analyzing Modularity Maximization in Approximation, Heuristic, and Graph
Neural Network Algorithms for Community Detection
- URL: http://arxiv.org/abs/2310.10898v2
- Date: Wed, 10 Jan 2024 21:28:46 GMT
- Title: Analyzing Modularity Maximization in Approximation, Heuristic, and Graph
Neural Network Algorithms for Community Detection
- Authors: Samin Aref and Mahdi Mostajabdaveh
- Abstract summary: Our study assesses the performance of various modularity-based algorithms in obtaining optimal partitions.
We observe that near-optimal partitions are often disproportionately dissimilar to any optimal partition.
If modularity is to be used for detecting communities, we recommend approximate optimization algorithms for a more methodologically sound usage of modularity.
- Score: 1.1220356822493742
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Community detection, which involves partitioning nodes within a network, has
widespread applications across computational sciences. Modularity-based
algorithms identify communities by attempting to maximize the modularity
function across network node partitions. Our study assesses the performance of
various modularity-based algorithms in obtaining optimal partitions. Our
analysis utilizes 104 networks, including both real-world instances from
diverse contexts and modular graphs from two families of synthetic benchmarks.
We analyze ten inexact modularity-based algorithms against the exact integer
programming baseline that globally optimizes modularity. Our comparative
analysis includes eight heuristics, two variants of a graph neural network
algorithm, and nine variations of the Bayan approximation algorithm.
Our findings reveal that the average modularity-based heuristic yields
optimal partitions in only 43.9% of the 104 networks analyzed. Graph neural
networks and approximate Bayan, on average, achieve optimality on 68.7% and
82.3% of the networks respectively. Additionally, our analysis of three
partition similarity metrics exposes substantial dissimilarities between
high-modularity sub-optimal partitions and any optimal partition of the
networks. We observe that near-optimal partitions are often disproportionately
dissimilar to any optimal partition. Taken together, our analysis points to a
crucial limitation of the commonly used modularity-based methods: they rarely
produce an optimal partition or a partition resembling an optimal partition
even on networks with modular structures. If modularity is to be used for
detecting communities, we recommend approximate optimization algorithms for a
more methodologically sound usage of modularity within its applicability
limits.
Related papers
- Ensemble Quadratic Assignment Network for Graph Matching [52.20001802006391]
Graph matching is a commonly used technique in computer vision and pattern recognition.
Recent data-driven approaches have improved the graph matching accuracy remarkably.
We propose a graph neural network (GNN) based approach to combine the advantages of data-driven and traditional methods.
arXiv Detail & Related papers (2024-03-11T06:34:05Z) - Learning Unnormalized Statistical Models via Compositional Optimization [73.30514599338407]
Noise-contrastive estimation(NCE) has been proposed by formulating the objective as the logistic loss of the real data and the artificial noise.
In this paper, we study it a direct approach for optimizing the negative log-likelihood of unnormalized models.
arXiv Detail & Related papers (2023-06-13T01:18:16Z) - Heuristic Modularity Maximization Algorithms for Community Detection
Rarely Return an Optimal Partition or Anything Similar [0.0]
We investigate the extent to which current modularity algorithms succeed in returning maximum-modularity partitions.
We compare eight existing algorithms against an exact integer programming method that globally maximizes modularity.
Our results show that near-optimal partitions are often disproportionately dissimilar to any optimal partition.
arXiv Detail & Related papers (2023-02-28T16:11:08Z) - Bayan Algorithm: Detecting Communities in Networks Through Exact and Approximate Optimization of Modularity [0.8450726582512176]
We compare 30 community detection methods including our algorithm that offers optimality and approximation guarantees: the Bayan algorithm.
Compared to the partitions from 29 other algorithms, maximum-modularity partitions have the best medians for description length, coverage, performance, average conductance, and well clusteredness.
A Python implementation of the Bayan (bayanpy) is publicly available through the package installer for Python.
arXiv Detail & Related papers (2022-09-10T00:17:09Z) - Late Fusion Multi-view Clustering via Global and Local Alignment
Maximization [61.89218392703043]
Multi-view clustering (MVC) optimally integrates complementary information from different views to improve clustering performance.
Most of existing approaches directly fuse multiple pre-specified similarities to learn an optimal similarity matrix for clustering.
We propose late fusion MVC via alignment to address these issues.
arXiv Detail & Related papers (2022-08-02T01:49:31Z) - Amortized Implicit Differentiation for Stochastic Bilevel Optimization [53.12363770169761]
We study a class of algorithms for solving bilevel optimization problems in both deterministic and deterministic settings.
We exploit a warm-start strategy to amortize the estimation of the exact gradient.
By using this framework, our analysis shows these algorithms to match the computational complexity of methods that have access to an unbiased estimate of the gradient.
arXiv Detail & Related papers (2021-11-29T15:10:09Z) - Partition-based formulations for mixed-integer optimization of trained
ReLU neural networks [66.88252321870085]
This paper introduces a class of mixed-integer formulations for trained ReLU neural networks.
At one extreme, one partition per input recovers the convex hull of a node, i.e., the tightest possible formulation for each node.
arXiv Detail & Related papers (2021-02-08T17:27:34Z) - Partition of unity networks: deep hp-approximation [0.0]
We propose partition of unity networks (POUnets) which incorporate these elements directly into the architecture.
POUnets yield hp-convergence for smooth functions and consistently outperform piecewise functions with large numbers of discontinuities.
arXiv Detail & Related papers (2021-01-27T08:26:11Z) - Efficient semidefinite-programming-based inference for binary and
multi-class MRFs [83.09715052229782]
We propose an efficient method for computing the partition function or MAP estimate in a pairwise MRF.
We extend semidefinite relaxations from the typical binary MRF to the full multi-class setting, and develop a compact semidefinite relaxation that can again be solved efficiently using the solver.
arXiv Detail & Related papers (2020-12-04T15:36:29Z) - Distributed Optimization, Averaging via ADMM, and Network Topology [0.0]
We study the connection between network topology and convergence rates for different algorithms on a real world problem of sensor localization.
We also show interesting connections between ADMM and lifted Markov chains besides providing an explicitly characterization of its convergence.
arXiv Detail & Related papers (2020-09-05T21:44:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.