Heuristic Modularity Maximization Algorithms for Community Detection
Rarely Return an Optimal Partition or Anything Similar
- URL: http://arxiv.org/abs/2302.14698v3
- Date: Sun, 25 Jun 2023 14:36:55 GMT
- Title: Heuristic Modularity Maximization Algorithms for Community Detection
Rarely Return an Optimal Partition or Anything Similar
- Authors: Samin Aref, Mahdi Mostajabdaveh, and Hriday Chheda
- Abstract summary: We investigate the extent to which current modularity algorithms succeed in returning maximum-modularity partitions.
We compare eight existing algorithms against an exact integer programming method that globally maximizes modularity.
Our results show that near-optimal partitions are often disproportionately dissimilar to any optimal partition.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Community detection is a fundamental problem in computational sciences with
extensive applications in various fields. The most commonly used methods are
the algorithms designed to maximize modularity over different partitions of the
network nodes. Using 80 real and random networks from a wide range of contexts,
we investigate the extent to which current heuristic modularity maximization
algorithms succeed in returning maximum-modularity (optimal) partitions. We
evaluate (1) the ratio of the algorithms' output modularity to the maximum
modularity for each input graph, and (2) the maximum similarity between their
output partition and any optimal partition of that graph. We compare eight
existing heuristic algorithms against an exact integer programming method that
globally maximizes modularity. The average modularity-based heuristic algorithm
returns optimal partitions for only 19.4% of the 80 graphs considered.
Additionally, results on adjusted mutual information reveal substantial
dissimilarity between the sub-optimal partitions and any optimal partition of
the networks in our experiments. More importantly, our results show that
near-optimal partitions are often disproportionately dissimilar to any optimal
partition. Taken together, our analysis points to a crucial limitation of
commonly used modularity-based heuristics for discovering communities: they
rarely produce an optimal partition or a partition resembling an optimal
partition. If modularity is to be used for detecting communities, exact or
approximate optimization algorithms are recommendable for a more
methodologically sound usage of modularity within its applicability limits.
Related papers
- Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications [79.53938312089308]
The MIDX-Sampler is a novel adaptive sampling strategy based on an inverted multi-index approach.
Our method is backed by rigorous theoretical analysis, addressing key concerns such as sampling bias, gradient bias, convergence rates, and generalization error bounds.
arXiv Detail & Related papers (2025-01-15T04:09:21Z) - Analyzing Modularity Maximization in Approximation, Heuristic, and Graph
Neural Network Algorithms for Community Detection [1.1220356822493742]
Our study assesses the performance of various modularity-based algorithms in obtaining optimal partitions.
We observe that near-optimal partitions are often disproportionately dissimilar to any optimal partition.
If modularity is to be used for detecting communities, we recommend approximate optimization algorithms for a more methodologically sound usage of modularity.
arXiv Detail & Related papers (2023-10-17T00:12:18Z) - Practical Parallel Algorithms for Non-Monotone Submodular Maximization [20.13836086815112]
Submodular has found extensive applications in various domains within the field of artificial intelligence.
One of the parallelizability of a submodular algorithm is its adaptive complexity, which indicates the number of rounds where a number of queries to the objective function can be executed in parallel.
We propose the first algorithm with both provable approximation ratio and sub adaptive complexity for the problem of non-monotone submodepsular subject to a $k$-system.
arXiv Detail & Related papers (2023-08-21T11:48:34Z) - Learning Unnormalized Statistical Models via Compositional Optimization [73.30514599338407]
Noise-contrastive estimation(NCE) has been proposed by formulating the objective as the logistic loss of the real data and the artificial noise.
In this paper, we study it a direct approach for optimizing the negative log-likelihood of unnormalized models.
arXiv Detail & Related papers (2023-06-13T01:18:16Z) - Online and Streaming Algorithms for Constrained $k$-Submodular
Maximization [25.15934533974176]
Constrained $k$-submodular is a general framework that captures many discrete optimization problems such as ad allocation, influence, personalized recommendation, and many others.
In this work, we develop single-pass streaming and online algorithms for $k$-submodular constrained with both monotone and general (possibly non-monotone) objectives.
arXiv Detail & Related papers (2023-05-25T12:53:17Z) - Bayan Algorithm: Detecting Communities in Networks Through Exact and Approximate Optimization of Modularity [0.8450726582512176]
We compare 30 community detection methods including our algorithm that offers optimality and approximation guarantees: the Bayan algorithm.
Compared to the partitions from 29 other algorithms, maximum-modularity partitions have the best medians for description length, coverage, performance, average conductance, and well clusteredness.
A Python implementation of the Bayan (bayanpy) is publicly available through the package installer for Python.
arXiv Detail & Related papers (2022-09-10T00:17:09Z) - Late Fusion Multi-view Clustering via Global and Local Alignment
Maximization [61.89218392703043]
Multi-view clustering (MVC) optimally integrates complementary information from different views to improve clustering performance.
Most of existing approaches directly fuse multiple pre-specified similarities to learn an optimal similarity matrix for clustering.
We propose late fusion MVC via alignment to address these issues.
arXiv Detail & Related papers (2022-08-02T01:49:31Z) - Lower Bounds and Optimal Algorithms for Smooth and Strongly Convex
Decentralized Optimization Over Time-Varying Networks [79.16773494166644]
We consider the task of minimizing the sum of smooth and strongly convex functions stored in a decentralized manner across the nodes of a communication network.
We design two optimal algorithms that attain these lower bounds.
We corroborate the theoretical efficiency of these algorithms by performing an experimental comparison with existing state-of-the-art methods.
arXiv Detail & Related papers (2021-06-08T15:54:44Z) - Minimax Optimization with Smooth Algorithmic Adversaries [59.47122537182611]
We propose a new algorithm for the min-player against smooth algorithms deployed by an adversary.
Our algorithm is guaranteed to make monotonic progress having no limit cycles, and to find an appropriate number of gradient ascents.
arXiv Detail & Related papers (2021-06-02T22:03:36Z) - Partition-based formulations for mixed-integer optimization of trained
ReLU neural networks [66.88252321870085]
This paper introduces a class of mixed-integer formulations for trained ReLU neural networks.
At one extreme, one partition per input recovers the convex hull of a node, i.e., the tightest possible formulation for each node.
arXiv Detail & Related papers (2021-02-08T17:27:34Z) - Efficient semidefinite-programming-based inference for binary and
multi-class MRFs [83.09715052229782]
We propose an efficient method for computing the partition function or MAP estimate in a pairwise MRF.
We extend semidefinite relaxations from the typical binary MRF to the full multi-class setting, and develop a compact semidefinite relaxation that can again be solved efficiently using the solver.
arXiv Detail & Related papers (2020-12-04T15:36:29Z) - Batch greedy maximization of non-submodular functions: Guarantees and
applications to experimental design [0.0]
We analyze greedys for cardinality constrained of non-submodular non-decreasing set functions.
Our theoretical guarantees are characterized by the combination of submodularity and supermodularity ratios.
arXiv Detail & Related papers (2020-06-03T18:58:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.