A Human Behavioral Baseline for Collective Governance in Software Projects
- URL: http://arxiv.org/abs/2510.08956v1
- Date: Fri, 10 Oct 2025 03:04:46 GMT
- Title: A Human Behavioral Baseline for Collective Governance in Software Projects
- Authors: Mobina Noori, Mahasweta Chakraborti, Amy X Zhang, Seth Frey,
- Abstract summary: We study how open source communities describe participation and control through version controlled governance documents.<n>Using a corpus of 710 projects with paired snapshots, we parse text into actors, rules, actions, and objects, then group them and measure change with entropy for evenness, richness for diversity, and Jensen Shannon divergence for drift.
- Score: 12.47967674379491
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We study how open source communities describe participation and control through version controlled governance documents. Using a corpus of 710 projects with paired snapshots, we parse text into actors, rules, actions, and objects, then group them and measure change with entropy for evenness, richness for diversity, and Jensen Shannon divergence for drift. Projects define more roles and more actions over time, and these are distributed more evenly, while the composition of rules remains stable. These findings indicate that governance grows by expanding and balancing categories of participation without major shifts in prescriptive force. The analysis provides a reproducible baseline for evaluating whether future AI mediated workflows concentrate or redistribute authority.
Related papers
- Human Control Is the Anchor, Not the Answer: Early Divergence of Oversight in Agentic AI Communities [2.5424331328233207]
Oversight for agentic AI is often discussed as a single goal ("human control"), yet early adoption may produce role-specific expectations.<n>We present a comparative analysis of two newly active Reddit communities that reflect different socio-technical roles: r/OpenClaw (deployment and operations) and r/Moltbook (agent-centered social interaction)<n>Across both communities, "human control" is an operational meaning, but its meaning diverges: r/OpenClaw emphasizes execution guardrails and recovery (action-risk), while r/Moltbook emphasizes identity, legitimacy, and accountability in public interaction (meaning
arXiv Detail & Related papers (2026-02-10T00:10:20Z) - Hyperactive Minority Alter the Stability of Community Notes [39.13508775153173]
We study the emergence and visibility of Community Notes on X.<n>We show that contribution activity is highly concentrated.<n>We replicate the notes' emergence process by integrating the open-source implementation of the Community Notes consensus algorithm.
arXiv Detail & Related papers (2026-02-09T18:04:54Z) - Patterns in the Transition From Founder-Leadership to Community Governance of Open Source [12.47967674379491]
We analyze 637 GitHub repositories to trace transitions from founder-led to shared governance.<n>We find roles and actions grow, and regulation becomes more balanced.
arXiv Detail & Related papers (2025-09-19T15:55:08Z) - Are We on the Same Page? Examining Developer Perception Alignment in Open Source Code Reviews [2.66269503676104]
Code reviews are a critical aspect of open-source software (OSS) development, ensuring quality and fostering collaboration.<n>This study examines perceptions, challenges, and biases in OSS code review processes, focusing on the perspectives of Contributors and maintainers.
arXiv Detail & Related papers (2025-04-25T15:03:39Z) - When Disagreements Elicit Robustness: Investigating Self-Repair Capabilities under LLM Multi-Agent Disagreements [56.29265568399648]
We argue that disagreements prevent premature consensus and expand the explored solution space.<n>Disagreements on task-critical steps can derail collaboration depending on the topology of solution paths.
arXiv Detail & Related papers (2025-02-21T02:24:43Z) - Off-Policy Evaluation for Large Action Spaces via Policy Convolution [60.6953713877886]
Policy Convolution family of estimators uses latent structure within actions to strategically convolve the logging and target policies.
Experiments on synthetic and benchmark datasets demonstrate remarkable mean squared error (MSE) improvements when using PC.
arXiv Detail & Related papers (2023-10-24T01:00:01Z) - Enclosed Loops: How open source communities become datasets [2.4269101271105176]
Centralization in code hosting and package management in the 2010s created fundamental shifts in the social arrangements of open source ecosystems.
In this paper we examine Dependabot, Crater and Copilot as three nascent tools whose existence is predicated on centralized software at scale.
arXiv Detail & Related papers (2023-06-09T00:02:25Z) - Exploring the Trade-off between Plausibility, Change Intensity and
Adversarial Power in Counterfactual Explanations using Multi-objective
Optimization [73.89239820192894]
We argue that automated counterfactual generation should regard several aspects of the produced adversarial instances.
We present a novel framework for the generation of counterfactual examples.
arXiv Detail & Related papers (2022-05-20T15:02:53Z) - This Must Be the Place: Predicting Engagement of Online Communities in a
Large-scale Distributed Campaign [70.69387048368849]
We study the behavior of communities with millions of active members.
We develop a hybrid model, combining textual cues, community meta-data, and structural properties.
We demonstrate the applicability of our model through Reddit's r/place a large-scale online experiment.
arXiv Detail & Related papers (2022-01-14T08:23:16Z) - A game-theoretic analysis of networked system control for common-pool
resource management using multi-agent reinforcement learning [54.55119659523629]
Multi-agent reinforcement learning has recently shown great promise as an approach to networked system control.
Common-pool resources include arable land, fresh water, wetlands, wildlife, fish stock, forests and the atmosphere.
arXiv Detail & Related papers (2020-10-15T14:12:26Z) - Randomized Entity-wise Factorization for Multi-Agent Reinforcement
Learning [59.62721526353915]
Multi-agent settings in the real world often involve tasks with varying types and quantities of agents and non-agent entities.
Our method aims to leverage these commonalities by asking the question: What is the expected utility of each agent when only considering a randomly selected sub-group of its observed entities?''
arXiv Detail & Related papers (2020-06-07T18:28:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.