Integrated strong reciprocity enables productive punishment and protective defection
- URL: http://arxiv.org/abs/2601.03681v1
- Date: Wed, 07 Jan 2026 08:03:49 GMT
- Title: Integrated strong reciprocity enables productive punishment and protective defection
- Authors: Tatsuya Sasaki, Satochi Uchida
- Abstract summary: We analyze an evolutionary game model that integrates upstream and downstream reciprocity with costly punishment. We demonstrate that ISR admits a stable mixed equilibrium of ISR and unconditional defection. At the same time, the mixed equilibrium of ISR and ALLD remains robust under modest complexity.
- Score: 1.4323566945483497
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cooperation in large groups and one-shot interactions is often hindered by freeloading. Punishment can enforce cooperation, but it is usually regarded as wasteful because the costs of punishing offset its benefits. Here, we analyze an evolutionary game model that integrates upstream and downstream reciprocity with costly punishment: integrated strong reciprocity (ISR). We demonstrate that ISR admits a stable mixed equilibrium of ISR and unconditional defection (ALLD), and that costly punishment can become productive: When sufficiently efficient, it raises collective welfare above the no-punishment baseline. ALLD players persist as evolutionary shields, preventing invasion by unconditional cooperation (ALLC) or alternative conditional strategies (e.g., antisocial punishment). At the same time, the mixed equilibrium of ISR and ALLD remains robust under modest complexity costs that destabilize other symmetric cooperative systems.
Related papers
- Uncertainty-Aware Jamming Mitigation with Active RIS: A Robust Stackelberg Game Approach [65.06640919319413]
This paper investigates jamming mitigation by leveraging an active reconfigurable intelligent surface (ARIS). We adopt the Stackelberg game formulation to model the strategic interaction between the legitimate side and the adversary. We first derive the optimal jamming policy as the follower's best response, which is then incorporated into the legitimate-side optimization for robust anti-jamming design.
arXiv Detail & Related papers (2026-02-20T12:02:01Z)
- Unifying Stable Optimization and Reference Regularization in RLHF [64.16830602324345]
This paper introduces a unified regularization approach that balances objectives of preventing reward hacking and maintaining stable policy updates. Our simple yet principled alignment objective yields a weighted supervised fine-tuning loss with a superior trade-off, which demonstrably improves both alignment results and implementation complexity.
arXiv Detail & Related papers (2026-02-12T03:31:19Z)
- Toward a Sustainable Federated Learning Ecosystem: A Practical Least Core Mechanism for Payoff Allocation [71.86087908416255]
We introduce a payoff allocation framework based on the least core (LC) concept. Unlike traditional methods, the LC prioritizes the cohesion of the federation by minimizing the maximum dissatisfaction. Case studies in federated intrusion detection demonstrate that our mechanism correctly identifies pivotal contributors and strategic alliances.
arXiv Detail & Related papers (2026-02-03T11:10:50Z)
- The Axiom of Consent: Friction Dynamics in Multi-Agent Coordination [0.0]
This paper derives a formal framework for analyzing coordination friction from a single axiom. From this axiom of consent, we establish the kernel triple $(,, )$ characterizing any resource allocation configuration. Applications to cryptocurrency governance and political systems demonstrate that the same equations govern friction dynamics across domains.
arXiv Detail & Related papers (2026-01-10T21:28:41Z)
- Rational Adversaries and the Maintenance of Fragility: A Game-Theoretic Theory of Rational Stagnation [0.0]
This paper explains such "rational stagnation" as an equilibrium sustained by a rational adversary. Applications to social-media algorithms and political trust illustrate how adversarial can deliberately preserve rationality.
arXiv Detail & Related papers (2025-10-25T09:28:15Z)
- Learnable Mixed Nash Equilibria are Collectively Rational [17.93053401419066]
We show that uniform stability determines the last-iterate convergence behavior for the family of incremental smoothed best-response dynamics. Unlike dynamics around strict equilibria, which can stabilize to socially-inefficient solutions, individually utility-seeking behaviors near mixed Nash equilibria lead to collective rationality.
arXiv Detail & Related papers (2025-10-16T17:25:32Z)
- AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning [78.5751183537704]
AdvEvo-MARL is a co-evolutionary multi-agent reinforcement learning framework that internalizes safety into task agents. Rather than relying on external guards, AdvEvo-MARL jointly optimizes attackers and defenders.
arXiv Detail & Related papers (2025-10-02T02:06:30Z)
- Integrating upstream and downstream reciprocity stabilizes cooperator-defector coexistence in N-player giving games [1.1381558444077822]
We show how pay-it-forward chains and reputation systems can jointly maintain social cooperation despite cognitive limitations and group size challenges.
arXiv Detail & Related papers (2025-09-05T01:49:26Z)
- Language Models and Logic Programs for Trustworthy Financial Reasoning [50.73061215297832]
Tax filing requires complex reasoning, combining application of overlapping rules with numerical calculations. We propose an approach that integrates LLMs with a symbolic solver to calculate tax obligations. We show how combining up-front translation of plain-text rules into formal logic programs, combined with intelligently retrieved exemplars for formal case representations, can dramatically improve performance.
arXiv Detail & Related papers (2025-08-28T17:55:07Z)
- Resolving CAP Through Automata-Theoretic Economic Design: A Unified Mathematical Framework for Real-Time Partition-Tolerant Systems [0.0]
The CAP theorem asserts a trilemma between consistency, availability, and partition tolerance. This paper introduces a rigorous automata-theoretic and economically grounded framework that reframes the CAP trade-off as a constraint optimization problem.
arXiv Detail & Related papers (2025-07-03T09:21:43Z)
- On Tractable $Φ$-Equilibria in Non-Concave Games [53.212133025684224]
We study tractable $\Phi$-equilibria in non-concave games. We show that when $\Phi$ is finite, there exists an efficient uncoupled learning algorithm that converges to the corresponding $\Phi$-equilibria.
arXiv Detail & Related papers (2024-03-13T01:51:30Z)
- Conservative DDPG -- Pessimistic RL without Ensemble [48.61228614796803]
DDPG is hindered by the overestimation bias problem.
Traditional solutions to this bias involve ensemble-based methods.
We propose a straightforward solution using a $Q$-target and incorporating a behavioral cloning (BC) loss penalty.
arXiv Detail & Related papers (2024-03-08T23:59:38Z)
- How Bad is Selfish Driving? Bounding the Inefficiency of Equilibria in Urban Driving Games [64.71476526716668]
We study the (in)efficiency of any equilibrium players might agree to play.
We obtain guarantees that refine existing bounds on the Price of Anarchy.
Although the obtained guarantees concern open-loop trajectories, we observe efficient equilibria even when agents employ closed-loop policies.
arXiv Detail & Related papers (2022-10-24T09:32:40Z)
- Enhanced steady-state coherence via repeated system-bath interactions [0.0]
Steady-state coherence (SSC) from system-bath interaction proves that quantum effects can appear without an external drive.
We predict the generation of SSC if the target system repeatedly interacts with independent and non-correlated bath elements.
We show that SSC substantially increases if the target system interacts collectively with more than one bath element at a time.
arXiv Detail & Related papers (2020-08-12T09:40:24Z)
- Controlling the Outbreak of COVID-19: A Noncooperative Game Perspective [61.558752620308134]
Isolation and social distancing seem to be effective preventive measures to control this pandemic.
We propose a noncooperative game that can provide an incentive for maintaining social distancing to prevent the spread of COVID-19.
Numerical results show that the individual incentive increases more than 85% with an increasing percentage of home isolation.
arXiv Detail & Related papers (2020-07-27T04:28:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.