Social Network Structure Shapes Innovation: Experience-sharing in RL
with SAPIENS
- URL: http://arxiv.org/abs/2206.05060v1
- Date: Fri, 10 Jun 2022 12:47:45 GMT
- Title: Social Network Structure Shapes Innovation: Experience-sharing in RL
with SAPIENS
- Authors: Eleni Nisioti, Mateo Mahaut, Pierre-Yves Oudeyer, Ida Momennejad,
Clément Moulin-Frier
- Abstract summary: In dynamic topologies, humans oscillate between innovating individually or in small clusters, and then sharing outcomes with others.
We show that experience sharing within a dynamic topology achieves the highest level of innovation across tasks.
These contributions can advance our understanding of optimal AI-AI, human-human, and human-AI collaborative networks.
- Score: 16.388726429030346
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The human cultural repertoire relies on innovation: our ability to
continuously and hierarchically explore how existing elements can be combined
to create new ones. Innovation is not solitary; it relies on collective
accumulation and merging of previous solutions. Machine learning approaches
commonly assume that fully connected multi-agent networks are best suited for
innovation. However, human laboratory and field studies have shown that
hierarchical innovation is more robustly achieved by dynamic communication
topologies. In dynamic topologies, humans oscillate between innovating
individually or in small clusters, and then sharing outcomes with others. To
our knowledge, the role of multi-agent topology on innovation has not been
systematically studied in machine learning. It remains unclear a) which
communication topologies are optimal for which innovation tasks, and b) which
properties of experience sharing improve multi-level innovation. Here we use a
multi-level hierarchical problem setting (WordCraft), with three different
innovation tasks. We systematically design networks of DQNs sharing experiences
from their replay buffers in varying topologies (fully connected, small world,
dynamic, ring). Comparing the level of innovation achieved by different
experience-sharing topologies across different tasks shows that, first,
consistent with human findings, experience sharing within a dynamic topology
achieves the highest level of innovation across tasks. Second, experience
sharing is not as helpful when there is a single clear path to innovation.
Third, two metrics we propose, conformity and diversity of shared experience,
can explain the success of different topologies on different tasks. These
contributions can advance our understanding of optimal AI-AI, human-human, and
human-AI collaborative networks, inspiring future tools for fostering
collective innovation in large organizations.
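The experience-sharing setup described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the shortcut-based small-world construction, the cluster/all-to-all schedule used for the dynamic topology, and the batch size are all assumptions made for illustration, and plain tuples stand in for DQN transitions.

```python
import random
from collections import deque


def ring(n):
    """Each agent is linked to its two nearest neighbours."""
    return {i: {(i - 1) % n, (i + 1) % n} for i in range(n)}


def fully_connected(n):
    """Every agent is linked to every other agent."""
    return {i: set(range(n)) - {i} for i in range(n)}


def small_world(n, n_shortcuts, rng):
    """Ring plus a few random long-range links (assumed construction)."""
    graph = ring(n)
    candidates = [(a, b) for a in range(n) for b in range(a + 1, n)
                  if b not in graph[a]]
    for a, b in rng.sample(candidates, min(n_shortcuts, len(candidates))):
        graph[a].add(b)
        graph[b].add(a)
    return graph


def clusters(n, size):
    """Disjoint groups of `size` consecutive agents, linked only internally."""
    return {i: {j for j in range(n) if j // size == i // size and j != i}
            for i in range(n)}


def dynamic(n, step, period=10, size=2):
    """Assumed schedule: small clusters, all-to-all on every `period`-th step."""
    return fully_connected(n) if step % period == 0 else clusters(n, size)


def share_experiences(buffers, graph, batch_size, rng):
    """Each agent copies a random batch from its buffer to each neighbour."""
    # Sample all batches first so items received this round are not re-shared.
    outgoing = {i: rng.sample(list(buf), min(batch_size, len(buf)))
                for i, buf in buffers.items()}
    for sender, batch in outgoing.items():
        for receiver in graph[sender]:
            buffers[receiver].extend(batch)


# Usage: four agents with three placeholder transitions each, sharing on a ring.
rng = random.Random(0)
n_agents = 4
buffers = {i: deque([(f"agent{i}", t) for t in range(3)], maxlen=100)
           for i in range(n_agents)}
share_experiences(buffers, ring(n_agents), batch_size=2, rng=rng)
```

Passing `dynamic(n_agents, step)` in place of `ring(n_agents)` inside a training loop would alternate between small-cluster and group-wide sharing, which is one simple way to mimic the oscillation the abstract describes.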
Related papers
- Weak Ties Explain Open Source Innovation [9.399494734600164]
We study the correlation between developers' knowledge acquisition through three distinct interaction networks on GitHub and the innovativeness of the projects they develop.
Our findings suggest that the diversity of projects in which developers engage correlates positively with the innovativeness of their future project developments, whereas the volume of interactions exerts minimal influence.
arXiv Detail & Related papers (2024-11-08T15:39:33Z)
- Collective Innovation in Groups of Large Language Models [28.486116730339972]
We study Large Language Models (LLMs) that play Little Alchemy 2, a creative video game originally developed for humans.
We study groups of LLMs that share information related to their behaviour and focus on the effect of social connectivity on collective performance.
Our work reveals opportunities and challenges for future studies of collective innovation that are becoming increasingly relevant as Generative Artificial Intelligence algorithms and humans innovate alongside each other.
arXiv Detail & Related papers (2024-07-07T13:59:46Z)
- Bidirectional Progressive Neural Networks with Episodic Return Progress for Emergent Task Sequencing and Robotic Skill Transfer [1.7205106391379026]
We introduce a novel multi-task reinforcement learning framework named Episodic Return Progress with Bidirectional Progressive Neural Networks (ERP-BPNN).
The proposed ERP-BPNN model learns in a human-like interleaved manner by (2) autonomous task switching based on a novel intrinsic motivation signal.
We show that ERP-BPNN achieves faster cumulative convergence and improves performance in all metrics considered among morphologically different robots compared to the baselines.
arXiv Detail & Related papers (2024-03-06T19:17:49Z)
- The language and social behavior of innovators [0.0]
We analyze about 38,000 posts available in the intranet forum of a large multinational company.
We find that innovators write more, use a more complex language, introduce new concepts/ideas, and use positive but factual-based language.
arXiv Detail & Related papers (2022-09-20T07:01:25Z)
- Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions [68.6358773622615]
This paper provides an overview of the computational and theoretical foundations of multimodal machine learning.
We propose a taxonomy of 6 core technical challenges: representation, alignment, reasoning, generation, transference, and quantification.
Recent technical achievements will be presented through the lens of this taxonomy, allowing researchers to understand the similarities and differences across new approaches.
arXiv Detail & Related papers (2022-09-07T19:21:19Z)
- Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably) [75.38159612828362]
It has been observed that the best uni-modal network outperforms the jointly trained multi-modal network.
This work provides a theoretical explanation for the emergence of such performance gap in neural networks for the prevalent joint training framework.
arXiv Detail & Related papers (2022-03-23T06:21:53Z)
- WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model [74.4875156387271]
We develop a novel foundation model pre-trained with huge multimodal (visual and textual) data.
We show that state-of-the-art results can be obtained on a wide range of downstream tasks.
arXiv Detail & Related papers (2021-10-27T12:25:21Z)
- MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale [103.7609761511652]
We show how a large-scale collective robotic learning system can acquire a repertoire of behaviors simultaneously.
New tasks can be continuously instantiated from previously learned tasks.
We train and evaluate our system on a set of 12 real-world tasks with data collected from 7 robots.
arXiv Detail & Related papers (2021-04-16T16:38:02Z)
- Cognitive architecture aided by working-memory for self-supervised multi-modal humans recognition [54.749127627191655]
The ability to recognize human partners is an important social skill to build personalized and long-term human-robot interactions.
Deep learning networks have achieved state-of-the-art results and have proven to be suitable tools for this task.
One solution is to make robots learn from their first-hand sensory data with self-supervision.
arXiv Detail & Related papers (2021-03-16T13:50:24Z)
- Enterprise System Lifecycle-wide Innovation [0.0]
This study forms a conceptual bridge between innovation and enterprise systems.
We introduce Continuous Restrained Innovation (CRI) as a new type of innovation specific to ES.
arXiv Detail & Related papers (2020-06-18T02:16:10Z)
- Distributed and Democratized Learning: Philosophy and Research Challenges [80.39805582015133]
We propose a novel design philosophy called democratized learning (Dem-AI).
Inspired by the societal groups of humans, the specialized groups of learning agents in the proposed Dem-AI system are self-organized in a hierarchical structure to collectively perform learning tasks more efficiently.
We present a reference design as a guideline to realize future Dem-AI systems, inspired by various interdisciplinary fields.
arXiv Detail & Related papers (2020-03-18T08:45:10Z)