User profile-driven large-scale multi-agent learning from demonstration
in federated human-robot collaborative environments
- URL: http://arxiv.org/abs/2103.16434v1
- Date: Tue, 30 Mar 2021 15:33:21 GMT
- Title: User profile-driven large-scale multi-agent learning from demonstration
in federated human-robot collaborative environments
- Authors: Georgios Th. Papadopoulos, Asterios Leonidis, Margherita Antona,
Constantine Stephanidis
- Abstract summary: This paper introduces a novel user profile formulation for providing a fine-grained representation of the exhibited human behavior.
The overall designed scheme enables both short- and long-term analysis/interpretation of the human behavior.
- Score: 5.218882272051637
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Learning from Demonstration (LfD) has been established as the dominant
paradigm for efficiently transferring skills from human teachers to robots. In
this context, the Federated Learning (FL) conceptualization has very recently
been introduced for developing large-scale human-robot collaborative
environments, targeting to robustly address, among others, the critical
challenges of multi-agent learning and long-term autonomy. In the current work,
the latter scheme is further extended and enhanced, by designing and
integrating a novel user profile formulation for providing a fine-grained
representation of the exhibited human behavior, adopting a Deep Learning
(DL)-based formalism. In particular, a hierarchically organized set of key
information sources is considered, including: a) User attributes (e.g.
demographic, anthropomorphic, educational, etc.), b) User state (e.g. fatigue
detection, stress detection, emotion recognition, etc.) and c)
Psychophysiological measurements (e.g. gaze, electrodermal activity, heart
rate, etc.) related data. Then, a combination of Long Short-Term Memory (LSTM)
and stacked autoencoders, with appropriately defined neural network
architectures, is employed for the modelling step. The overall designed scheme
enables both short- and long-term analysis/interpretation of the human behavior
(as observed during the feedback capturing sessions), so as to adaptively
adjust the importance of the collected feedback samples when aggregating
information originating from the same and different human teachers,
respectively.
Related papers
- Brain-Cognition Fingerprinting via Graph-GCCA with Contrastive Learning [28.681229869236393]
longitudinal neuroimaging studies aim to improve the understanding of brain aging and diseases by studying the dynamic interactions between brain function and cognition.
We propose an unsupervised learning model that encodes their relationship via Graph Attention Networks and generalized Correlational Analysis.
To create brain-cognition fingerprints reflecting unique neural and cognitive phenotype of each person, the model also relies on individualized and multimodal contrastive learning.
arXiv Detail & Related papers (2024-09-20T20:36:20Z) - PersLLM: A Personified Training Approach for Large Language Models [66.16513246245401]
We propose PersLLM, integrating psychology-grounded principles of personality: social practice, consistency, and dynamic development.
We incorporate personality traits directly into the model parameters, enhancing the model's resistance to induction, promoting consistency, and supporting the dynamic evolution of personality.
arXiv Detail & Related papers (2024-07-17T08:13:22Z) - Hello Again! LLM-powered Personalized Agent for Long-term Dialogue [63.65128176360345]
We introduce a model-agnostic framework, the Long-term Dialogue Agent (LD-Agent)
It incorporates three independently tunable modules dedicated to event perception, persona extraction, and response generation.
The effectiveness, generality, and cross-domain capabilities of LD-Agent are empirically demonstrated.
arXiv Detail & Related papers (2024-06-09T21:58:32Z) - DSAM: A Deep Learning Framework for Analyzing Temporal and Spatial Dynamics in Brain Networks [4.041732967881764]
Most rs-fMRI studies compute a single static functional connectivity matrix across brain regions of interest.
These approaches are at risk of oversimplifying brain dynamics and lack proper consideration of the goal at hand.
We propose a novel interpretable deep learning framework that learns goal-specific functional connectivity matrix directly from time series.
arXiv Detail & Related papers (2024-05-19T23:35:06Z) - Real-time Addressee Estimation: Deployment of a Deep-Learning Model on
the iCub Robot [52.277579221741746]
Addressee Estimation is a skill essential for social robots to interact smoothly with humans.
Inspired by human perceptual skills, a deep-learning model for Addressee Estimation is designed, trained, and deployed on an iCub robot.
The study presents the procedure of such implementation and the performance of the model deployed in real-time human-robot interaction.
arXiv Detail & Related papers (2023-11-09T13:01:21Z) - Learn to Predict How Humans Manipulate Large-sized Objects from
Interactive Motions [82.90906153293585]
We propose a graph neural network, HO-GCN, to fuse motion data and dynamic descriptors for the prediction task.
We show the proposed network that consumes dynamic descriptors can achieve state-of-the-art prediction results and help the network better generalize to unseen objects.
arXiv Detail & Related papers (2022-06-25T09:55:39Z) - Beyond Tracking: Using Deep Learning to Discover Novel Interactions in
Biological Swarms [3.441021278275805]
We propose training deep network models to predict system-level states directly from generic graphical features from the entire view.
Because the resulting predictive models are not based on human-understood predictors, we use explanatory modules.
This represents an example of augmented intelligence in behavioral ecology -- knowledge co-creation in a human-AI team.
arXiv Detail & Related papers (2021-08-20T22:50:41Z) - Backprop-Free Reinforcement Learning with Active Neural Generative
Coding [84.11376568625353]
We propose a computational framework for learning action-driven generative models without backpropagation of errors (backprop) in dynamic environments.
We develop an intelligent agent that operates even with sparse rewards, drawing inspiration from the cognitive theory of planning as inference.
The robust performance of our agent offers promising evidence that a backprop-free approach for neural inference and learning can drive goal-directed behavior.
arXiv Detail & Related papers (2021-07-10T19:02:27Z) - Cross-individual Recognition of Emotions by a Dynamic Entropy based on
Pattern Learning with EEG features [2.863100352151122]
We propose a deep-learning framework denoted as a dynamic entropy-based pattern learning (DEPL) to abstract informative indicators pertaining to the neurophysiological features among multiple individuals.
DEPL enhanced the capability of representations generated by a deep convolutional neural network by modelling the interdependencies between the cortical locations of dynamical entropy based features.
arXiv Detail & Related papers (2020-09-26T07:22:07Z) - Mechanisms for Handling Nested Dependencies in Neural-Network Language
Models and Humans [75.15855405318855]
We studied whether a modern artificial neural network trained with "deep learning" methods mimics a central aspect of human sentence processing.
Although the network was solely trained to predict the next word in a large corpus, analysis showed the emergence of specialized units that successfully handled local and long-distance syntactic agreement.
We tested the model's predictions in a behavioral experiment where humans detected violations in number agreement in sentences with systematic variations in the singular/plural status of multiple nouns.
arXiv Detail & Related papers (2020-06-19T12:00:05Z) - Human Action Recognition and Assessment via Deep Neural Network
Self-Organization [0.0]
This chapter introduces a set of hierarchical models for the learning and recognition of actions from depth maps and RGB images.
A particularity of these models is the use of growing self-organizing networks that quickly adapt to non-stationary distributions.
arXiv Detail & Related papers (2020-01-04T15:58:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.