TalkTive: A Conversational Agent Using Backchannels to Engage Older
Adults in Neurocognitive Disorders Screening
- URL: http://arxiv.org/abs/2202.08216v1
- Date: Wed, 16 Feb 2022 17:55:34 GMT
- Title: TalkTive: A Conversational Agent Using Backchannels to Engage Older
Adults in Neurocognitive Disorders Screening
- Authors: Zijian Ding, Jiawen Kang, Tinky Oi Ting HO, Ka Ho Wong, Helene H.
Fung, Helen Meng, Xiaojuan Ma
- Abstract summary: We analyzed 246 conversations of cognitive assessments between older adults and human assessors.
We derived the categories of reactive backchannels and proactive backchannels.
This is used in the development of TalkTive, a CA which can predict both timing and form of backchanneling.
- Score: 51.97352212369947
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Conversational agents (CAs) have the great potential in mitigating the
clinicians' burden in screening for neurocognitive disorders among older
adults. It is important, therefore, to develop CAs that can be engaging, to
elicit conversational speech input from older adult participants for supporting
assessment of cognitive abilities. As an initial step, this paper presents
research in developing the backchanneling ability in CAs in the form of a
verbal response to engage the speaker. We analyzed 246 conversations of
cognitive assessments between older adults and human assessors, and derived the
categories of reactive backchannels (e.g. "hmm") and proactive backchannels
(e.g. "please keep going"). This is used in the development of TalkTive, a CA
which can predict both timing and form of backchanneling during cognitive
assessments. The study then invited 36 older adult participants to evaluate the
backchanneling feature. Results show that proactive backchanneling is more
appreciated by participants than reactive backchanneling.
Related papers
- Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations [58.65755268815283]
Many real dialogues are interactive, meaning an agent's utterances will influence their conversational partner, elicit information, or change their opinion.
We use this fact to rewrite and augment existing suboptimal data, and train via offline reinforcement learning (RL) an agent that outperforms both prompting and learning from unaltered human demonstrations.
Our results in a user study with real humans show that our approach greatly outperforms existing state-of-the-art dialogue agents.
arXiv Detail & Related papers (2024-11-07T21:37:51Z) - CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy [67.23830698947637]
We propose a new benchmark, CBT-BENCH, for the systematic evaluation of cognitive behavioral therapy (CBT) assistance.
We include three levels of tasks in CBT-BENCH: I: Basic CBT knowledge acquisition, with the task of multiple-choice questions; II: Cognitive model understanding, with the tasks of cognitive distortion classification, primary core belief classification, and fine-grained core belief classification; III: Therapeutic response generation, with the task of generating responses to patient speech in CBT therapy sessions.
Experimental results indicate that while LLMs perform well in reciting CBT knowledge, they fall short in complex real-world scenarios
arXiv Detail & Related papers (2024-10-17T04:52:57Z) - Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling [30.099739460287566]
Autism spectrum disorder (ASD) is a neurodevelopmental condition characterized by challenges in social communication, repetitive behavior, and sensory processing.
One important research area in ASD is evaluating children's behavioral changes over time during treatment.
A fundamental aspect of understanding children's behavior in these interactions is automatic speech understanding.
arXiv Detail & Related papers (2024-09-14T07:03:08Z) - Learning to Generate Context-Sensitive Backchannel Smiles for Embodied
AI Agents with Applications in Mental Health Dialogues [21.706636640014594]
Embodied agents with advanced interactive capabilities emerge as a promising and cost-effective supplement to traditional caregiving methods.
We annotated backchannel smiles in videos of intimate face-to-face conversations over topics such as mental health, illness, and relationships.
Using cues from speech prosody and language along with the demographics of the speaker and listener, we found them to contain significant predictors of the intensity of backchannel smiles.
arXiv Detail & Related papers (2024-02-13T22:47:22Z) - A Cognitive Stimulation Dialogue System with Multi-source Knowledge
Fusion for Elders with Cognitive Impairment [15.921295369286161]
Data sparsity is the main challenge in building CS-based dialogue systems, particularly in the Chinese language.
Making chit chat while providing emotional support is overlooked by the majority of existing cognitive dialogue systems.
We propose a multi-source knowledge fusion method for CS dialogue (CSD), to generate open-ended responses guided by the CS principle and emotional support strategy.
arXiv Detail & Related papers (2023-05-14T16:52:20Z) - Leveraging Pretrained Representations with Task-related Keywords for
Alzheimer's Disease Detection [69.53626024091076]
Alzheimer's disease (AD) is particularly prominent in older adults.
Recent advances in pre-trained models motivate AD detection modeling to shift from low-level features to high-level representations.
This paper presents several efficient methods to extract better AD-related cues from high-level acoustic and linguistic features.
arXiv Detail & Related papers (2023-03-14T16:03:28Z) - Response-act Guided Reinforced Dialogue Generation for Mental Health
Counseling [25.524804770124145]
We present READER, a dialogue-act guided response generator for mental health counseling conversations.
READER is built on transformer to jointly predict a potential dialogue-act d(t+1) for the next utterance (aka response-act) and to generate an appropriate response u(t+1)
We evaluate READER on HOPE, a benchmark counseling conversation dataset.
arXiv Detail & Related papers (2023-01-30T08:53:35Z) - A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker
Identity in Dysarthric Voice Conversion [50.040466658605524]
We propose a new paradigm for maintaining speaker identity in dysarthric voice conversion (DVC)
The poor quality of dysarthric speech can be greatly improved by statistical VC.
But as the normal speech utterances of a dysarthria patient are nearly impossible to collect, previous work failed to recover the individuality of the patient.
arXiv Detail & Related papers (2021-06-02T18:41:03Z) - You Impress Me: Dialogue Generation via Mutual Persona Perception [62.89449096369027]
The research in cognitive science suggests that understanding is an essential signal for a high-quality chit-chat conversation.
Motivated by this, we propose P2 Bot, a transmitter-receiver based framework with the aim of explicitly modeling understanding.
arXiv Detail & Related papers (2020-04-11T12:51:07Z) - Studying the Effects of Cognitive Biases in Evaluation of Conversational
Agents [10.248512149493443]
We conduct a study with 77 crowdsourced workers to understand the role of cognitive biases, specifically anchoring bias, when humans are asked to evaluate the output of conversational agents.
We find increased consistency in ratings across two experimental conditions may be a result of anchoring bias.
arXiv Detail & Related papers (2020-02-18T23:52:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.