Feature Fusion Strategies for End-to-End Evaluation of Cognitive
Behavior Therapy Sessions
- URL: http://arxiv.org/abs/2005.07809v2
- Date: Wed, 14 Oct 2020 20:53:36 GMT
- Title: Feature Fusion Strategies for End-to-End Evaluation of Cognitive
Behavior Therapy Sessions
- Authors: Zhuohao Chen, Nikolaos Flemotomos, Victor Ardulov, Torrey A. Creed,
Zac E. Imel, David C. Atkins, Shrikanth Narayanan
- Abstract summary: We develop an end-to-end pipeline that converts speech audio to diarized and transcribed text to code Cognitive Behavioral Therapy sessions automatically.
We propose a novel method to augment the word-based features with the utterance level tags for subsequent CBT code estimation.
- Score: 32.198800906972366
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cognitive Behavioral Therapy (CBT) is a goal-oriented psychotherapy for
mental health concerns implemented in a conversational setting with broad
empirical support for its effectiveness across a range of presenting problems
and client populations. The quality of a CBT session is typically assessed by
trained human raters who manually assign pre-defined session-level behavioral
codes. In this paper, we develop an end-to-end pipeline that converts speech
audio to diarized and transcribed text and extracts linguistic features to code
the CBT sessions automatically. We investigate both word-level and
utterance-level features and propose feature fusion strategies to combine them.
The utterance level features include dialog act tags as well as behavioral
codes drawn from another well-known talk psychotherapy called Motivational
Interviewing (MI). We propose a novel method to augment the word-based features
with the utterance level tags for subsequent CBT code estimation. Experiments
show that our new fusion strategy outperforms all the studied features, both
when used individually and when fused by direct concatenation. We also find
that incorporating a sentence segmentation module can further improve the
overall system given the preponderance of multi-utterance conversational turns
in CBT sessions.
Related papers
- CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy [67.23830698947637]
We propose a new benchmark, CBT-BENCH, for the systematic evaluation of cognitive behavioral therapy (CBT) assistance.
We include three levels of tasks in CBT-BENCH: I: Basic CBT knowledge acquisition, with the task of multiple-choice questions; II: Cognitive model understanding, with the tasks of cognitive distortion classification, primary core belief classification, and fine-grained core belief classification; III: Therapeutic response generation, with the task of generating responses to patient speech in CBT therapy sessions.
Experimental results indicate that while LLMs perform well in reciting CBT knowledge, they fall short in complex real-world scenarios
arXiv Detail & Related papers (2024-10-17T04:52:57Z) - Are Large Language Models Possible to Conduct Cognitive Behavioral Therapy? [13.0263170692984]
Large language models (LLMs) have been validated, providing new possibilities for psychological assistance therapy.
Many concerns have been raised by mental health experts regarding the use of LLMs for therapy.
Four LLM variants with excellent performance on natural language processing are evaluated.
arXiv Detail & Related papers (2024-07-25T03:01:47Z) - LLM Questionnaire Completion for Automatic Psychiatric Assessment [49.1574468325115]
We employ a Large Language Model (LLM) to convert unstructured psychological interviews into structured questionnaires spanning various psychiatric and personality domains.
The obtained answers are coded as features, which are used to predict standardized psychiatric measures of depression (PHQ-8) and PTSD (PCL-C)
arXiv Detail & Related papers (2024-06-09T09:03:11Z) - Speech-based Clinical Depression Screening: An Empirical Study [32.84863235794086]
This study investigates the utility of speech signals for AI-based depression screening across varied interaction scenarios.
participants include depressed patients recruited from the outpatient clinics of Peking University Sixth Hospital.
We extracted acoustic and deep speech features from each participant's segmented recordings.
arXiv Detail & Related papers (2024-06-05T09:43:54Z) - PsyCoT: Psychological Questionnaire as Powerful Chain-of-Thought for
Personality Detection [50.66968526809069]
We propose a novel personality detection method, called PsyCoT, which mimics the way individuals complete psychological questionnaires in a multi-turn dialogue manner.
Our experiments demonstrate that PsyCoT significantly improves the performance and robustness of GPT-3.5 in personality detection.
arXiv Detail & Related papers (2023-10-31T08:23:33Z) - Emotion Recognition in Conversation using Probabilistic Soft Logic [17.62924003652853]
emotion recognition in conversation (ERC) is a sub-field of emotion recognition that focuses on conversations that contain two or more utterances.
We implement our approach in a framework called Probabilistic Soft Logic (PSL), a declarative templating language.
PSL provides functionality for the incorporation of results from neural models into PSL models.
We compare our method with state-of-the-art purely neural ERC systems, and see almost a 20% improvement.
arXiv Detail & Related papers (2022-07-14T23:59:06Z) - CogAlign: Learning to Align Textual Neural Representations to Cognitive
Language Processing Signals [60.921888445317705]
We propose a CogAlign approach to integrate cognitive language processing signals into natural language processing models.
We show that CogAlign achieves significant improvements with multiple cognitive features over state-of-the-art models on public datasets.
arXiv Detail & Related papers (2021-06-10T07:10:25Z) - Automated Quality Assessment of Cognitive Behavioral Therapy Sessions
Through Highly Contextualized Language Representations [34.670548892766625]
A BERT-based model is proposed for automatic behavioral scoring of a specific type of psychotherapy, called Cognitive Behavioral Therapy (CBT)
The model is trained in a multi-task manner in order to achieve higher interpretability.
BERT-based representations are further augmented with available therapy metadata, providing relevant non-linguistic context and leading to consistent performance improvements.
arXiv Detail & Related papers (2021-02-23T09:22:29Z) - Pose-based Body Language Recognition for Emotion and Psychiatric Symptom
Interpretation [75.3147962600095]
We propose an automated framework for body language based emotion recognition starting from regular RGB videos.
In collaboration with psychologists, we extend the framework for psychiatric symptom prediction.
Because a specific application domain of the proposed framework may only supply a limited amount of data, the framework is designed to work on a small training set.
arXiv Detail & Related papers (2020-10-30T18:45:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.