Fugu-MT 論文翻訳(概要): Towards Trust Calibration in Socially Interactive Agents: Investigating Gendered Multimodal Behaviors Generation with LLMs

論文の概要: Towards Trust Calibration in Socially Interactive Agents: Investigating Gendered Multimodal Behaviors Generation with LLMs

arxiv url: http://arxiv.org/abs/2605.19798v1
Date: Tue, 19 May 2026 12:59:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-20 15:03:09.347697
Title: Towards Trust Calibration in Socially Interactive Agents: Investigating Gendered Multimodal Behaviors Generation with LLMs
Title（参考訳）: 社会的対話型エージェントにおける信頼の校正に向けて--LPMを用いたジェンダー型マルチモーダル行動生成の検討
Authors: Lucie Galland, Chloé Clavel, Magalie Ochs,
Abstract要約: GPT-5.4は様々なモードでコヒーレントな振舞いを生成できることを示す。また、ジェンダーがプロンプトで特定されると、LDMは社会的ジェンダーステレオタイプを再現する傾向がある。
参考スコア（独自算出の注目度）: 8.21736236281494
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: As Socially Interactive Agents (SIAs) become increasingly integrated into daily life, the ability to calibrate user trust to an agent's actual capabilities would help ensure appropriate usage of these agents. In this paper, we explore the capacity of Large Language Models (LLMs) to generate multimodal behaviors (verbal, vocal, gestural, and facial expression modalities) that reflect varying levels of ability and benevolence, two key dimensions of trustworthiness. We propose a novel method for automatically generating behaviors aligned with specific levels of these traits, a first step towards enabling nuanced and trust-calibrated interactions. By analyzing a large dataset of multimodal transcripts generated by LLMs, we demonstrate that GPT-5.4 is able to produce coherent behavior across different modalities (text, intonation, facial expression, and gesture). Using Random Forest feature importance analysis, we show that the generated behaviors align with theoretical expectations for ability and benevolence. However, we also find that when gender is specified in the prompt, LLMs tend to reproduce societal gender stereotypes, associating male agents' behaviors with high ability and female agents' behaviors with high benevolence. To validate our approach, we conducted a user study on Prolific using a within-subjects design. Participants perceived different levels of ability and benevolence in the generated behaviors align with the intended instructions.
Abstract（参考訳）: Socially Interactive Agents (SIAs) が日々の生活に統合されるにつれて、エージェントの実際の能力に対するユーザの信頼を調整できる能力は、これらのエージェントの適切な使用を確実にする助けとなる。本稿では,言語モデル(LLM)の多言語行動(言語,声,ジェスチャー,表情のモダリティ)を生成する能力について検討する。本稿では,これらの特徴の特定のレベルに沿った行動を自動的に生成する手法を提案する。 GPT-5.4 は LLM が生成する多モーダル文字の大規模なデータセットを解析することにより,様々なモダリティ(テキスト,イントネーション,表情,ジェスチャー)にまたがるコヒーレントな振る舞いを生成できることを実証する。ランダムフォレストの特徴的重要度分析を用いて, 生成した行動は, 能力とベネバンスに対する理論的期待と一致していることを示す。しかし, ジェンダーがプロンプトで特定されると, LLMは社会的ジェンダーのステレオタイプを再現し, 男性エージェントの行動と女性エージェントの行動と高いベネヴァンスを関連付ける傾向にあることがわかった。提案手法の有効性を検証するために,本研究では,内部オブジェクト設計を用いたProlificのユーザスタディを行った。参加者は、生成された行動における能力とベネヴァンスの異なるレベルが意図された指示と一致していると認識した。

論文の概要: Towards Trust Calibration in Socially Interactive Agents: Investigating Gendered Multimodal Behaviors Generation with LLMs

関連論文リスト