Fugu-MT 論文翻訳(概要): Beyond Static Personas: Situational Personality Steering for Large Language Models

論文の概要: Beyond Static Personas: Situational Personality Steering for Large Language Models

arxiv url: http://arxiv.org/abs/2604.13846v3
Date: Fri, 17 Apr 2026 08:59:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-20 13:38:49.374485
Title: Beyond Static Personas: Situational Personality Steering for Large Language Models
Title（参考訳）: 静的ペルソナを超えて:大規模言語モデルのための状況的パーソナリティステアリング
Authors: Zesheng Wei, Mengxiang Li, Zilei Wang, Yang Deng,
Abstract要約: IRIS(Identify-Retrieve-Steer framework for Advanced situational personality steering)を提案する。本手法は, 状況認識型ニューロン識別, 状況認識型ニューロン検索, 類似性重み付けによるステアリングを含む。我々は、パーソナリティベンチの枠組みと、包括的状況パーソナリティベンチマークであるSPBenchを実証的に検証した。
参考スコア（独自算出の注目度）: 52.33369223308941
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Personalized Large Language Models (LLMs) facilitate more natural, human-like interactions in human-centric applications. However, existing personalization methods are constrained by limited controllability and high resource demands. Furthermore, their reliance on static personality modeling restricts adaptability across varying situations. To address these limitations, we first demonstrate the existence of situation-dependency and consistent situation-behavior patterns within LLM personalities through a multi-perspective analysis of persona neurons. Building on these insights, we propose IRIS, a training-free, neuron-based Identify-Retrieve-Steer framework for advanced situational personality steering. Our approach comprises situational persona neuron identification, situation-aware neuron retrieval, and similarity-weighted steering. We empirically validate our framework on PersonalityBench and our newly introduced SPBench, a comprehensive situational personality benchmark. Experimental results show that our method surpasses best-performing baselines, demonstrating IRIS's generalization and robustness to complex, unseen situations and different models architecture.
Abstract（参考訳）: パーソナライズされた大規模言語モデル(LLM)は、人間中心のアプリケーションにおいて、より自然な人間のような相互作用を促進する。しかし、既存のパーソナライズ手法は、制限された制御性と高いリソース要求によって制約されている。さらに、静的パーソナリティモデリングへの依存は、様々な状況における適応性を制限する。これらの制約に対処するために、まず、ペルソナニューロンの多視点的解析を通して、LLM個人性における状況依存性と一貫した状況行動パターンの存在を実証する。これらの知見に基づいて、我々は、高度状況人格管理のためのトレーニングフリーでニューロンベースのIdentify-Retrieve-SteerフレームワークであるIRISを提案する。本手法は, 状況認識型ニューロン識別, 状況認識型ニューロン検索, 類似性重み付けによるステアリングを含む。我々は、パーソナリティベンチの枠組みと、包括的状況パーソナリティベンチマークであるSPBenchを実証的に検証した。実験の結果,本手法は,IRISの一般化と,複雑で目に見えない状況と異なるモデルアーキテクチャに対する堅牢性を実証し,最高の性能のベースラインを超えることを示した。

論文の概要: Beyond Static Personas: Situational Personality Steering for Large Language Models

関連論文リスト