Fugu-MT 論文翻訳(概要): SimPersona: Learning Discrete Buyer Personas from Raw Clickstreams for Grounded E-Commerce Agents

論文の概要: SimPersona: Learning Discrete Buyer Personas from Raw Clickstreams for Grounded E-Commerce Agents

arxiv url: http://arxiv.org/abs/2605.14205v1
Date: Thu, 14 May 2026 00:01:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.544468
Title: SimPersona: Learning Discrete Buyer Personas from Raw Clickstreams for Grounded E-Commerce Agents
Title（参考訳）: SimPersona: 現地のEコマースエージェントのために、生クリックストリームから個別の購入者ペルソナを学習する
Authors: Zahra Zanjani Foumani, Alberto Castelo, Shuang Xie, Ted Chaiwachirasak, Han Li, Lingyun Wang,
Abstract要約: SimPersonaは、歴史的トラフィックから個別のバイヤータイプを学習し、コンパクトなペルソナトークンとしてLLMベースのWebエージェントに公開する新しいフレームワークである。 SimPersonaは、実際の買い手とのコンバージョンレートのアライメントで78%を達成し、買い手の種類によって解釈可能な振る舞いの変化を示し、目標指向のショッピングタスクにおいて8倍のパラメータでベースラインを上回っている。
参考スコア（独自算出の注目度）: 6.722556076605006
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLM-based web agents can navigate live storefronts, yet they often collapse to a single "average buyer" policy, failing to capture the heterogeneous and distributional nature of real buyer populations. Existing personalization methods rely on hand-crafted prompt-based personas that are brittle, difficult to scale, context-inefficient, and unable to faithfully represent population-level behavior. We introduce SimPersona, a novel framework that learns discrete buyer types from historical traffic and exposes them to LLM-based web agents as compact persona tokens. Given raw clickstreams, a behavior-aware VQ-VAE induces a discrete buyer-type space that captures the statistical structure of real buyer behavior and merchant-specific buyer population distributions. To provide behavior-specific guidance to LLM-based web agents, SimPersona maps each learned buyer type to a dedicated persona token in the LLM agent vocabulary and fine-tunes the agent with these tokens on real browsing traces. At inference, each synthetic buyer is assigned to a learned buyer type with a single encoder forward pass, requiring no retraining or store-specific prompt engineering. For population-level simulation, SimPersona samples buyer types from each merchant's empirical distribution over the learned VQ-VAE codebook and instantiates agents with the corresponding persona tokens, preserving merchant-specific buyer population distributions. Evaluated on $8.37$M buyers across $42$ held-out live storefronts, SimPersona achieves $78\%$ conversion-rate alignment with real buyers, exhibits interpretable behavioral variation across buyer types, and outperforms a baseline with $8\times$ more parameters on goal-oriented shopping tasks. We further release an open-source data pipeline that converts raw e-commerce event logs into buyer representations and agent-training traces.
Abstract（参考訳）: LLMベースのウェブエージェントは、ライブストアフロントをナビゲートできるが、しばしば一つの「平均購入者」ポリシーに崩壊する。既存のパーソナライズ手法は、手作りのプロンプトベースのペルソナに頼っている。シムペルソナ(SimPersona)は、歴史的トラフィックから個別のバイヤータイプを学習し、コンパクトなペルソナトークンとしてLLMベースのWebエージェントに公開する新しいフレームワークである。生クリックストリームが与えられた場合、行動認識型VQ-VAEは、実際の購入者行動の統計構造と商人固有の購入者人口分布をキャプチャする個別の購入者型空間を誘導する。 LLMベースのWebエージェントに対する行動特異的ガイダンスを提供するため、SimPersonaは、学習した購入者のタイプをLLMエージェント語彙内の専用のペルソナトークンにマッピングし、実際のブラウジングトレース上でこれらのトークンでエージェントを微調整する。推測では、各合成バイヤーは、学習されたバイヤータイプに、単一のエンコーダフォワードパスで割り当てられ、再トレーニングやストア固有のプロンプトエンジニアリングを必要としない。人口レベルのシミュレーションでは、SimPersonaは学習したVQ-VAEコードブック上で各商人の経験的分布から購入者タイプをサンプリングし、エージェントを対応するペルソナトークンでインスタンス化し、商店固有の購入者人口分布を保存する。シムペルソナは、42ドル(約4万4000円)のライブ店舗で837ドル(約8万7000円)の買い手から評価され、実際の買い手とのコンバージョンレートの調整に78セント(約7万7000円)を達成し、買い手の種類によって解釈可能な行動変化を示し、目標指向のショッピングタスクにおいて8ドル(約8万7000円)以上のパラメータでベースラインを上回ります。さらに、生のEコマースイベントログをバイヤー表現やエージェントトレーニングトレースに変換する、オープンソースのデータパイプラインをリリースしています。

論文の概要: SimPersona: Learning Discrete Buyer Personas from Raw Clickstreams for Grounded E-Commerce Agents

関連論文リスト