Fugu-MT 論文翻訳(概要): Agentic Persona Generation with Critique-Refinement: An Industrial Evaluation

論文の概要: Agentic Persona Generation with Critique-Refinement: An Industrial Evaluation

arxiv url: http://arxiv.org/abs/2606.09637v1
Date: Mon, 08 Jun 2026 15:34:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:07.462191
Title: Agentic Persona Generation with Critique-Refinement: An Industrial Evaluation
Title（参考訳）: 批判的リファインメントによるエージェントペルソナ生成:工業的評価
Authors: Mohammad Hossein Amini, David Dewar, Shiva Nejati, Mehrdad Sabetzadeh,
Abstract要約: PerGentは業界グレードのペルソナ生成手法で、反復的批評・修正ループを中心に構築されている。専門家のin-situ評価では、PerGentは最高専門家の承認率(96.9%)を達成した。
参考スコア（独自算出の注目度）: 1.5821080783312833
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Personas are widely used in software engineering to support requirements elicitation, design, and validation, but their manual creation is costly, time-consuming, and hard to scale. Recent LLM-based approaches automate persona generation from textual data; however, they typically rely on single-shot generation and subjective evaluations, limiting practical reliability. We present PerGent, an industry-grade method for persona generation built around an iterative critique-refinement loop. Specifically, PerGent uses a generator and a critic LLM agent, coordinated by an orchestrator, to iteratively refine personas using external resources such as interviews, surveys, and job postings through a critique-refinement loop with a user-defined maximum number of rounds. We deploy and evaluate PerGent in an industrial setting at Kinaxis, comparing it with three baselines, including one-shot methods. In an expert in-situ evaluation, PerGent achieved the highest expert approval rate (96.9%), exceeding all baselines. We further compare PerGent-generated personas with best-practice personas manually created by domain experts prior to the adoption of LLMs. Compared to baselines, PerGent reproduces a larger proportion of expert content while also contributing substantial new content beyond the pre-LLM personas. We conclude with lessons learned from deploying and evaluating PerGent at Kinaxis.
Abstract（参考訳）: ペルソナは、要件の付与、設計、検証をサポートするために、ソフトウェアエンジニアリングで広く使用されているが、手作業による作成は、コストがかかり、時間がかかり、スケールが困難である。最近のLLMベースのアプローチは、テキストデータからペルソナ生成を自動化するが、通常は単発生成と主観評価に依存し、実用的信頼性を制限している。本稿では,反復的批評・修正ループを中心に構築された,業界レベルのペルソナ生成手法であるPerGentを提案する。特に、PerGentは、オーケストレータがコーディネートしたジェネレータと批評家のLLMエージェントを使用して、ユーザ定義の最大ラウンド数で批判的リファインメントループを通じて、インタビュー、調査、ジョブポストなどの外部リソースを使用して、ペルソナを反復的に洗練する。我々は、Kinaxisの産業環境でPerGentをデプロイし、評価し、ワンショットメソッドを含む3つのベースラインと比較した。専門家のin-situ評価では、PerGentはすべての基準を超える、最高専門家の承認率(96.9%)を達成した。さらに,LLM導入前にドメインの専門家が手作業で作成したペルジェント生成ペルソナとベストプラクティスペルソナを比較した。ベースラインと比較すると、PerGentはプロのコンテンツを多く再現すると同時に、LLM以前のペルソナを超えて、かなり新しいコンテンツに貢献している。 KinaxisでのPerGentのデプロイと評価から学んだ教訓で締めくくります。

論文の概要: Agentic Persona Generation with Critique-Refinement: An Industrial Evaluation

関連論文リスト