Fugu-MT 論文翻訳(概要): BaZi-Based Character Simulation Benchmark: Evaluating AI on Temporal and Persona Reasoning

論文の概要: BaZi-Based Character Simulation Benchmark: Evaluating AI on Temporal and Persona Reasoning

arxiv url: http://arxiv.org/abs/2510.23337v1
Date: Mon, 27 Oct 2025 13:51:13 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 15:28:15.562992
Title: BaZi-Based Character Simulation Benchmark: Evaluating AI on Temporal and Persona Reasoning
Title（参考訳）: BaZiベースの文字シミュレーションベンチマーク:時間とペルソナ推論におけるAIの評価
Authors: Siyuan Zheng, Pai Liu, Xi Chen, Jizheng Dong, Sihan Jia,
Abstract要約: BaZiベースのペルソナ推論のための最初のQAデータセットを作成します。本研究では,シンボル推論と大規模言語モデルを統合したBaZi-LLMシステムを提案する。
参考スコア（独自算出の注目度）: 3.3125111019129707
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Human-like virtual characters are crucial for games, storytelling, and virtual reality, yet current methods rely heavily on annotated data or handcrafted persona prompts, making it difficult to scale up and generate realistic, contextually coherent personas. We create the first QA dataset for BaZi-based persona reasoning, where real human experiences categorized into wealth, health, kinship, career, and relationships are represented as life-event questions and answers. Furthermore, we propose the first BaZi-LLM system that integrates symbolic reasoning with large language models to generate temporally dynamic and fine-grained virtual personas. Compared with mainstream LLMs such as DeepSeek-v3 and GPT-5-mini, our method achieves a 30.3%-62.6% accuracy improvement. In addition, when incorrect BaZi information is used, our model's accuracy drops by 20%-45%, showing the potential of culturally grounded symbolic-LLM integration for realistic character simulation.
Abstract（参考訳）: 人間のような仮想キャラクタはゲーム、ストーリーテリング、バーチャルリアリティーには不可欠だが、現在の手法は注釈付きデータや手作りのペルソナプロンプトに大きく依存しているため、現実的でコンテキストに整合したペルソナのスケールアップと生成が困難である。私たちは、BaZiベースのペルソナ推論のための最初のQAデータセットを作成します。さらに,大規模な言語モデルとシンボリック推論を統合し,時間的に動的かつ微細な仮想ペルソナを生成するBaZi-LLMシステムを提案する。 DeepSeek-v3 や GPT-5-mini といった主流の LLM と比較して, 精度が 30.3%-62.6% 向上した。さらに,誤ったBaZi情報を使用すると,モデルの精度が20%～45%低下し,現実的なキャラクタシミュレーションのための文化的なシンボル-LLM統合の可能性を示した。

論文の概要: BaZi-Based Character Simulation Benchmark: Evaluating AI on Temporal and Persona Reasoning

関連論文リスト