Fugu-MT 論文翻訳(概要): DuIVRS-2: An LLM-based Interactive Voice Response System for Large-scale POI Attribute Acquisition

論文の概要: DuIVRS-2: An LLM-based Interactive Voice Response System for Large-scale POI Attribute Acquisition

arxiv url: http://arxiv.org/abs/2605.17900v1
Date: Mon, 18 May 2026 06:06:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:48.908237
Title: DuIVRS-2: An LLM-based Interactive Voice Response System for Large-scale POI Attribute Acquisition
Title（参考訳）: DuIVRS-2:大規模POI属性獲得のためのLLMを用いた対話型音声応答システム
Authors: Le Zhang, Shengming Zhang, Rui Zha, Yunpeng Wu, Jingbo Zhou, Jizhou Huang,
Abstract要約: 本稿では,Baidu Mapsにおける大規模POI属性取得のための大規模言語モデル(LLM)に基づくエンドツーエンドフレームワークを提案する。 DuIVRS-2は毎日0.4万回の通話を処理し、83.9%のタスク成功率(TSR)を達成した。この研究は、大規模産業対話アプリケーションのためのロバストで費用対効果の高いLLMエージェントを開発するために、生産実績のあるリファレンスを提供する。
参考スコア（独自算出の注目度）: 28.52285542771737
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Accurate Point of Interest (POI) attribute acquisition is essential for location-based services, yet traditional modular Interactive Voice Response (IVR) systems suffer from error accumulation and high maintenance overhead. We present DuIVRS-2, a large language model (LLM)-based end-to-end framework designed for large-scale POI attribute acquisition at Baidu Maps. To address the long-tail distribution of real-world interactions, our methodology first employs a finite state machine (FSM)-guided data augmentation strategy to synthesize a balanced and diverse training dataset. We then streamline dialogue management via a selective generation scheme combined with a Chain-of-Thought (CoT) mechanism, which ensures output stability and effectively eliminates hallucinations in industrial settings. To facilitate continuous policy refinement with minimal manual effort, we design a cooperative iterative learning framework that leverages a dual-evaluator voting system. Deployed in production for two months, DuIVRS-2 processed 0.4 million calls daily and achieved a 83.9\% Task Success Rate (TSR), outperforming its predecessor by 4 percentage points while maintaining a low reaction time of 130ms. This work provides a production-proven reference for developing robust, cost-effective LLM agents for large-scale industrial dialogue applications.
Abstract（参考訳）: 正確なpoint of Interest(POI)属性の取得は位置情報ベースのサービスには不可欠だが、従来のモジュール型インタラクティブ音声応答(IVR)システムはエラーの蓄積と高いメンテナンスオーバーヘッドに悩まされている。 Baidu Mapsにおける大規模POI属性取得のために設計された,大規模言語モデル(LLM)に基づくエンドツーエンドフレームワークであるDuIVRS-2を提案する。実世界の相互作用の長期分布に対処するために,本手法ではまず有限状態マシン(FSM)誘導データ拡張戦略を用いて,バランスの取れた多様なトレーニングデータセットを合成する。次に、選択生成方式とChain-of-Thought(CoT)機構を組み合わせることで対話管理を効率化し、出力安定性を確保し、産業環境における幻覚を効果的に除去する。手作業が最小限に抑えられた継続的政策改善を容易にするために,両立投票システムを活用した協調的反復学習フレームワークを設計する。 2ヶ月の運用期間を経て、DuIVRS-2は毎日0.4万回の通話を処理し、83.9倍のタスク成功率(TSR)を達成した。この研究は、大規模産業対話アプリケーションのためのロバストで費用対効果の高いLLMエージェントを開発するために、生産実績のあるリファレンスを提供する。

論文の概要: DuIVRS-2: An LLM-based Interactive Voice Response System for Large-scale POI Attribute Acquisition

関連論文リスト