Fugu-MT 論文翻訳(概要): Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory

論文の概要: Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory

arxiv url: http://arxiv.org/abs/2603.16862v1
Date: Tue, 17 Mar 2026 17:59:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.471501
Title: Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory
Title（参考訳）: Chronos: 長期記憶のための構造化イベント検索機能付き時間認識対話エージェント
Authors: Sahil Sen, Elias Lumer, Anmol Gulati, Vamse Kumar Subbiah,
Abstract要約: 会話型AIのための時間認識メモリフレームワークであるChronosを紹介する。 Chronosは生の対話を、解決された日時範囲とエンティティエイリアスを持つ主観的動詞オブジェクトイベントに分解する。クエリ時に、Chronosは動的プロンプトを適用して、各質問に対して調整された検索ガイダンスを生成する。
参考スコア（独自算出の注目度）: 0.7723674433972977
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent advances in Large Language Models (LLMs) have enabled conversational AI agents to engage in extended multi-turn interactions spanning weeks or months. However, existing memory systems struggle to reason over temporally grounded facts and preferences that evolve across months of interaction and lack effective retrieval strategies for multi-hop, time-sensitive queries over long dialogue histories. We introduce Chronos, a novel temporal-aware memory framework that decomposes raw dialogue into subject-verb-object event tuples with resolved datetime ranges and entity aliases, indexing them in a structured event calendar alongside a turn calendar that preserves full conversational context. At query time, Chronos applies dynamic prompting to generate tailored retrieval guidance for each question, directing the agent on what to retrieve, how to filter across time ranges, and how to approach multi-hop reasoning through an iterative tool-calling loop over both calendars. We evaluate Chronos with 8 LLMs, both open-source and closed-source, on the LongMemEvalS benchmark comprising 500 questions spanning six categories of dialogue history tasks. Chronos Low achieves 92.60% and Chronos High scores 95.60% accuracy, setting a new state of the art with an improvement of 7.67% over the best prior system. Ablation results reveal the events calendar accounts for a 58.9% gain on the baseline while all other components yield improvements between 15.5% and 22.3%. Notably, Chronos Low alone surpasses prior approaches evaluated under their strongest model configurations.
Abstract（参考訳）: 近年のLarge Language Models (LLM) の進歩により、会話型AIエージェントは数週間から数ヶ月にわたって、多ターンインタラクションを拡張できるようになった。しかし、既存のメモリシステムは、数ヶ月にわたるインタラクションを通じて進化し、長い対話履歴に対するマルチホップで時間に敏感なクエリに対する効果的な検索戦略が欠如している、時間的に根ざした事実や嗜好を推論するのに苦労している。そこで我々はChronosという新しい時間対応メモリフレームワークを紹介した。これは、生の対話を日付範囲とエンティティエイリアスを含む主観的なイベントタプルに分解し、完全な会話コンテキストを保存するターンカレンダーと共に構造化されたイベントカレンダーにインデックス付けする。クエリ時に、Chronosは動的プロンプトを適用して、各質問の調整された検索ガイダンスを生成し、エージェントに何を検索するか、時間範囲をまたいでフィルタする方法、そして両方のカレンダー上で反復ツール呼び出しループを通じてマルチホップ推論にどのようにアプローチするかを指示する。我々は、LongMemEvalSベンチマークにおいて、Chronosをオープンソースとクローズドソースの両方で8つのLLMで評価し、対話履歴タスクの6つのカテゴリにまたがる500の質問について検討した。クロノス・ローは92.60%を獲得し、クロノス・ハイは95.60%の精度を記録し、最先端のシステムよりも7.67%向上した。アブレーションの結果、カレンダーはベースラインで58.9%上昇し、他の全てのコンポーネントは15.5%から22.3%改善した。特にChronos Lowは、最強のモデル構成で評価された従来のアプローチを上回る。

論文の概要: Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory

関連論文リスト