Fugu-MT 論文翻訳(概要): Dr-CiK: A Testbed for Foresight-Driven Agents

論文の概要: Dr-CiK: A Testbed for Foresight-Driven Agents

arxiv url: http://arxiv.org/abs/2605.27904v1
Date: Wed, 27 May 2026 03:26:42 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:55.716614
Title: Dr-CiK: A Testbed for Foresight-Driven Agents
Title（参考訳）: Dr-CiK: フォアサイト駆動エージェントのテストベッド
Authors: Yihong Tang, Andrew Robert Williams, Arjun Ashok, Vincent Zhihao Zheng, Lijun Sun, Alexandre Drouin, Issam H. Laradji, Étienne Marcotte, Valentina Zantedeschi,
Abstract要約: 本稿では,文書コーパスから予測関連コンテキストを検索できるかどうかを評価するベンチマークであるDr-CiKを紹介する。我々は,Dr-CiKの予測性能が,高品質な文脈で大幅に向上することを示す。我々の研究は、未来を予測するための適切なコンテキストを探索するフォレスト駆動エージェントの研究を動機付けている。
参考スコア（独自算出の注目度）: 58.303939183596015
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Time series forecasting in real-world settings often depends not only on historical observations, but also on external context that must be actively discovered from noisy, heterogeneous information sources. Yet existing context-aided forecasting benchmarks typically assume that the supporting context is already provided, leaving open whether agents can identify it on their own. Therefore, we introduce Dr-CiK, a benchmark for evaluating whether agents can retrieve forecasting-relevant supporting context from a document corpus, filter out distractors, distill the retrieved context into forecast-useful evidence, and generate forecasts supported by that evidence. Through context ablations and evaluations of state-of-the-art deep research and forecasting methods paired together, we show that high-quality context substantially improves forecasting performance in Dr-CiK. However, most existing DR agents recover only a small fraction of the ground-truth supporting evidence (usually <5%), are frequently misled by distractors (>80% distractor citations), and can cause forecasters to perform worse with retrieved context than without context. Our results motivate research on foresight-driven agents that search for the right context to predict the future.
Abstract（参考訳）: 実世界の環境での時系列予測は、しばしば歴史的観測だけでなく、ノイズの多い異種情報ソースから積極的に発見される必要がある外部の文脈にも依存する。しかし、既存のコンテキスト支援予測ベンチマークでは、サポート済みのコンテキストがすでに提供されていると仮定し、エージェントが自分自身でそれを識別できるかどうかを判断する。そこで本研究では,文書コーパスから予測関連コンテキストを検索し,イントラクタをフィルタリングし,検索したコンテキストを予測用エビデンスに抽出し,そのエビデンスによって支援された予測を生成することができるかどうかを評価するベンチマークであるDr-CiKを紹介する。最先端の深層研究と予測手法の組合わせによる文脈改善と評価により,Dr-CiKの予測性能を大幅に向上することを示す。しかし、ほとんどの既存のDRエージェントは、地上の真実を裏付ける証拠(通常、5%)のごく一部しか回収せず、しばしば妨害者によって誤解される(80%以上の妨害者による引用)。我々の研究は、未来を予測するための適切なコンテキストを探索するフォレスト駆動エージェントの研究を動機付けている。

論文の概要: Dr-CiK: A Testbed for Foresight-Driven Agents

関連論文リスト