Fugu-MT 論文翻訳(概要): SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

論文の概要: SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

arxiv url: http://arxiv.org/abs/2602.19455v1
Date: Mon, 23 Feb 2026 02:55:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-24 17:42:02.654823
Title: SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning
Title（参考訳）: SenTSR-Bench: 時系列推論のためのインジェクト知識による思考
Authors: Zelin He, Boran Han, Xiyuan Zhang, Shuai Zhang, Haotian Lin, Qi Zhu, Haoyang Fang, Danielle C. Maddix, Abdul Fatir Ansari, Akash Chandrayan, Abhinav Pradhan, Bernie Wang, Matthew Reimherr,
Abstract要約: 大きな推論モデル(GRLM)は、強い推論スキルを持っているが、複雑な時系列パターンを理解するためのドメイン固有の知識は欠如している。本稿では,TSLM生成した知見を直接GRLMの推論トレースに注入するハイブリッドな知識注入フレームワークを提案する。 SenTSR-Benchや他の公開データセット全体で、我々の方法はTSLMを9.1%-26.1%、GRLMを7.9%-22.4%上回る。
参考スコア（独自算出の注目度）: 27.91435057369479
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Time-series diagnostic reasoning is essential for many applications, yet existing solutions face a persistent gap: general reasoning large language models (GRLMs) possess strong reasoning skills but lack the domain-specific knowledge to understand complex time-series patterns. Conversely, fine-tuned time-series LLMs (TSLMs) understand these patterns but lack the capacity to generalize reasoning for more complicated questions. To bridge this gap, we propose a hybrid knowledge-injection framework that injects TSLM-generated insights directly into GRLM's reasoning trace, thereby achieving strong time-series reasoning with in-domain knowledge. As collecting data for knowledge injection fine-tuning is costly, we further leverage a reinforcement learning-based approach with verifiable rewards (RLVR) to elicit knowledge-rich traces without human supervision, then transfer such an in-domain thinking trace into GRLM for efficient knowledge injection. We further release SenTSR-Bench, a multivariate time-series-based diagnostic reasoning benchmark collected from real-world industrial operations. Across SenTSR-Bench and other public datasets, our method consistently surpasses TSLMs by 9.1%-26.1% and GRLMs by 7.9%-22.4%, delivering robust, context-aware time-series diagnostic insights.
Abstract（参考訳）: 大規模言語モデル(GRLM)は強力な推論スキルを持っているが、複雑な時系列パターンを理解するためのドメイン固有の知識が欠如している。逆に、微調整された時系列 LLM (TSLMs) はこれらのパターンを理解するが、より複雑な問題に対する推論を一般化する能力は欠如している。このギャップを埋めるために,TSLM生成した知見を直接GRLMの推論トレースに直接注入するハイブリッドな知識注入フレームワークを提案する。知識注入細調整のためのデータ収集はコストがかかるため,人間の監督なしに知識に富んだ痕跡を抽出するために,検証可能な報酬(RLVR)を用いた強化学習ベースのアプローチをさらに活用し,そのようなドメイン内思考トレースをGRLMに転送して,効率的な知識注入を行う。我々はさらに,実世界の産業運用から収集した多変量時系列に基づく診断推論ベンチマークであるSenTSR-Benchをリリースする。 SenTSR-Benchや他の公開データセット全体で、我々の手法はTSLMを9.1%-26.1%、GRLMを7.9%-22.4%上回り、堅牢でコンテキスト対応の時系列診断の洞察を提供する。

論文の概要: SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

関連論文リスト