Fugu-MT 論文翻訳(概要): Can LLMs Time Travel? Enhancing Temporal Consistency in Legal Agentic Search through Reinforcement Learning

論文の概要: Can LLMs Time Travel? Enhancing Temporal Consistency in Legal Agentic Search through Reinforcement Learning

arxiv url: http://arxiv.org/abs/2605.25920v1
Date: Mon, 25 May 2026 14:57:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:20.341813
Title: Can LLMs Time Travel? Enhancing Temporal Consistency in Legal Agentic Search through Reinforcement Learning
Title（参考訳）: LLMのタイムトラベルは可能か? 強化学習による法的エージェント探索における時間的整合性を高める
Authors: Wei Fan, Yining Zhou, Mufan Zhang, Yanbing Weng, Yiran HU, Tianshi Zheng, Baixuan Xu, Chunyang Li, Jianhui Yang, Haoran Li, Yangqiu Song,
Abstract要約: 法律は、法律の遡及的適用が中核的な法的原則に違反し、誤った結論に至るため、各事件の時間的文脈と一致しなければならない。我々の観察では、現在の法的LLMはトレーニングの遮断に固定された時間的バイアスに悩まされているのに対し、検索エージェントはクエリに時間的制約を組み込むことは滅多にない。我々は,複数の修正期間にまたがる時間的インデクシングデータに基づいて学習し,時間的整合性を確保するために,オンラインWeb検索に適合する厳密な記事に局所法規RAGを併用する,エンドツーエンドの強化学習フレームワークであるLegalSearch-R1を提案する。
参考スコア（独自算出の注目度）: 45.13302016493955
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While large language models (LLMs) augmented with agentic search capabilities show promise for legal reasoning, they overlook a fundamental constraint that applicable law must match the temporal context of each case, as retroactive application of statutes violates core legal principles and leads to erroneous conclusions. Our observations reveal that current legal LLMs suffer from temporal bias anchored to their training cutoff, while search agents rarely incorporate temporal constraints into queries, and that web search alone cannot provide the precise statute and precedent citations that legal reasoning demands. To address these challenges, we propose LegalSearch-R1, an end-to-end reinforcement learning framework that pairs local statute RAG for precise article matching with online web search for broader legal knowledge, trained on temporally-indexed data spanning multiple amendment periods to enforce temporal consistency. Extensive experiments on our benchmark covering 13 legal tasks demonstrate that our 7B-parameter agent outperforms state-of-the-art deep research frameworks and specialized legal LLMs by 12.9% to 29.8%, surpasses baselines by 57.7% to 80.3% on temporal consistency, and exhibits robust out-of-domain generalization. The code and data are available at https://github.com/AlexFanw/LegalSearch-R1.
Abstract（参考訳）: エージェント検索機能を付加した大規模言語モデル(LLM)は法的推論の約束を示すが、法令の遡及的適用は基本的法原則に反し、誤った結論に至るため、適用法が各事件の時間的文脈に合致しなければならないという根本的な制約を見落としている。一方,検索エージェントは時間的制約をクエリに組み込むことは稀であり,Web検索だけでは法的な理由付けを求める正確な法規や前例的な引用は得られない。これらの課題に対処するために、LegalSearch-R1を提案する。これは、時間的整合性を確保するために、複数の修正期間にまたがる時間的インデクシングデータに基づいて訓練された、より広い法的知識のために、オンラインウェブ検索に適合する厳密な記事にローカルな法令RAGをペアリングするエンドツーエンドの強化学習フレームワークである。 13の法的なタスクをカバーするベンチマークにおいて、我々の7Bパラメーターエージェントは、最先端のディープ・リサーチ・フレームワークと特殊法的なLSMを12.9%から29.8%、ベースラインを57.7%から80.3%、時間的一貫性を57.7%から80.3%で上回り、ドメイン外の堅牢な一般化を示すことを示した。コードとデータはhttps://github.com/AlexFanw/LegalSearch-R1.comで公開されている。

論文の概要: Can LLMs Time Travel? Enhancing Temporal Consistency in Legal Agentic Search through Reinforcement Learning

関連論文リスト