Fugu-MT 論文翻訳(概要): Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?

論文の概要: Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?

arxiv url: http://arxiv.org/abs/2605.10848v1
Date: Mon, 11 May 2026 16:58:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:51.015649
Title: Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?
Title（参考訳）: Pi-Seriniによるエージェント検索の再考:語彙検索は十分か?
Authors: Tz-Huan Hsu, Jheng-Hong Yang, Jimmy Lin,
Abstract要約: 本稿では,文書の検索,閲覧,読取を行う3つのツールを備えた検索エージェントであるPi-Seriniを紹介する。以上の結果から,BrowseComp-Plusでは,検索深度を十分に設定した語彙レトリバーが,より有能なLLMと組み合わせることで,効果的な深層学習を支援することができることがわかった。
参考スコア（独自算出の注目度）: 44.97027502229472
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Does a lexical retriever suffice as large language models (LLMs) become more capable in an agentic loop? This question naturally arises when building deep research systems. We revisit it by pairing BM25 with frontier LLMs that have better reasoning and tool-use abilities. To support researchers asking the same question, we introduce Pi-Serini, a search agent equipped with three tools for retrieving, browsing, and reading documents. Our results show that, on BrowseComp-Plus, a well-configured lexical retriever with sufficient retrieval depth can support effective deep research when paired with more capable LLMs. Specifically, Pi-Serini with gpt-5.5 achieves 83.1% answer accuracy and 94.7% surfaced evidence recall, outperforming released search agents that use dense retrievers. Controlled ablations further show that BM25 tuning improves answer accuracy by 18.0% and surfaced evidence recall by 11.1% over the default BM25 setting, while increasing retrieval depth further improves surfaced evidence recall by 25.3% over the shallow-retrieval setting. Source code is available at https://github.com/justram/pi-serini.
Abstract（参考訳）: 大規模言語モデル(LLM)がエージェントループでより有効になるにつれて、語彙レトリバーは十分か? この問題は、ディープリサーチシステムを構築する際に自然に発生する。 BM25 とフロンティア LLM を組み合わせて再検討する。そこで本研究では,文書の検索,閲覧,読解を行う3つのツールを備えた検索エージェントであるPi-Seriniを紹介する。以上の結果から,BrowseComp-Plusでは,検索深度を十分に設定した語彙レトリバーが,より有能なLLMと組み合わせることで,効果的な深層学習を支援することができることがわかった。具体的には、gpt-5.5 の Pi-Serini は83.1% の回答精度と94.7% の証拠リコールを達成し、より密集した検索エージェントよりも優れている。制御された改善により、BM25のチューニングにより解答精度が18.0%向上し、デフォルトのBM25設定では11.1%向上し、検索深度は浅い検索条件では25.3%向上した。ソースコードはhttps://github.com/justram/pi-serini.comで入手できる。

論文の概要: Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?

関連論文リスト