Fugu-MT 論文翻訳(概要): QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence

論文の概要: QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence

arxiv url: http://arxiv.org/abs/2604.12867v4
Date: Thu, 23 Apr 2026 01:15:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:05.96989
Title: QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence
Title（参考訳）: QuarkMedSearch:医療情報探究のための長距離ディープサーチエージェント
Authors: Zhichao Lin, Zhichao Liang, Gaoqiang Liu, Meng Xu, Baoyu Xiang, Shuxin Zhao, Yao Wu, Jian Xu, Guanjun Jiang,
Abstract要約: 我々は,強力なエージェント基盤モデルであるTongyi DeepResearchを構築し,QuarkMedSearchを提案する。データ合成には、大規模医療知識グラフとリアルタイムオンライン探索を組み合わせることで、長期医療深層検索訓練データを構築する。ポストトレーニングでは、2段階のSFTおよびRLトレーニング戦略を採用し、モデルの計画、ツール呼び出し、リフレクション機能を徐々に強化する。
参考スコア（独自算出の注目度）: 7.965065668022068
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As agentic foundation models continue to evolve, how to further improve their performance in vertical domains has become an important challenge. To this end, building upon Tongyi DeepResearch, a powerful agentic foundation model, we focus on the Chinese medical deep search scenario and propose QuarkMedSearch, systematically exploring a full-pipeline approach spanning medical multi-hop data construction, training strategies, and evaluation benchmarks to further push and assess its performance upper bound in vertical domains. Specifically, for data synthesis, to address the scarcity of deep search training data in the medical domain, we combine a large-scale medical knowledge graph with real-time online exploration to construct long-horizon medical deep search training data; for post-training, we adopt a two-stage SFT and RL training strategy that progressively enhances the model's planning, tool invocation, and reflection capabilities required for deep search, while maintaining search efficiency; for evaluation, we collaborate with medical experts to construct the QuarkMedSearch Benchmark through rigorous manual verification. Experimental results demonstrate that QuarkMedSearch achieves state-of-the-art performance among open-source models of comparable scale on the QuarkMedSearch Benchmark, while also maintaining strong competitiveness on general benchmarks.
Abstract（参考訳）: エージェントファウンデーションモデルが進化を続けるにつれて、垂直領域におけるパフォーマンスをさらに向上する方法が重要な課題となっている。この目的のために,強力なエージェント基盤モデルであるTongyi DeepResearchを基盤として,中国における医療深層探索のシナリオに着目したQuarkMedSearchを提案する。具体的には、医用領域における深部検索訓練データの不足に対処するため、大規模医療知識グラフとリアルタイムオンライン探索を併用して、長期医療深部検索訓練データの構築を行い、訓練後、厳密な手作業によるQuarkMedSearch Benchmarkを構築するために、検索効率を維持しながら、モデルの計画、ツール呼び出し、リフレクション能力を段階的に向上する2段階のSFTおよびRLトレーニング戦略を採用する。実験により、QuarkMedSearchはQuarkMedSearchベンチマークにおいて、同等規模のオープンソースモデル間の最先端のパフォーマンスを達成し、また、一般的なベンチマークでは強力な競争力を維持していることが示された。

論文の概要: QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence

関連論文リスト