Fugu-MT 論文翻訳(概要): OpenThoughts-Agent: Data Recipes for Agentic Models

論文の概要: OpenThoughts-Agent: Data Recipes for Agentic Models

arxiv url: http://arxiv.org/abs/2606.24855v1
Date: Tue, 23 Jun 2026 17:34:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 22:16:49.134345
Title: OpenThoughts-Agent: Data Recipes for Agentic Models
Title（参考訳）: OpenThoughts-Agent:エージェントモデルのためのデータ準備
Authors: Negin Raoof, Richard Zhuang, Marianna Nezhurina, Etash Guha, Atula Tejaswi, Ryan Marten, Charlie F. Ruan, Tyler Griggs, Alexander Glenn Shaw, Hritik Bansal, E. Kelly Buchanan, Artem Gazizov, Reinhard Heckel, Chinmay Hegde, Sankalp Jajee, Daanish Khazi, Emmanouil Koukoumidis, Xiangyi Li, Hange Liu, Shlok Natarajan, Harsh Raj, Nicholas Roberts, Ethan Shen, Nishad Singhi, Michael Siu, Ashima Suvarna, Hanwen Xing, Patrick Yubeaton, Robert Zhang, Leon Liangyu Chen, Xiaokun Chen, Steven Dillmann, Saadia Gabriel, Xunyi Jiang, Anurag Kashyap, Boxuan Li, Yein Park, Minh Pham, Sujay Sanghavi, Lin Shi, Ke Sun, Yixin Wang, Zhiwei Xu, Erica Zhang, Siyan Zhao, Wanjia Zhao, Jenia Jitsev, Alex Dimakis, Benjamin Feuer, Ludwig Schmidt,
Abstract要約: OT-Agentプロジェクトはエージェントモデルをトレーニングするための完全にオープンなデータキュレーションパイプラインである。パイプラインの各段階について100以上の制御アブレーション実験を行った。このデータセットには、パイプラインから100Kのサンプルと微調整されたQwen3-32Bのトレーニングセットが組み込まれています。
参考スコア（独自算出の注目度）: 99.10629645072653
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Agentic language models dramatically expand the applications of AI yet little is publicly known about how to curate training data for broadly capable agents. Existing open efforts such as SWE-Smith, SERA, and Nemotron-Terminal typically target a single benchmark, leaving open the question of how to train models that generalize across diverse agentic tasks. The OpenThoughts-Agent (OT-Agent) project addresses this gap with a fully open data curation pipeline for training agentic models. We conduct more than 100 controlled ablation experiments to systematically investigate each stage of the pipeline, yielding insights on the importance of task sources and diversity. We then assemble a training set of 100K examples from our pipeline and fine-tune Qwen3-32B on this dataset, which yields an average accuracy of 44.8% across seven agentic benchmarks and a 3.9 percentage point improvement over the strongest existing open data agentic model (Nemotron-Terminal-32B, 40.9%). Moreover, our training data exhibits strong scaling properties, outperforming alternative open datasets at every training set size in compute-controlled comparisons. We publicly release our training sets, data pipeline, experimental data, and models at openthoughts.ai to support future open research on agentic model training.
Abstract（参考訳）: エージェント言語モデルはAIの応用を劇的に拡大するが、幅広い能力を持つエージェントのトレーニングデータをキュレートする方法については一般には知られていない。 SWE-Smith、SERA、Nemotron-Terminalといった既存のオープンな取り組みは、通常1つのベンチマークをターゲットにしており、さまざまなエージェントタスクにまたがって一般化するモデルをトレーニングする方法に関する疑問が残る。 OpenThoughts-Agent(OT-Agent)プロジェクトは、エージェントモデルをトレーニングするための完全にオープンなデータキュレーションパイプラインで、このギャップに対処する。我々は100以上の制御されたアブレーション実験を行い、パイプラインの各ステージを体系的に調査し、タスクソースと多様性の重要性についての洞察を得る。次に、このデータセット上でパイプラインから100Kのサンプルと微調整のQwen3-32Bのトレーニングセットを組み立てます。これにより、7つのエージェントベンチマークの平均精度が44.8%、最強のオープンデータエージェントモデル(Nemotron-Terminal-32B, 40.9%)よりも3.9%向上します。さらに、我々のトレーニングデータは、強力なスケーリング特性を示し、計算制御比較において、トレーニングセットのサイズ毎に、代替のオープンデータセットよりも優れています。エージェントモデルトレーニングに関する今後のオープンな研究を支援するため、オープン思想.aiでトレーニングセット、データパイプライン、実験データ、モデルを公開する。

論文の概要: OpenThoughts-Agent: Data Recipes for Agentic Models

関連論文リスト