Fugu-MT 論文翻訳(概要): Fara-1.5: Scalable Learning Environments for Computer Use Agents

論文の概要: Fara-1.5: Scalable Learning Environments for Computer Use Agents

arxiv url: http://arxiv.org/abs/2606.20785v1
Date: Thu, 18 Jun 2026 17:53:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-26 12:51:29.653613
Title: Fara-1.5: Scalable Learning Environments for Computer Use Agents
Title（参考訳）: Fara-1.5: コンピュータ利用エージェントのためのスケーラブルな学習環境
Authors: Ahmed Awadallah, Sahil Gupta, Yash Lara, Yadong Lu, Hussein Mozannar, Akshay Nambi, Zach Nussbaum, Yash Pandya, Aravind Rajeswaran, Corby Rosset, Alexey Taymanov, Luiz do Valle, Vibhav Vineet, Spencer Whitehead, Andrew Zhao,
Abstract要約: FaraGen1.5は、環境、ソルバ、検証器という3つのモジュールコンポーネントからなるコンピュータ利用エージェントのためのスケーラブルなデータパイプラインである。 FaraGen1.5は、認証によってゲートドメインを忠実にシミュレートする、あるいは不可逆的なアクションを必要とする、ライブWebサイトと合成環境の両方を使用している。結果の軌道を3つの補完的検証器でスコアし、タスクの正しさ、効率性、臨界点の順守をカバーしている。
参考スコア（独自算出の注目度）: 34.72158889421745
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Collecting computer use data from human demonstrations is expensive and slow, motivating the need for scalable generation strategies. This requires two key ingredients: environments in which agents can act and verifiers that can judge whether their demonstrations succeeded. We introduce FaraGen1.5, a scalable data pipeline for computer use agents composed of three modular components: environments, solvers, and verifiers. FaraGen1.5 uses both live websites and synthetic environments that faithfully simulate domains gated by authentication or that require irreversible actions. It employs a solver harness that can be powered by multiple models, including strong frontier models such as GPT-5.4, and also incorporates a user simulator to enable multi-turn rollouts. Finally, FaraGen1.5 scores the resulting trajectories with three complementary verifiers covering task correctness, efficiency, and critical-point adherence. Using data produced by this pipeline, we train Fara1.5, a family of native computer use agents (CUAs) at three scales built on Qwen3.5 (4B, 9B, and 27B). To train these models, we employ a supervised finetuning (SFT) recipe that carefully balances data from FaraGen1.5 for broad coverage, specific high-value tasks, and target model deficiencies in an iterative approach. Each model sets a new state of the art for its size class on browser-use benchmarks: Fara1.5-9B reaches 63.4% on Online-Mind2Web and 86.6% on WebVoyager, while Fara1.5-27B achieves 72.3% on Online-Mind2Web, which is competitive with much larger proprietary systems.
Abstract（参考訳）: 人間のデモからコンピュータ使用データを収集するのは高価で遅く、スケーラブルな生成戦略の必要性を動機付けている。エージェントが動作可能な環境と、デモが成功したかどうかを判断できる検証ツールの2つの重要な要素が必要です。 FaraGen1.5は、環境、ソルバ、検証器という3つのモジュールコンポーネントからなるコンピュータ利用エージェントのためのスケーラブルなデータパイプラインである。 FaraGen1.5は、認証によってゲートされたドメインを忠実にシミュレートする、あるいは不可逆的なアクションを必要とする、ライブWebサイトと合成環境の両方を使用している。 GPT-5.4のような強力なフロンティアモデルを含む複数のモデルで利用でき、マルチターンロールアウトを可能にするユーザーシミュレータも搭載している。最後に、FaraGen1.5は、タスクの正確性、効率性、臨界点の付着性をカバーする3つの補完的検証器を用いて、結果の軌跡をスコアする。このパイプラインで生成されたデータを使用して、Qwen3.5(4B、9B、27B)上に構築された3つのスケールで、ネイティブコンピュータ使用エージェント(CUA)ファミリーであるFara1.5をトレーニングします。これらのモデルのトレーニングには、FaraGen1.5からのデータ、特定の高価値タスク、反復的アプローチにおけるターゲットモデル欠陥を注意深くバランスする教師付き微調整(SFT)レシピを用いる。 Fara1.5-9B は Online-Mind2Web で63.4%、WebVoyager で86.6%、Fara1.5-27B は Online-Mind2Web で72.3%に達する。

論文の概要: Fara-1.5: Scalable Learning Environments for Computer Use Agents

関連論文リスト