Fugu-MT 論文翻訳(概要): PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System

論文の概要: PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System

arxiv url: http://arxiv.org/abs/2509.24855v1
Date: Mon, 29 Sep 2025 14:40:53 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:20.051497
Title: PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System
Title（参考訳）: 物理ミニオン: 進化的マルチモーダルマルチエージェントシステムによる最新の物理オリンピックで金メダルを獲得
Authors: Fangchen Yu, Junchi Yao, Ziyi Wang, Haiyuan Wan, Youling Huang, Bo Zhang, Shuyue Hu, Dongzhan Zhou, Ning Ding, Ganqu Cui, Lei Bai, Wanli Ouyang, Peng Ye,
Abstract要約: 物理は現実世界の理解と形成の中心であり、物理問題を解く能力は現実世界の物理知能の重要な指標である。既存のアプローチは主にシングルモデルベースであり、オープンソースのMLLMはゴールドメディカルレベルのパフォーマンスに達することは滅多にない。我々は,物理オリンピアードの共進化的マルチエージェントシステムであるPhysorMinionsを提案する。アーキテクチャには、ダイアグラムを解釈するVisual Studio、ソリューションを定式化するLogic Studio、デュアルステージ検証を実行するReview Studioの3つのシナジスティックスタジオがある。
参考スコア（独自算出の注目度）: 65.02248709992442
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Physics is central to understanding and shaping the real world, and the ability to solve physics problems is a key indicator of real-world physical intelligence. Physics Olympiads, renowned as the crown of competitive physics, provide a rigorous testbed requiring complex reasoning and deep multimodal understanding, yet they remain largely underexplored in AI research. Existing approaches are predominantly single-model based, and open-source MLLMs rarely reach gold-medal-level performance. To address this gap, we propose PhysicsMinions, a coevolutionary multi-agent system for Physics Olympiad. Its architecture features three synergistic studios: a Visual Studio to interpret diagrams, a Logic Studio to formulate solutions, and a Review Studio to perform dual-stage verification. The system coevolves through an iterative refinement loop where feedback from the Review Studio continuously guides the Logic Studio, enabling the system to self-correct and converge towards the ground truth. Evaluated on the HiPhO benchmark spanning 7 latest physics Olympiads, PhysicsMinions delivers three major breakthroughs: (i) Strong generalization: it consistently improves both open-source and closed-source models of different sizes, delivering clear benefits over their single-model baselines; (ii) Historic breakthroughs: it elevates open-source models from only 1-2 to 6 gold medals across 7 Olympiads, achieving the first-ever open-source gold medal in the latest International Physics Olympiad (IPhO) under the average-score metric; and (iii) Scaling to human expert: it further advances the open-source Pass@32 score to 26.8/30 points on the latest IPhO, ranking 4th of 406 contestants and far surpassing the top single-model score of 22.7 (ranked 22nd). Generally, PhysicsMinions offers a generalizable framework for Olympiad-level problem solving, with the potential to extend across disciplines.
Abstract（参考訳）: 物理は現実世界の理解と形成の中心であり、物理問題を解く能力は現実世界の物理知能の重要な指標である。物理オリンピアードは、競争物理学の王座として知られ、複雑な推論と深いマルチモーダルな理解を必要とする厳密なテストベッドを提供するが、AI研究においてほとんど未熟なままである。既存のアプローチは主にシングルモデルベースであり、オープンソースのMLLMはゴールドメディカルレベルのパフォーマンスに達することは滅多にない。このギャップに対処するために,我々は物理オリンピアードの共進化的マルチエージェントシステムである物理ミニオンを提案する。アーキテクチャには、ダイアグラムを解釈するVisual Studio、ソリューションを定式化するLogic Studio、デュアルステージ検証を実行するReview Studioの3つのシナジスティックスタジオがある。システムは反復的な改善ループを通じて進化し、レビュースタジオからのフィードバックが継続的にLogic Studioを案内し、システムは自己修正され、根底的な真実に向かって収束する。 HiPhOベンチマークで評価すると、Olympiadsは3つの大きなブレークスルーをもたらす。 (i)強力な一般化: 異なるサイズのオープンソースモデルとクローズドソースモデルの両方を一貫して改善し、単一のモデルベースラインに対して明確なメリットを提供する。 (二歴史的ブレークスルー平均スコアで国際物理オリンピック(IPhO)において、オリンピアード7個にまたがる1-2から6個の金メダルから、初めてオープンソース金メダルを達成し、オープンソースモデルを上昇させる。 3) 人材へのスケーリング: オープンソースのPass@32スコアを最新のIPhOで26.8/30ポイントに引き上げ、406人中4位にランクインし、シングルモデルスコアの22.7(22位)をはるかに上回っている。一般に、物理ミニオンズはオリンピアードレベルの問題解決のための一般化可能なフレームワークを提供し、分野を超えて拡張する可能性がある。

論文の概要: PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System

関連論文リスト