Fugu-MT 論文翻訳(概要): International AI Safety Report 2025: First Key Update: Capabilities and Risk Implications

論文の概要: International AI Safety Report 2025: First Key Update: Capabilities and Risk Implications

arxiv url: http://arxiv.org/abs/2510.13653v1
Date: Wed, 15 Oct 2025 15:13:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-16 20:13:28.729361
Title: International AI Safety Report 2025: First Key Update: Capabilities and Risk Implications
Title（参考訳）: 国際AI安全レポート2025:最初の重要なアップデート:能力とリスク含意
Authors: Yoshua Bengio, Stephen Clare, Carina Prunkl, Shalaleh Rismani, Maksym Andriushchenko, Ben Bucknall, Philip Fox, Tiancheng Hu, Cameron Jones, Sam Manning, Nestor Maslej, Vasilios Mavroudis, Conor McGlynn, Malcolm Murray, Charlotte Stix, Lucia Velasco, Nicole Wheeler, Daniel Privitera, Sören Mindermann, Daron Acemoglu, Thomas G. Dietterich, Fredrik Heintz, Geoffrey Hinton, Nick Jennings, Susan Leavy, Teresa Ludermir, Vidushi Marda, Helen Margetts, John McDermid, Jane Munga, Arvind Narayanan, Alondra Nelson, Clara Neppel, Gopal Ramchurn, Stuart Russell, Marietje Schaake, Bernhard Schölkopf, Alavaro Soto, Lee Tiedrich, Gaël Varoquaux, Andrew Yao, Ya-Qin Zhang, Leandro Aguirre, Olubunmi Ajala, Fahad Albalawi Noora AlMalek, Christian Busch, André Carvalho, Jonathan Collas, Amandeep Gill, Ahmet Hatip, Juha Heikkilä, Chris Johnson, Gill Jolly, Ziv Katzir, Mary Kerema, Hiroaki Kitano, Antonio Krüger, Aoife McLysaght, Oleksii Molchanovskyi, Andrea Monti, Kyoung Mu Lee, Mona Nemer, Nuria Oliver, Raquel Pezoa, Audrey Plonk, José Portillo, Balaraman Ravindran, Hammam Riza, Crystal Rugege, Haroon Sheikh, Denise Wong, Yi Zeng, Liming Zhu,
Abstract要約: 今回のアップデートは、AIの能力が最初のAI Safety Reportからどのように改善されたかを調べるものだ。重大な新たな証拠が評価を更新する重要なリスク領域に焦点を当てている。
参考スコア（独自算出の注目度）: 118.49965571969089
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Since the publication of the first International AI Safety Report, AI capabilities have continued to improve across key domains. New training techniques that teach AI systems to reason step-by-step and inference-time enhancements have primarily driven these advances, rather than simply training larger models. As a result, general-purpose AI systems can solve more complex problems in a range of domains, from scientific research to software development. Their performance on benchmarks that measure performance in coding, mathematics, and answering expert-level science questions has continued to improve, though reliability challenges persist, with systems excelling on some tasks while failing completely on others. These capability improvements also have implications for multiple risks, including risks from biological weapons and cyber attacks. Finally, they pose new challenges for monitoring and controllability. This update examines how AI capabilities have improved since the first Report, then focuses on key risk areas where substantial new evidence warrants updated assessments.
Abstract（参考訳）: 第1回国際AI安全レポートの公開以来、AI能力は主要な領域にわたって改善され続けている。ステップバイステップと推論時間の強化を推論するようにAIシステムに教える新しいトレーニングテクニックは、単により大きなモデルをトレーニングするのではなく、主にこれらの進歩を導いてきた。その結果、汎用AIシステムは、科学研究からソフトウェア開発まで、さまざまな領域でより複雑な問題を解決することができる。コーディング、数学、専門家レベルの科学問題に対する回答のパフォーマンスを測定するベンチマークのパフォーマンスは改善を続けているが、信頼性の課題は継続し、システムはいくつかのタスクに優れており、他のタスクは完全に失敗している。これらの能力の改善は、生物兵器によるリスクやサイバー攻撃など、複数のリスクにも影響する。最後に、監視と制御性に新たな課題を提起する。今回のアップデートでは、AIの能力が最初のレポートからどのように改善されたかを調べ、重要なリスク領域に焦点が当てられている。

論文の概要: International AI Safety Report 2025: First Key Update: Capabilities and Risk Implications

関連論文リスト