Fugu-MT 論文翻訳(概要): Muse Spark Safety & Preparedness Report

論文の概要: Muse Spark Safety & Preparedness Report

arxiv url: http://arxiv.org/abs/2606.12429v1
Date: Thu, 14 May 2026 23:12:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 07:09:36.89155
Title: Muse Spark Safety & Preparedness Report
Title（参考訳）: Muse Spark Safety and Preparedness Report
Authors: Cristina Menghini, Peter Ney, Hamza Kwisaba, Zifan, Wang, Miles Turpin, Felix Binder, Jean-Christophe Testud, Aidan Boyd, Nathaniel Li, Ivan Evtimov, Klaudia Krawiecka, Arman Zharmagambetov, Jeremy Kritz, Alexander R. Fabbri, Daniel Song, Jinpeng Miao, Joonas Hjelt, Meghna Ramani, Leona Lan, Reza Aghajani, Joanna Bitton, Mahesh Pasupuleti, Devin Norder, Khalid El-Arini, Paridhi Singh, Vítor Albiero, Sahana CB, Rashnil Chaturvedi, Elahe Dabir, Edoardo Debenedetti, Jim Gust, Ziwen Han, Kat He, Sean Hendryx, Lifeng Jin, Polina Kirichenko, Sandra Lefdal, Kenneth Li, Asad Liaqat, Inna Lin, Despoina Magka, Neal Mangaokar, Ishita Mediratta, Zach Miller, Smitha Milli, Niloofar Mireshghallah, Saba Nazir, Hung Nguyen, Maximilian Nickel, Kelvin Niu, Kerem Oktar, Bhargavi Paranjape, Parth Pathak, Maya Pavlova, Emmanuel Ramirez, David Renardy, Candace Ross, Yasha Sheynin, Claudia Shi, Shivam Singhal, Evangelia Spiliopoulou, Rakshith Sharma Srinivasa, Jamelle Watson-Daniels, Spencer Whitman, Adina Williams, Chen Xing, Andy Zou, Tommy Ma, Siqi Deng, James Beldock, Prashant Ratanchandani, Kate Plawiak, Taesung Lee, Ryan Victory, Lindsay Hundley, Rachad Alao, Himaghna Bhattacharjee, Jianfeng Chi, Gary Frost, Pegah Ghahremani, Niki Howe, Yuheng Huang, Saeed Jahed, Hannah Korevaar, Trang Le, Zhe Liu, Jinghong Luo, Qin Lyu, Nina Mehrabi, Abraham Montilla, Chirag Nagpal, Cyrus Nikolaidis, Rajvardhan Oak, Manoj Ravi, Vidya Sarma, Aman Shankar, Alana Shine, Eric Michael Smith, Mariana Tandon, Michael Tontchev, Caoyu Wang, Zihan Wang, Corinne Wong, Zheng Wu, Hongyuan Zhan, Justin Zhao, Zexuan Zhong, Chengxu Zhuang, Tristan Goodman, Ayaz Minhas, Harrison Rudolph, Victoria Jeffries, Ingrid Dickinson, Alex Vaughan, Lauren Deason, Kamalika Chaudhuri, Julian Michael, Shengjia Zhao, Summer Yue,
Abstract要約: Muse SparkはMetaが開発した最新の大規模言語モデルだ。われわれはまず,MetaのAdvanced AI Scaling Frameworkの下で破滅的なリスクドメインの評価を行った。次に、Muse Sparkの広範なコンテンツ安全性や行動プロファイルなど、さらなる考慮事項について論じる。
参考スコア（独自算出の注目度）: 106.21435337776768
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Muse Spark is the latest large language model developed by Meta. In this report, we first present evaluations for catastrophic risk domains under Meta's Advanced AI Scaling Framework, along with the evidence that informed our launch decision. We then discuss additional considerations, such as Muse Spark's broader content safety and behavioral profile, that are relevant to overall safety but fall outside the catastrophic risk domains governed by the Framework. Our preparedness results covering Chemical and Biological, Cybersecurity, and Loss of Control risks assess Muse Spark's deployment within Meta AI as presenting acceptable levels of residual risks under our Advanced AI Scaling Framework. We conducted a broad set of evaluations targeting dual-use and high-risk capabilities across these catastrophic risk domains. Those evaluations identified elevated risks prior to mitigations, with Chemical and Biological capabilities assessed as likely reaching the "high risk" category under the Advanced AI Scaling Framework before safeguards were applied. We have implemented a multi-layered set of mitigations that address the identified risks, and Muse Spark demonstrates state-of-the-art refusal across a range of benchmarks related to hazardous workflows in chemistry and biology. We therefore release Muse Spark as the underlying model of Meta AI.
Abstract（参考訳）: Muse SparkはMetaが開発した最新の大規模言語モデルだ。本稿では,Metaの高度なAIスケーリングフレームワーク(Advanced AI Scaling Framework)の下で,破滅的なリスクドメインの評価を行った。次に、Muse Sparkの広範なコンテンツ安全性と行動プロファイルなど、全体的な安全性に関連するが、フレームワークが管理する破滅的なリスクドメインの外にある、追加の考慮事項について論じる。化学、生物学、サイバーセキュリティ、制御損失のリスクをカバーした準備結果では、Meta AI内のMuse Sparkのデプロイメントを、Advanced AI Scaling Frameworkの下で許容される残留リスクレベルとして評価しています。これらの破滅的なリスクドメインにまたがって、両用および高リスク機能を対象とした幅広い評価を行った。これらの評価では、予防措置が適用される前に、Advanced AI Scaling Frameworkの下で「高いリスク」のカテゴリに到達する可能性があると評価された。 Muse Sparkは、化学物質や生物学における有害なワークフローに関連するさまざまなベンチマークにおいて、最先端の拒絶を実証しています。そのため、Meta AIの基盤モデルとしてMuse Sparkをリリースしています。

論文の概要: Muse Spark Safety & Preparedness Report

関連論文リスト