Fugu-MT 論文翻訳(概要): Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model

論文の概要: Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model

arxiv url: http://arxiv.org/abs/2604.11154v1
Date: Mon, 13 Apr 2026 08:15:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:16.421868
Title: Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model
Title（参考訳）: GenAI研究の環境フットプリント:Moshi Foundation Modelからの考察
Authors: Marta López-Rauhut, Loic Landrieu, Mathieu Aubry, Anne-Laure Ligozat,
Abstract要約: 本研究では、計算集約型MLLM研究の解剖学を掘り下げ、特定のモデルコンポーネントやトレーニングフェーズに費やされたGPU時間の定量化を行う。ライフサイクルアセスメント手法を用いて,モシの創出による環境影響の評価を行った。
参考スコア（独自算出の注目度）: 21.84427881226263
License: http://creativecommons.org/licenses/by/4.0/
Abstract: New multi-modal large language models (MLLMs) are continuously being trained and deployed, following rapid development cycles. This generative AI frenzy is driving steady increases in energy consumption, greenhouse gas emissions, and a plethora of other environmental impacts linked to datacenter construction and hardware manufacturing. Mitigating the environmental consequences of GenAI remains challenging due to an overall lack of transparency by the main actors in the field. Even when the environmental impacts of specific models are mentioned, they are typically restricted to the carbon footprint of the final training run, omitting the research and development stages. In this work, we explore the impact of GenAI research through a fine-grained analysis of the compute spent to create Moshi, a 7B-parameter speech-text foundation model for real-time dialogue developed by Kyutai, a leading privately funded open science AI lab. For the first time, our study dives into the anatomy of compute-intensive MLLM research, quantifying the GPU-time invested in specific model components and training phases, as well as early experimental stages, failed training runs, debugging, and ablation studies. Additionally, we assess the environmental impacts of creating Moshi from beginning to end using a life cycle assessment methodology: we quantify energy and water consumption, greenhouse gas emissions, and mineral resource depletion associated with the production and use of datacenter hardware. Our detailed analysis allows us to provide actionable guidelines to reduce compute usage and environmental impacts of MLLM research, paving the way for more sustainable AI research.
Abstract（参考訳）: 新しいマルチモーダルな大規模言語モデル(MLLM)は、急速な開発サイクルを経て継続的に訓練され、デプロイされている。この生成的AIは、エネルギー消費の着実に増加、温室効果ガスの排出、そしてデータセンターの建設やハードウェア製造に関連する多くの環境影響を駆動している。この分野の主要アクターによる透明性の全体的な欠如により、GenAIの環境影響を緩和することは依然として困難である。特定のモデルによる環境影響が言及されているとしても、それらは通常最終トレーニングランの炭素フットプリントに制限され、研究と開発段階が省略される。そこで本研究では,民間のオープンサイエンスAIラボである久泰が開発した,リアルタイム対話のための7Bパラメータ音声テキスト基盤モデルであるMoshiの作成に要する計算の詳細な分析を通じて,GenAI研究の影響について検討する。計算集約型MLLM研究の解剖学を初めて調査し、特定のモデルコンポーネントやトレーニングフェーズ、初期の実験段階、トレーニング実行の失敗、デバッグ、アブレーション研究に費やされたGPU時間の定量化を行った。さらに, ライフサイクル評価手法として, エネルギー・水消費, 温室効果ガス排出, およびデータセンターハードウェアの生産・使用に伴う鉱物資源の枯渇を定量化する。我々の詳細な分析により、MLLM研究の計算使用量と環境への影響を減らすための実行可能なガイドラインを提供し、より持続可能なAI研究への道を開くことができる。

論文の概要: Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model

関連論文リスト