Fugu-MT 論文翻訳(概要): An Empirical Study of World Model Quantization

論文の概要: An Empirical Study of World Model Quantization

arxiv url: http://arxiv.org/abs/2602.02110v1
Date: Mon, 02 Feb 2026 13:54:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-03 19:28:34.182903
Title: An Empirical Study of World Model Quantization
Title（参考訳）: 世界モデル量子化の実証的研究
Authors: Zhongqian Fu, Tianyi Zhao, Kai Han, Hang Zhou, Xinghao Chen, Yunhe Wang,
Abstract要約: DINO-WMを用いた世界モデル量子化の系統的研究を行った。我々は、幅広いビット幅、量子化の粒度、最大50イテレーションまでの計画地平線を含む様々な視覚的計画タスクについて実験を行う。その結果,世界モデルにおける量子化効果は,標準精度やビット幅トレードオフを超えていることがわかった。
参考スコア（独自算出の注目度）: 34.94388089174202
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: World models learn an internal representation of environment dynamics, enabling agents to simulate and reason about future states within a compact latent space for tasks such as planning, prediction, and inference. However, running world models rely on hevay computational cost and memory footprint, making model quantization essential for efficient deployment. To date, the effects of post-training quantization (PTQ) on world models remain largely unexamined. In this work, we present a systematic empirical study of world model quantization using DINO-WM as a representative case, evaluating diverse PTQ methods under both weight-only and joint weight-activation settings. We conduct extensive experiments on different visual planning tasks across a wide range of bit-widths, quantization granularities, and planning horizons up to 50 iterations. Our results show that quantization effects in world models extend beyond standard accuracy and bit-width trade-offs: group-wise weight quantization can stabilize low-bit rollouts, activation quantization granularity yields inconsistent benefits, and quantization sensitivity is highly asymmetric between encoder and predictor modules. Moreover, aggressive low-bit quantization significantly degrades the alignment between the planning objective and task success, leading to failures that cannot be remedied by additional optimization. These findings reveal distinct quantization-induced failure modes in world model-based planning and provide practical guidance for deploying quantized world models under strict computational constraints. The code will be available at https://github.com/huawei-noah/noah-research/tree/master/QuantWM.
Abstract（参考訳）: 世界モデルは環境力学の内部表現を学習し、エージェントは計画、予測、推論といったタスクのために、コンパクトな潜在空間内の将来の状態をシミュレートし、推論することができる。しかし、世界モデルの実行は計算コストとメモリフットプリントに依存するため、効率的なデプロイメントにはモデル量子化が不可欠である。現在まで、世界モデルに対するポストトレーニング量子化(PTQ)の影響はほとんど検討されていない。本研究では, DINO-WMを用いた世界モデル量子化の系統的研究を行い, 重量限定および共同重量活性化条件下での多様なPTQ手法の評価を行った。我々は、幅広いビット幅、量子化の粒度、最大50イテレーションまでの計画地平線を含む様々な視覚的計画タスクについて広範な実験を行う。その結果,世界モデルにおける量子化効果は,標準精度やビット幅トレードオフを超えて,低ビットロールアウトの安定化,アクティベーション量子化粒度の不整合性,エンコーダと予測モジュール間での量子化感度は極めて非対称であることがわかった。さらに、アグレッシブな低ビット量子化は計画目標とタスク成功の整合性を著しく低下させ、さらなる最適化では修正できない失敗につながります。これらの結果は、世界モデルに基づく計画において、量子化によって引き起こされる障害モードを明らかにし、厳密な計算制約の下で量子化世界モデルをデプロイするための実践的なガイダンスを提供する。コードはhttps://github.com/huawei-noah/noah-research/tree/master/QuantWMで入手できる。

論文の概要: An Empirical Study of World Model Quantization

関連論文リスト