Fugu-MT 論文翻訳(概要): GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading

論文の概要: GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading

arxiv url: http://arxiv.org/abs/2509.15645v1
Date: Fri, 19 Sep 2025 06:13:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-22 18:18:11.023453
Title: GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading
Title（参考訳）: GS-Scale:大規模3Dガウスプラッティングトレーニングをホストオフロードで開始
Authors: Donghyun Lee, Dawoon Jeong, Jae W. Lee, Hongil Yoon,
Abstract要約: 3D Gaussian Splattingは、高画質と高速なレンダリング速度を提供することによって、グラフィックレンダリングに革命をもたらした。大規模なシーンを高品質でトレーニングすることは、かなりのメモリ需要のため、依然として困難である。本稿では,3次元ガウス平滑化のための高速かつメモリ効率のトレーニングシステムであるGS-Scaleを提案する。
参考スコア（独自算出の注目度）: 9.776813771006358
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The advent of 3D Gaussian Splatting has revolutionized graphics rendering by delivering high visual quality and fast rendering speeds. However, training large-scale scenes at high quality remains challenging due to the substantial memory demands required to store parameters, gradients, and optimizer states, which can quickly overwhelm GPU memory. To address these limitations, we propose GS-Scale, a fast and memory-efficient training system for 3D Gaussian Splatting. GS-Scale stores all Gaussians in host memory, transferring only a subset to the GPU on demand for each forward and backward pass. While this dramatically reduces GPU memory usage, it requires frustum culling and optimizer updates to be executed on the CPU, introducing slowdowns due to CPU's limited compute and memory bandwidth. To mitigate this, GS-Scale employs three system-level optimizations: (1) selective offloading of geometric parameters for fast frustum culling, (2) parameter forwarding to pipeline CPU optimizer updates with GPU computation, and (3) deferred optimizer update to minimize unnecessary memory accesses for Gaussians with zero gradients. Our extensive evaluations on large-scale datasets demonstrate that GS-Scale significantly lowers GPU memory demands by 3.3-5.6x, while achieving training speeds comparable to GPU without host offloading. This enables large-scale 3D Gaussian Splatting training on consumer-grade GPUs; for instance, GS-Scale can scale the number of Gaussians from 4 million to 18 million on an RTX 4070 Mobile GPU, leading to 23-35% LPIPS (learned perceptual image patch similarity) improvement.
Abstract（参考訳）: 3D Gaussian Splattingの出現は、高画質と高速なレンダリング速度を提供することによって、グラフィックレンダリングに革命をもたらした。しかし、パラメータ、勾配、オプティマイザ状態を保存するために必要なメモリ要求がかなりあるため、高品質で大規模なシーンをトレーニングすることは依然として難しい。これらの制約に対処するために,3次元ガウススプラッティングのための高速かつメモリ効率のトレーニングシステムであるGS-Scaleを提案する。 GS-Scaleはすべてのガウシアンをホストメモリに格納し、フォワードパスとバックパスの要求に応じてサブセットだけGPUに転送する。これはGPUメモリの使用を劇的に削減するが、CPU上で実行するためにフラストタルカリングとオプティマイザの更新が必要である。これを軽減するため、GS-Scaleでは、(1)高速フラストタルカリングのための幾何パラメータの選択的オフロード、(2)GPU計算によるCPUオプティマイザ更新のためのパラメータフォワード、(3)グラデーションゼロガウスの不要メモリアクセスを最小限に抑えるための遅延オプティマイザ更新の3つのシステムレベル最適化を採用している。大規模データセットに対する我々の広範な評価は、GS-ScaleがGPUメモリの要求を3.3-5.6倍削減し、ホストオフロードなしでGPUに匹敵するトレーニング速度を達成することを示した。例えば、GS-Scaleは、RTX 4070 Mobile GPU上で、ガウスの数を400万から1800万に拡張することができ、23-35%のLPIPS(知覚的イメージパッチ類似性)が改善される。

論文の概要: GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading

関連論文リスト