Fugu-MT paper translation (abstract): Incoherent Deformation, Not Capacity: Diagnosing and Mitigating Overfitting in Dynamic Gaussian Splatting

Paper overview: Incoherent Deformation, Not Capacity: Diagnosing and Mitigating Overfitting in Dynamic Gaussian Splatting

arxiv url: http://arxiv.org/abs/2604.16747v1
Date: Fri, 17 Apr 2026 23:41:50 GMT
Status: Translation complete
Last updated in system: 2026-04-21 21:52:52.156534
Title: Incoherent Deformation, Not Capacity: Diagnosing and Mitigating Overfitting in Dynamic Gaussian Splatting
Title (reference translation): Incoherent Deformation, Not Capacity: Diagnosing and Mitigating Overfitting in Dynamic Gaussian Splatting
Authors: Ahmad Droby
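Abstract summary: Dynamic 3D Gaussian Splatting methods achieve strong training-view PSNR on monocular video but generalize poorly on the D-NeRF benchmark. The average train-test PSNR gap is 6.18 dB, rising to 11 dB on individual scenes. Elastic Energy Regularization (EER) reduces the gap by 40.8% while growing the cloud by 85%.
Reference score (independently computed attention score): 0.0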
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Dynamic 3D Gaussian Splatting methods achieve strong training-view PSNR on monocular video but generalize poorly: on the D-NeRF benchmark we measure an average train-test PSNR gap of 6.18 dB, rising to 11 dB on individual scenes. We report two findings that together account for most of that gap. Finding 1 (the role of splitting). A systematic ablation of the Adaptive Density Control pipeline (split, clone, prune, frequency, threshold, schedule) shows that splitting is responsible for over 80% of the gap: disabling split collapses the cloud from 44K to 3K Gaussians and the gap from 6.18 dB to 1.15 dB. Across all threshold-varying ablations, gap is log-linear in count (r = 0.995, bootstrap 95% CI [0.99, 1.00]), which suggests a capacity-based explanation. Finding 2 (the role of deformation coherence). We show that the capacity explanation is incomplete. A local-smoothness penalty on the per-Gaussian deformation field -- Elastic Energy Regularization (EER) -- reduces the gap by 40.8% while growing the cloud by 85%. Measuring per-Gaussian strain directly on trained checkpoints, EER reduces mean strain by 99.72% (median 99.80%) across all 8 scenes; on 8/8 scenes the median Gaussian under EER is less strained than the 1st-percentile (best-behaved) Gaussian under baseline. Alongside EER, we evaluate two further regularizers: GAD, a loss-rate-aware densification threshold, and PTDrop, a jitter-weighted Gaussian dropout. GAD+EER reduces the gap by 48%; adding PTDrop and a soft growth cap reaches 57%. We confirm that coherence generalizes to (a) a different deformation architecture (Deformable-3DGS, +40.6% gap reduction at re-tuned lambda), and (b) real monocular video (4 HyperNeRF scenes, reducing the mean PSNR gap by 14.9% at the same lambda as D-NeRF, with near-zero quality cost). The overfitting in dynamic 3DGS is driven by incoherent deformation, not parameter count.
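The abstract describes EER only as a local-smoothness penalty on the per-Gaussian deformation field and does not give its formula. The following is a minimal sketch of one way such a penalty could look, assuming a k-nearest-neighbour graph over canonical Gaussian centres and a squared-difference term on neighbouring displacements. The function names `knn_indices` and `smoothness_penalty`, the choice of `k`, and the synthetic data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def knn_indices(points: np.ndarray, k: int) -> np.ndarray:
    """Return the indices of the k nearest neighbours of each point (brute force)."""
    diff = points[:, None, :] - points[None, :, :]   # (N, N, 3) pairwise offsets
    d2 = np.einsum("ijk,ijk->ij", diff, diff)        # (N, N) squared distances
    np.fill_diagonal(d2, np.inf)                     # a point is not its own neighbour
    return np.argsort(d2, axis=1)[:, :k]             # (N, k)


def smoothness_penalty(canonical: np.ndarray, displacement: np.ndarray, k: int = 4) -> float:
    """Mean squared difference between each Gaussian's displacement and its neighbours'.

    ASSUMPTION: an illustrative stand-in for a local-smoothness term,
    not the paper's EER definition.
    """
    nbr = knn_indices(canonical, k)                          # (N, k)
    delta = displacement[:, None, :] - displacement[nbr]     # (N, k, 3)
    return float(np.mean(np.sum(delta ** 2, axis=-1)))


# Synthetic example: a coherent (uniform) shift versus the same shift with per-Gaussian jitter.
rng = np.random.default_rng(0)
canonical = rng.normal(size=(64, 3))
coherent = np.tile([0.1, 0.0, -0.2], (64, 1))
incoherent = coherent + 0.05 * rng.normal(size=(64, 3))

print(smoothness_penalty(canonical, coherent))    # uniform shift: neighbours agree exactly
print(smoothness_penalty(canonical, incoherent))  # jittered shift: neighbours disagree
```

Under this sketch a uniform translation incurs zero penalty, while independent per-Gaussian jitter is penalized. Note that this simple term also penalizes rigid rotations, so a practical regularizer would likely need a formulation that accounts for them.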
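Abstract (reference translation): Dynamic 3D Gaussian Splatting methods reach strong training-view PSNR on monocular video but generalize poorly: on the D-NeRF benchmark we measure an average train-test PSNR gap of 6.18 dB, rising to 11 dB on individual scenes. We report two findings that together explain most of that gap. Finding 1 (the role of splitting). A systematic ablation of the Adaptive Density Control pipeline (split, clone, prune, frequency, threshold, schedule) shows that splitting accounts for over 80% of the gap. Across all threshold-varying ablations, the gap is log-linear in Gaussian count (r = 0.995, bootstrap 95% CI [0.99, 1.00]), which suggests a capacity-based explanation. Finding 2 (the role of deformation coherence). We show that the capacity explanation is incomplete. A local-smoothness penalty on the per-Gaussian deformation field, Elastic Energy Regularization (EER), reduces the gap by 40.8% while growing the cloud by 85%. Measuring per-Gaussian strain directly on trained checkpoints, EER reduces mean strain by 99.72% (median 99.80%) across all 8 scenes. Alongside EER, we evaluate two further regularizers: GAD, a loss-rate-aware densification threshold, and PTDrop, a jitter-weighted Gaussian dropout. GAD+EER reduces the gap by 48%; adding PTDrop and a soft growth cap reaches 57%. We confirm that coherence generalizes to (a) a different deformation architecture (Deformable-3DGS, +40.6% gap reduction) and (b) real monocular video (4 HyperNeRF scenes, reducing the mean PSNR gap by 14.9% at the same lambda as D-NeRF, with near-zero quality cost). Overfitting in dynamic 3DGS is driven by incoherent deformation, not parameter count.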
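As a quick sanity check on the Finding 1 figures quoted in the abstract, the relative reductions can be recomputed from the reported before/after values. This is only arithmetic on the rounded numbers given in the abstract (6.18 dB to 1.15 dB, and 44K to 3K Gaussians). The helper name `relative_reduction` is hypothetical.

```python
def relative_reduction(before: float, after: float) -> float:
    """Fraction by which a quantity shrinks when going from `before` to `after`."""
    return (before - after) / before


# Values as quoted in the abstract (rounded there, so the results are approximate).
gap_reduction = relative_reduction(6.18, 1.15)        # train-test PSNR gap in dB
count_reduction = relative_reduction(44_000, 3_000)   # number of Gaussians

print(f"gap reduction with split disabled:   {gap_reduction:.1%}")    # 81.4%
print(f"count reduction with split disabled: {count_reduction:.1%}")  # 93.2%
```

The first figure is consistent with the abstract's statement that splitting is responsible for over 80% of the gap.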

Related papers

Quantization Dominates Rank Reduction for KV-Cache Compression [0.0]
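Quantization consistently outperforms rank reduction by 4-364 PPL, depending on the model and compression level. We formalize this with a result showing that, under the softmax Fisher metric, projection damage exceeds quantization damage by 3 x 2 (2b) per direction.
Paper reference translation (metadata) (2026-04-13T14:06:18Z)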
When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems [0.6875312133832079]
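Generative models are widely used to compensate for class imbalance in AI training pipelines. FastGAN augmentation not only underperforms at very small training-set sizes but also actively increases bias. Stable Diffusion with low-rank adaptation achieved the best results overall.
Paper reference translation (metadata) (2026-03-17T05:37:17Z)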
BadCLIP++: Stealthy and Persistent Backdoors in Multimodal Contrastive Learning [73.46118996284888]
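Research on backdoor attacks against multimodal contrastive learning models faces two major challenges: stealthiness and persistence. We propose BadCLIP++, a unified framework that addresses both. For stealthiness, we introduce a semantic-fusion QR micro-trigger that embeds imperceptible patterns near task-relevant regions. For persistence, we stabilize trigger embeddings via radius shrinkage and centroid alignment.
Paper reference translation (metadata) (2026-02-19T08:31:16Z)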
Potential-energy gating for robust state estimation in bistable stochastic systems [0.0]
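We introduce potential-energy gating, a robust state-estimation method for systems governed by double-well dynamics. We implement the gating within extended, unscented, ensemble, and adaptive Kalman filters.
Paper reference translation (metadata) (2026-02-12T08:43:34Z)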
Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization [6.908972852063454]
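Temporal action localization requires both accurate boundary detection and computational efficiency. We address this through two complementary innovations: Boundary Distance Regression (BDR) and Adaptive Temporal Refinement (ATR). On THUMOS14, we achieve 56.5% mAP@0.7 at 151G FLOPs, 36% fewer FLOPs than ActionFormer++ (55.7% mAP@0.7 at 235G).
Paper reference translation (metadata) (2025-11-06T00:41:54Z)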
Ensemble Threshold Calibration for Stable Sensitivity Control [0.0]
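We propose an end-to-end framework that achieves precisely targeted recall over tens of millions of geometry pairs. Our approach consistently hits the recall target within a small error, reduces redundant verifications compared with other calibration methods, and runs end-to-end on a single TPU v3 core.
Paper reference translation (metadata) (2025-10-02T15:22:28Z)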
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS [55.85673901231235]
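LightGaussian is a method that converts 3D Gaussians into a more compact format. Inspired by network pruning, LightGaussian identifies Gaussians with minimal global significance for scene reconstruction. LightGaussian achieves an average 15x compression rate while raising FPS from 144 to 237 within the 3D-GS framework.
Paper reference translation (metadata) (2023-11-28T21:39:20Z)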
Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning [79.43940012723539]
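ADCLR is a self-supervised learning framework for learning accurate and dense visual representations. The proposed method achieves new state-of-the-art performance among contrastive methods.
Paper reference translation (metadata) (2023-06-23T07:38:09Z)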
On the Training Instability of Shuffling SGD with Batch Normalization [44.28777474091466]
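Single shuffle (SS) and random reshuffle (RR) interact surprisingly differently in the presence of batch normalization. SS leads to distortion in regression and divergence in classification, whereas RR avoids both distortion and divergence.
Paper reference translation (metadata) (2023-02-24T04:10:54Z)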
On the Double Descent of Random Features Models Trained with SGD [78.0918823643911]
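We study the properties of random features (RF) regression in high dimensions optimized by stochastic gradient descent (SGD). We derive precise non-asymptotic error bounds for RF regression under both constant and adaptive step-size SGD settings. We observe the double descent phenomenon both theoretically and empirically.
Paper reference translation (metadata) (2021-10-13T17:47:39Z)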

The related-papers list is generated automatically from the titles and abstracts of papers on this site.

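The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any results arising from the use of this site (including all information and translations).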