Fugu-MT 論文翻訳(概要): MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction

論文の概要: MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction

arxiv url: http://arxiv.org/abs/2603.15330v1
Date: Mon, 16 Mar 2026 14:21:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 18:28:58.424754
Title: MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction
Title（参考訳）: 3DレコンストラクションをストリーミングするMeMix
Authors: Jiacheng Dong, Huan Li, Sicheng Zhou, Wenhao Hu, Weili Xu, Yan Wang,
Abstract要約: トレーニング不要のプラグイン・アンド・プレイモジュールであるMeMixを,リカレント状態をメモリミックスに再キャストすることで,ストリーミング再生を改善する。 MeMixは状態を複数の独立したメモリパッチに分割し、他のメモリを保存しながら最下位のメモリパッチだけを更新する。 7シーンで300-500フレームストリームにわたる平均で15.3%(最大40.0%)の復元完全性エラーを低減する。
参考スコア（独自算出の注目度）: 11.01217345606816
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Reconstruction is a fundamental task in 3D vision and a fundamental capability for spatial intelligence. Particularly, streaming 3D reconstruction is central to real-time spatial perception, yet existing recurrent online models often suffer from progressive degradation on long sequences due to state drift and forgetting, motivating inference-time remedies. We present MeMix, a training-free, plug-and-play module that improves streaming reconstruction by recasting the recurrent state into a Memory Mixture. MeMix partitions the state into multiple independent memory patches and updates only the least-aligned memory patches while exactly preserving others. This selective update mitigates catastrophic forgetting while retaining $O(1)$ inference memory, and requires no fine-tuning or additional learnable parameters, making it directly applicable to existing recurrent reconstruction models. Across standard benchmarks (ScanNet, 7-Scenes, KITTI, etc.), under identical backbones and inference settings, MeMix reduces reconstruction completeness error by 15.3% on average (up to 40.0%) across 300--500 frame streams on 7-Scenes. The code is available at https://dongjiacheng06.github.io/MeMix/
Abstract（参考訳）: レコンストラクションは3次元視覚の基本課題であり、空間知能の基本的な能力である。特に、ストリーミング3D再構成は、リアルタイムな空間認識の中心であるが、既存のオンラインモデルは、状態の漂流と忘れ、推論時間の改善を動機とする長いシーケンスの進行劣化に悩まされることが多い。トレーニング不要のプラグイン・アンド・プレイモジュールであるMeMixを,リカレント状態をメモリミックスに再キャストすることで,ストリーミング再生を改善する。 MeMixは状態を複数の独立したメモリパッチに分割し、最小整列のメモリパッチのみを更新すると同時に、他を正確に保存する。この選択的な更新は、$O(1)$の推論メモリを保持しながら破滅的な忘れを軽減し、微調整や追加の学習可能なパラメータを必要としないため、既存のリカレントリコンストラクションモデルに直接適用することができる。標準ベンチマーク(ScanNet, 7-Scenes, KITTIなど)を通じて、同じバックボーンと推論設定の下で、MeMixは7-Scenes上の300-500フレームストリームの平均(最大40.0%)で復元完全性エラーを15.3%削減する。コードはhttps://dongjiacheng06.github.io/MeMix/で公開されている。

論文の概要: MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction

関連論文リスト