Fugu-MT 論文翻訳(概要): VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance

論文の概要: VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance

arxiv url: http://arxiv.org/abs/2510.21461v1
Date: Fri, 24 Oct 2025 13:44:09 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 09:00:15.491825
Title: VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance
Title（参考訳）: VidSplice:Explicit Spaced Frame Guidanceによるコヒーレントなビデオインペインティングを目指す
Authors: Ming Xie, Junqiu Yu, Qiaole Dong, Xiangyang Xue, Yanwei Fu,
Abstract要約: VidSpliceは、テンポラリな手口でペンキを塗るプロセスをガイドする新しいフレームワークである。 VidSpliceは様々な映像のインパインティングシナリオで競争力を発揮することを示す。
参考スコア（独自算出の注目度）: 57.57195766748601
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent video inpainting methods often employ image-to-video (I2V) priors to model temporal consistency across masked frames. While effective in moderate cases, these methods struggle under severe content degradation and tend to overlook spatiotemporal stability, resulting in insufficient control over the latter parts of the video. To address these limitations, we decouple video inpainting into two sub-tasks: multi-frame consistent image inpainting and masked area motion propagation. We propose VidSplice, a novel framework that introduces spaced-frame priors to guide the inpainting process with spatiotemporal cues. To enhance spatial coherence, we design a CoSpliced Module to perform first-frame propagation strategy that diffuses the initial frame content into subsequent reference frames through a splicing mechanism. Additionally, we introduce a delicate context controller module that encodes coherent priors after frame duplication and injects the spliced video into the I2V generative backbone, effectively constraining content distortion during generation. Extensive evaluations demonstrate that VidSplice achieves competitive performance across diverse video inpainting scenarios. Moreover, its design significantly improves both foreground alignment and motion stability, outperforming existing approaches.
Abstract（参考訳）: 近年の映像塗装法では、マスクフレーム間の時間的一貫性をモデル化するために、I2V(Image-to-Video)プリミティブを用いることが多い。中程度のケースでは有効であるが、これらの手法は厳しい内容劣化に苦慮し、時空間安定性を見落とし、ビデオの後半部分の制御が不十分になる傾向にある。これらの制約に対処するため、ビデオのインペイントを2つのサブタスクに分割する。 VidSpliceは空間フレームの先行処理を導入し,時空間的手法による塗布プロセスのガイドを行う新しいフレームワークである。空間コヒーレンスを高めるために,最初のフレームをスプライシング機構を通じて参照フレームに拡散させる第1フレームの伝搬戦略を行うCoSpliced Moduleを設計する。さらに,フレーム重複後のコヒーレント先行を符号化し,スプリケートされた映像をI2V生成バックボーンに注入し,生成時のコンテンツ歪みを効果的に抑制する,繊細なコンテキスト制御モジュールを導入する。 VidSpliceの大規模な評価は、多様なビデオインパインティングシナリオ間での競争性能を実現することを示している。さらに、その設計は前景アライメントと運動安定性の両方を著しく改善し、既存のアプローチよりも優れている。

論文の概要: VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance

関連論文リスト