Fugu-MT Paper Translation (Abstract): Sci-Fi: Symmetric Constraint for Frame Inbetweening

Paper overview: Sci-Fi: Symmetric Constraint for Frame Inbetweening

arxiv url: http://arxiv.org/abs/2505.21205v1
Date: Tue, 27 May 2025 13:53:50 GMT
Status: Translation complete
System update date: 2025-05-28 17:05:58.689765
Title: Sci-Fi: Symmetric Constraint for Frame Inbetweening
Title (reference translation): Sci-Fi: Symmetric Constraint for Frame Inbetweening
Authors: Liuhan Chen, Xiaodong Cun, Xiaoyu Li, Xianyi He, Shenghai Yuan, Jie Chen, Ying Shan, Li Yuan
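Abstract summary: Frame inbetweening aims to synthesize intermediate video sequences conditioned on the given start and end frames. Current state-of-the-art methods mainly extend large-scale pre-trained image-to-video diffusion models. The authors propose a novel framework, termed Sci-Fi, which applies a stronger injection for the constraint of a smaller training scale.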
Reference score (independently computed attention score): 52.6883373124261
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Frame inbetweening aims to synthesize intermediate video sequences conditioned on the given start and end frames. Current state-of-the-art methods mainly extend large-scale pre-trained Image-to-Video Diffusion models (I2V-DMs) by incorporating end-frame constraints via directly fine-tuning or omitting training. We identify a critical limitation in their design: Their injections of the end-frame constraint usually utilize the same mechanism that originally imposed the start-frame (single image) constraint. However, since the original I2V-DMs are adequately trained for the start-frame condition in advance, naively introducing the end-frame constraint by the same mechanism with much less (even zero) specialized training probably can't make the end frame have a strong enough impact on the intermediate content like the start frame. This asymmetric control strength of the two frames over the intermediate content likely leads to inconsistent motion or appearance collapse in generated frames. To efficiently achieve symmetric constraints of start and end frames, we propose a novel framework, termed Sci-Fi, which applies a stronger injection for the constraint of a smaller training scale. Specifically, it deals with the start-frame constraint as before, while introducing the end-frame constraint by an improved mechanism. The new mechanism is based on a well-designed lightweight module, named EF-Net, which encodes only the end frame and expands it into temporally adaptive frame-wise features injected into the I2V-DM. This makes the end-frame constraint as strong as the start-frame constraint, enabling our Sci-Fi to produce more harmonious transitions in various scenarios. Extensive experiments prove the superiority of our Sci-Fi compared with other baselines.
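Abstract (reference translation): Frame inbetweening aims to synthesize intermediate video sequences conditioned on the given start and end frames. Current state-of-the-art methods mainly extend large-scale pre-trained image-to-video diffusion models (I2V-DMs) by incorporating end-frame constraints through direct fine-tuning or by omitting training. Their injection of the end-frame constraint usually uses the same mechanism that originally imposed the start-frame (single-image) constraint. However, because the original I2V-DMs are already adequately trained for the start-frame condition, naively introducing the end-frame constraint through the same mechanism with much less (even zero) specialized training probably cannot give the end frame as strong an influence on the intermediate content as the start frame has. This asymmetric control strength of the two frames over the intermediate content likely leads to inconsistent motion or appearance collapse in the generated frames. To efficiently achieve symmetric constraints of the start and end frames, we propose a novel framework, termed Sci-Fi, which applies a stronger injection for the constraint of a smaller training scale. Specifically, it handles the start-frame constraint as before, while introducing the end-frame constraint through an improved mechanism. The new mechanism is based on a lightweight module, named EF-Net, which encodes only the end frame and expands it into temporally adaptive frame-wise features that are injected into the I2V-DM. This makes the end-frame constraint as strong as the start-frame constraint, enabling Sci-Fi to produce more harmonious transitions in various scenarios. Extensive experiments demonstrate the superiority of Sci-Fi over other baselines.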


Related papers