Fugu-MT 論文翻訳(概要): GS-STVSR: Ultra-Efficient Continuous Spatio-Temporal Video Super-Resolution via 2D Gaussian Splatting

論文の概要: GS-STVSR: Ultra-Efficient Continuous Spatio-Temporal Video Super-Resolution via 2D Gaussian Splatting

arxiv url: http://arxiv.org/abs/2604.18047v1
Date: Mon, 20 Apr 2026 10:11:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.804059
Title: GS-STVSR: Ultra-Efficient Continuous Spatio-Temporal Video Super-Resolution via 2D Gaussian Splatting
Title（参考訳）: GS-STVSR:2Dガウス法による超高効率連続時空間ビデオ超解像
Authors: Mingyu Shi, Xin Di, Long Peng, Boxiang Cao, Anran Wu, Zhanfeng Feng, Jiaming Guo, Renjing Pei, Xueyang Fu, Yang Cao, Zhengjun Zha,
Abstract要約: Continuous Video Super-Resolution (C-STVSR) は、任意のスケールファクタによる動画の空間的・フレーム的レートの同時向上を目的としている。 2次元ガウシアンティング(2D-GS)に基づく超効率的なC-STVSRフレームワークであるGS-STVSRを提案する。 GS-STVSRは,すべてのベンチマークにおいて最先端の品質を実現していることを示す。
参考スコア（独自算出の注目度）: 71.12303582606019
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Continuous Spatio-Temporal Video Super-Resolution (C-STVSR) aims to simultaneously enhance the spatial resolution and frame rate of videos by arbitrary scale factors, offering greater flexibility than fixed-scale methods that are constrained by predefined upsampling ratios. In recent years, methods based on Implicit Neural Representations (INR) have made significant progress in C-STVSR by learning continuous mappings from spatio-temporal coordinates to pixel values. However, these methods fundamentally rely on dense pixel-wise grid queries, causing computational cost to scale linearly with the number of interpolated frames and severely limiting inference efficiency. We propose GS-STVSR, an ultra-efficient C-STVSR framework based on 2D Gaussian Splatting (2D-GS) that drives the spatiotemporal evolution of Gaussian kernels through continuous motion modeling, bypassing dense grid queries entirely. We exploit the strong temporal stability of covariance parameters for lightweight intermediate fitting, design an optical flow-guided motion module to derive Gaussian position and color at arbitrary time steps, introduce a Covariance resampling alignment module to prevent covariance drift, and propose an adaptive offset window for large-scale motion. Extensive experiments on Vid4, GoPro, and Adobe240 show that GS-STVSR achieves state-of-the-art quality across all benchmarks. Moreover, its inference time remains nearly constant at conventional temporal scales (X2--X8) and delivers over X3 speedup at extreme scales X32, demonstrating strong practical applicability.
Abstract（参考訳）: 連続時空間ビデオ超解法(Continuous Spatio-Temporal Video Super-Resolution, C-STVSR)は、ビデオの空間分解能とフレームレートを任意のスケールファクターで同時に向上することを目的としており、事前定義されたアップサンプリング比によって制約される固定スケール法よりも高い柔軟性を提供する。近年, Inlicit Neural Representations (INR) に基づく手法は, 時空間座標から画素値への連続写像を学習することによってC-STVSRにおいて大きな進歩を遂げている。しかし、これらの手法は基本的に高密度のピクセルワイドグリッドクエリに依存しており、計算コストは補間フレームの数と線形にスケールし、推論効率を著しく制限する。 2D-GSをベースとした超効率的なC-STVSRフレームワークであるGS-STVSRを提案する。我々は、軽量中間嵌合における共分散パラメータの強い時間的安定性を活用し、ガウスの位置と色を任意の時間ステップで導くための光フロー誘導運動モジュールを設計し、共分散ドリフトを防止するための共分散再サンプリングアライメントモジュールを導入し、大規模運動のための適応オフセットウィンドウを提案する。 Vid4、GoPro、Adobe240の大規模な実験によると、GS-STVSRはすべてのベンチマークで最先端の品質を実現している。さらに、その推定時間は従来の時間スケール(X2--X8)でほぼ一定であり、極端スケールのX32でX3スピードアップを達成し、強力な実用性を示す。

論文の概要: GS-STVSR: Ultra-Efficient Continuous Spatio-Temporal Video Super-Resolution via 2D Gaussian Splatting

関連論文リスト