Fugu-MT 論文翻訳(概要): EVDI++: Event-based Video Deblurring and Interpolation via Self-Supervised Learning

論文の概要: EVDI++: Event-based Video Deblurring and Interpolation via Self-Supervised Learning

arxiv url: http://arxiv.org/abs/2509.08260v1
Date: Wed, 10 Sep 2025 03:36:24 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-11 15:16:52.299636
Title: EVDI++: Event-based Video Deblurring and Interpolation via Self-Supervised Learning
Title（参考訳）: EVDI++: 自己監督型学習によるイベントベースのビデオ分解と補間
Authors: Chi Zhang, Xiang Zhang, Chenxu Jiang, Gui-Song Xia, Lei Yu,
Abstract要約: イベントベースのビデオデブロワーリングと補間のための自己教師型フレームワークEVDI++を紹介する。イベントカメラの高時間分解能を利用して、動きのぼやけを軽減し、中間フレーム予測を可能にする。実世界のぼやけたビデオやイベントによるネットワークトレーニングを実現するために,自己教師付き学習フレームワークを提案する。
参考スコア（独自算出の注目度）: 36.86635176661841
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Frame-based cameras with extended exposure times often produce perceptible visual blurring and information loss between frames, significantly degrading video quality. To address this challenge, we introduce EVDI++, a unified self-supervised framework for Event-based Video Deblurring and Interpolation that leverages the high temporal resolution of event cameras to mitigate motion blur and enable intermediate frame prediction. Specifically, the Learnable Double Integral (LDI) network is designed to estimate the mapping relation between reference frames and sharp latent images. Then, we refine the coarse results and optimize overall training efficiency by introducing a learning-based division reconstruction module, enabling images to be converted with varying exposure intervals. We devise an adaptive parameter-free fusion strategy to obtain the final results, utilizing the confidence embedded in the LDI outputs of concurrent events. A self-supervised learning framework is proposed to enable network training with real-world blurry videos and events by exploring the mutual constraints among blurry frames, latent images, and event streams. We further construct a dataset with real-world blurry images and events using a DAVIS346c camera, demonstrating the generalizability of the proposed EVDI++ in real-world scenarios. Extensive experiments on both synthetic and real-world datasets show that our method achieves state-of-the-art performance in video deblurring and interpolation tasks.
Abstract（参考訳）: 露光時間が延長されたフレームベースのカメラは、視界のぼやけやフレーム間の情報損失を生じさせ、映像品質を著しく低下させる。この課題に対処するために、イベントベースのビデオデブリと補間のための統合されたセルフ教師付きフレームワークEVDI++を導入し、イベントカメラの高時間分解能を活用して、動きのぼやけを軽減し、中間フレームの予測を可能にする。具体的には、Learable Double Integral (LDI) ネットワークは、参照フレームとシャープ潜在画像のマッピング関係を推定するように設計されている。そして、粗い結果を洗練し、学習に基づく分割再構成モジュールを導入して総合的な訓練効率を最適化し、様々な露光間隔で画像を変換できるようにする。並列イベントのLDI出力に埋め込まれた信頼度を利用して最終結果を得るための適応型パラメータフリー融合戦略を考案する。ぼやけたフレーム,潜画像,イベントストリーム間の相互制約を探索し,実世界のぼやけたビデオやイベントによるネットワークトレーニングを実現するための,自己教師型学習フレームワークを提案する。さらに,DAVIS346cカメラを用いた実世界のぼやけた画像やイベントを用いたデータセットを構築し,実世界のシナリオにおけるEVDI++の一般化可能性を示す。合成と実世界の両方のデータセットに対する大規模な実験により,ビデオのデブロアリングと補間作業における最先端のパフォーマンスが得られた。

論文の概要: EVDI++: Event-based Video Deblurring and Interpolation via Self-Supervised Learning

関連論文リスト