Fugu-MT 論文翻訳(概要): LoCAtion: Long-time Collaborative Attention Framework for High Dynamic Range Video Reconstruction

論文の概要: LoCAtion: Long-time Collaborative Attention Framework for High Dynamic Range Video Reconstruction

arxiv url: http://arxiv.org/abs/2603.14377v1
Date: Sun, 15 Mar 2026 13:34:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.782231
Title: LoCAtion: Long-time Collaborative Attention Framework for High Dynamic Range Video Reconstruction
Title（参考訳）: LoCAtion:高ダイナミックレンジビデオ再構成のための長時間協調注意フレームワーク
Authors: Qianyu Zhang, Bolun Zheng, Lingyu Zhu, Aiai Huang, Zongpeng Li, Shiqi Wang,
Abstract要約: 本稿では,脆弱な空間ワープタスクからHDR映像を生成するフレームワークであるLoCAtionを,頑健でアライメントのない協調的特徴ルーティング問題に再構成する。 Locationは最先端の視覚的品質と時間的安定性を実現し、精度と計算効率の非常に競争力のあるバランスを提供する。
参考スコア（独自算出の注目度）: 17.88716377235245
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Prevailing High Dynamic Range (HDR) video reconstruction methods are fundamentally trapped in a fragile alignment-and-fusion paradigm. While explicit spatial alignment can successfully recover fine details in controlled environments, it becomes a severe bottleneck in unconstrained dynamic scenes. By forcing rigid alignment across unpredictable motions and varying exposures, these methods inevitably translate registration errors into severe ghosting artifacts and temporal flickering. In this paper, we rethink this conventional prerequisite. Recognizing that explicit alignment is inherently vulnerable to real-world complexities, we propose LoCAtion, a Long-time Collaborative Attention framework that reformulates HDR video generation from a fragile spatial warping task into a robust, alignment-free collaborative feature routing problem. Guided by this new formulation, our architecture explicitly decouples the highly entangled reconstruction task. Rather than struggling to rigidly warp neighboring frames, we anchor the scene on a continuous medium-exposure backbone and utilize collaborative attention to dynamically harvest and inject reliable irradiance cues from unaligned exposures. Furthermore, we introduce a learned global sequence solver. By leveraging bidirectional context and long-range temporal modeling, it propagates corrective signals and structural features across the entire sequence, inherently enforcing whole-video coherence and eliminating jitter. Extensive experiments demonstrate that LoCAtion achieves state-of-the-art visual quality and temporal stability, offering a highly competitive balance between accuracy and computational efficiency.
Abstract（参考訳）: 高ダイナミックレンジ (HDR) ビデオ再構成法は, 脆弱なアライメント・アンド・フュージョンのパラダイムに根本的に閉じ込められている。空間的アライメントは制御された環境の細部を再現できるが、制約のない動的シーンでは深刻なボトルネックとなる。予測不可能な動きと様々な露出に厳密なアライメントを強制することにより、登録ミスを必然的に深刻なゴーストや時間的ひねりに翻訳する。本稿では,従来の前提条件を再考する。実世界の複雑度に対して,明示的なアライメントが本質的に脆弱であることを認識し,脆弱な空間整合タスクからHDRビデオ生成を頑健でアライメントのない協調的特徴ルーティング問題に再構成する長期協調型アテンションフレームワークであるLoCAtionを提案する。この新たな定式化によって、アーキテクチャは、高度に絡み合った再構築タスクを明示的に分離する。周囲のフレームを厳格に歪めるのに苦労する代わりに、連続した中露出バックボーンにシーンを固定し、協調的な注意を生かして動的に収穫し、不整合露光から信頼性のある照射キューを注入する。さらに,学習したグローバルシーケンスソルバを導入する。双方向のコンテキストと長距離時間モデリングを活用することで、全シーケンスにわたって補正信号と構造的特徴を伝播し、本質的にビデオ全体のコヒーレンスを強制しジッタを除去する。大規模な実験により、LoCationは最先端の視覚的品質と時間的安定性を達成し、精度と計算効率の高度に競争力のあるバランスを提供することを示した。

論文の概要: LoCAtion: Long-time Collaborative Attention Framework for High Dynamic Range Video Reconstruction

関連論文リスト