Fugu-MT 論文翻訳(概要): Ultra-Fast Neural Video Compression

論文の概要: Ultra-Fast Neural Video Compression

arxiv url: http://arxiv.org/abs/2606.04410v1
Date: Wed, 03 Jun 2026 03:38:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-04 20:44:18.518218
Title: Ultra-Fast Neural Video Compression
Title（参考訳）: 超高速ニューラルビデオ圧縮
Authors: Jiahao Li, Wenxuan Xie, Zhaoyang Jia, Bin Li, Zongyu Guo, Xiaoyi Zhang, Yan Lu,
Abstract要約: 本稿では,レート・歪み・複雑さのトレードオフを大幅に改善するために,チャンクベースのコーディングフレームワークを提案する。フレームを逐次処理する代わりに、複数のフレームの断片を1つのコンパクトな潜在表現に符号化し、同時にデコードする。これらのイノベーションに基づいて、我々は、新しいSOTAのパフォーマンスを規定する新しいNVCであるDCVC-UFを紹介します。
参考スコア（独自算出の注目度）: 29.065189275652944
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While neural video codecs (NVCs) have demonstrated superior compression ratio, their prohibitive computational complexity remains a critical barrier to real-world deployment. This paper introduces a chunk-based coding framework designed to significantly improve the rate-distortion-complexity trade-off. Instead of processing frames sequentially, our approach encodes a chunk of multiple frames into a single compact latent representation and decodes them simultaneously. This is enabled by cross-frame interaction modules for joint spatial-temporal modeling and frame-specific decoders for parallel reconstruction. This paradigm not only dramatically enhances coding throughput but also facilitates more effective modeling of long-term temporal correlations. To further boost speed, we propose a streamlined entropy coding mechanism that consolidates bit-stream interactions into a single step, substantially reducing decoding overhead. Building on these innovations, we present DCVC-UF (Ultra-Fast), a new NVC that sets a new SOTA in performance. Our experiments show that DCVC-UF can achieve ultra-fast encoding and decoding speeds, significantly outperforming previous leading codecs. DCVC-UF serves as a notable landmark in the journey of NVC evolution. The code is at https://github.com/microsoft/DCVC.
Abstract（参考訳）: ニューラルビデオコーデック(NVC)はより優れた圧縮比を示してきたが、その禁止された計算複雑性は、現実世界のデプロイメントにとって重要な障壁である。本稿では,レート・歪み・複雑さのトレードオフを大幅に改善するために,チャンクベースのコーディングフレームワークを提案する。フレームを逐次処理する代わりに、複数のフレームの断片を1つのコンパクトな潜在表現に符号化し、同時にデコードする。これは、共同空間時間モデリングのためのクロスフレーム相互作用モジュールと、並列再構築のためのフレーム固有デコーダによって実現されている。このパラダイムは、コーディングスループットを劇的に向上するだけでなく、長期的な時間的相関のより効率的なモデリングを促進する。さらに高速化するために,ビットストリームインタラクションを単一ステップに統合し,デコードオーバーヘッドを大幅に低減するエントロピー符号化機構を提案する。これらのイノベーションに基づいて、我々は、新しいSOTAのパフォーマンスを設定できる新しいNVCであるDCVC-UF(Ultra-Fast)を紹介します。実験の結果,DCVC-UFは超高速な符号化および復号化が可能であり,従来のコーデックよりも優れていた。 DCVC-UFは、NVC進化の旅で注目すべきランドマークとなっている。コードはhttps://github.com/microsoft/DCVCにある。

論文の概要: Ultra-Fast Neural Video Compression

関連論文リスト