Fugu-MT 論文翻訳(概要): Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding

論文の概要: Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding

arxiv url: http://arxiv.org/abs/2606.01701v1
Date: Mon, 01 Jun 2026 05:11:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:31.383094
Title: Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding
Title（参考訳）: 垂直ビデオ符号化における時空間相関ガイドによる幾何学的分割
Authors: Xuewei Meng, Chuanmin Jia, Xinfeng Zhang, Shanshe Wang, Siwei Ma,
Abstract要約: 映像符号化の運動場における物体情報信号を記述するためのガイド付き幾何分割法(STGEO)を提案する。提案手法は,分割モードや移動情報を含む側情報に対して消費されるビットをエコノマイズすることができる。主なアイデアは、選択可能性の高いSTGEOモードと運動候補を予測することである。
参考スコア（独自算出の注目度）: 55.405068056883614
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Geometric partitioning has attracted increasing attention by its remarkable motion field description capability in the hybrid video coding framework. However, the existing geometric partitioning (GEO) scheme in Versatile Video Coding (VVC) causes a non-negligible burden for signaling the side information. Consequently, the coding efficiency is limited. In view of this, we propose a spatio-temporal correlation guided geometric partitioning (STGEO) scheme to efficiently describe the object information in the motion field of video coding. The proposed method can economize the bits consumed for side information signaling, including the partitioning mode and motion information. We firstly analyze the characteristics of partitioning mode decision and motion vector selection in a statistically-sound way. Based on the observed spatio-temporal correlation, we design a mode prediction and coding method to reduce the overhead for representing the above mentioned side information. The main idea is to predict the STGEO modes and motion candidates that have higher selection possibilities, which can guide the entropy coding, i.e., representing the predicted high-probability modes and motion candidates with fewer bits. In particular, the high-probability STGEO modes are predicted based on the edge information and history modes of adjacent STGEO-coded blocks. The corresponding motion information is represented by the index in a merge candidate list, which is adaptively inferred based on the off-line trained merge candidate selection probability. Simulation results show that the proposed approach achieves 0.95% and 1.98% bit-rate savings on average compared to VTM-8.0 without GEO for Random Access and Low-Delay B configurations, respectively.
Abstract（参考訳）: ハイブリッドビデオ符号化フレームワークにおける運動場記述機能により,幾何学的分割が注目されている。しかし、Versatile Video Coding (VVC) における既存の幾何分割(GEO)方式は、側情報を信号する上で、無視できない負担を生じさせる。これにより、符号化効率が制限される。そこで本研究では,映像符号化の運動場におけるオブジェクト情報を効率的に記述するための時空間相関型幾何分割法(STGEO)を提案する。提案手法は,分割モードや移動情報を含む側情報信号に使用されるビットをエコノマイズすることができる。まず,分割モード決定と移動ベクトル選択の特徴を統計的に解析する。観測された時空間相関に基づいて、上記の側情報を表すオーバーヘッドを低減するモード予測および符号化法を設計する。主な考え方は、高い選択可能性を持つSTGEOモードと運動候補を予測することであり、これはエントロピー符号化、すなわち予測される高確率モードと少ないビットの運動候補を表現することができる。特に、隣接するSTGEO符号化ブロックのエッジ情報と履歴モードに基づいて、高確率STGEOモードを予測する。対応する動作情報は、オフライン訓練されたマージ候補選択確率に基づいて適応的に推測されるマージ候補リストにおいてインデックスで表現される。シミュレーションの結果,提案手法は平均0.95%と1.98%のビットレートを,GEOを使わずにGEOを使わずに,それぞれVTM-8.0と比較した。

論文の概要: Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding

関連論文リスト