Fugu-MT Paper Translation (Abstract): Robust Promptable Video Object Segmentation

Paper Overview: Robust Promptable Video Object Segmentation

arxiv url: http://arxiv.org/abs/2605.12006v1
Date: Tue, 12 May 2026 11:55:31 GMT
Status: Translation complete
Last updated in system: 2026-05-13 21:48:56.834996
Title: Robust Promptable Video Object Segmentation
Title (reference translation): Robust Promptable Video Object Segmentation
Authors: Sohyun Lee, Yeho Gwon, Lukas Hoyer, Konrad Schindler, Christos Sakaridis, Suha Kwak
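Abstract summary: This paper presents a comprehensive study of robust PVOS (RobustPVOS). It first constructs a new, comprehensive benchmark with two real-world evaluation datasets comprising 351 video clips and more than 2,500 object masks. It also proposes a new RobustPVOS method called Memory-object-conditioned Gated-rank Adaptation (MoGA).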
Reference score (independently computed attention measure): 67.1533741758339
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The performance of promptable video object segmentation (PVOS) models substantially degrades under input corruptions, which prevents PVOS deployment in safety-critical domains. This paper offers the first comprehensive study on robust PVOS (RobustPVOS). We first construct a new, comprehensive benchmark with two real-world evaluation datasets of 351 video clips and more than 2,500 object masks under real-world adverse conditions. At the same time, we generate synthetic training data by applying diverse and temporally varying corruptions to existing VOS datasets. Moreover, we present a new RobustPVOS method, dubbed Memory-object-conditioned Gated-rank Adaptation (MoGA). The key to successfully performing RobustPVOS is two-fold: effectively handling object-specific degradation and ensuring temporal consistency in predictions. MoGA leverages object-specific representations maintained in memory across frames to condition the robustification process, which allows the model to handle each tracked object differently in a temporally consistent way. Extensive experiments on our benchmark validate MoGA's efficacy, showing consistent and significant improvements across diverse corruption types on both synthetic and real-world datasets, establishing a strong baseline for future RobustPVOS research. Our benchmark is publicly available at https://sohyun-l.github.io/RobustPVOS_project_page/.
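Abstract (reference translation): The performance of promptable video object segmentation (PVOS) models degrades substantially under input corruptions, which hinders the deployment of PVOS in safety-critical domains. This paper conducts a comprehensive study on robust PVOS (RobustPVOS). First, we construct a new, comprehensive benchmark using two real-world evaluation datasets of 351 video clips and more than 2,500 object masks captured under real-world adverse conditions. At the same time, we generate synthetic training data by applying diverse and temporally varying corruptions to existing VOS datasets. Furthermore, we propose a new RobustPVOS method called Memory-object-conditioned Gated-rank Adaptation (MoGA). The key to successful RobustPVOS is two-fold: effectively handling object-specific degradation and ensuring temporal consistency in predictions. MoGA leverages object-specific representations held in memory to condition the robustification process, which allows each tracked object to be handled differently in a temporally consistent way. Extensive experiments on our benchmark validate MoGA's efficacy, showing consistent and significant improvements across diverse corruption types on both synthetic and real-world datasets, and establish a strong baseline for future RobustPVOS research. Our benchmark is publicly available at https://sohyun-l.github.io/RobustPVOS_project_page/.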

Related Papers