Fugu-MT 論文翻訳(概要): Learning to Track Objects from Unlabeled Videos

論文の概要: Learning to Track Objects from Unlabeled Videos

arxiv url: http://arxiv.org/abs/2108.12711v1
Date: Sat, 28 Aug 2021 22:10:06 GMT
ステータス: 翻訳完了
システム内更新日: 2021-08-31 14:42:18.638494
Title: Learning to Track Objects from Unlabeled Videos
Title（参考訳）: ラベルのないビデオからオブジェクトを追跡する学習
Authors: Jilai Zheng, Chao Ma, Houwen Peng and Xiaokang Yang
Abstract要約: 本稿では,Unsupervised Single Object Tracker (USOT) をスクラッチから学習することを提案する。教師なしトラッカーと教師なしトラッカーのギャップを狭めるために,3段階からなる効果的な教師なし学習手法を提案する。実験の結果、未ラベルのビデオから得られたUSOTは、最先端の教師なしトラッカーよりも大きなマージンでうまく機能していることがわかった。
参考スコア（独自算出の注目度）: 63.149201681380305
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this paper, we propose to learn an Unsupervised Single Object Tracker (USOT) from scratch. We identify that three major challenges, i.e., moving object discovery, rich temporal variation exploitation, and online update, are the central causes of the performance bottleneck of existing unsupervised trackers. To narrow the gap between unsupervised trackers and supervised counterparts, we propose an effective unsupervised learning approach composed of three stages. First, we sample sequentially moving objects with unsupervised optical flow and dynamic programming, instead of random cropping. Second, we train a naive Siamese tracker from scratch using single-frame pairs. Third, we continue training the tracker with a novel cycle memory learning scheme, which is conducted in longer temporal spans and also enables our tracker to update online. Extensive experiments show that the proposed USOT learned from unlabeled videos performs well over the state-of-the-art unsupervised trackers by large margins, and on par with recent supervised deep trackers. Code is available at https://github.com/VISION-SJTU/USOT.
Abstract（参考訳）: 本稿では,Unsupervised Single Object Tracker (USOT) をスクラッチから学習することを提案する。我々は、オブジェクト発見、リッチな時間的変動回避、オンライン更新という3つの大きな課題が、既存の教師なしトラッカーのパフォーマンスボトルネックの中心的な原因であると特定した。教師なしトラッカと教師なしトラッカのギャップを狭めるため,3段階からなる効果的な教師なし学習手法を提案する。まず、ランダムな切り抜きではなく、教師なしの光フローと動的プログラミングで順次動くオブジェクトをサンプリングする。第2に、単一フレームペアを使用して、ネイティブなSiameseトラッカーをスクラッチからトレーニングする。第3に,トラッカを長い時間スパンで実行し,オンライン更新を可能にする,新たなサイクルメモリ学習スキームによるトレーニングを継続する。広範囲にわたる実験の結果、unlabeledビデオから得られたusotは、最先端のunsupervisedトラッカーよりも大きなマージンで、そして最近のsupervised deep trackerと同等の性能を発揮していることがわかった。コードはhttps://github.com/VISION-SJTU/USOTで入手できる。

論文の概要: Learning to Track Objects from Unlabeled Videos

関連論文リスト