Fugu-MT Paper Translation (Abstract): FARTrack: Fast Autoregressive Visual Tracking with High Performance

Paper summary: FARTrack: Fast Autoregressive Visual Tracking with High Performance

arxiv url: http://arxiv.org/abs/2602.03214v1
Date: Tue, 03 Feb 2026 07:29:36 GMT
Status: translation complete
Last updated in system: 2026-02-04 18:37:15.312066
Title: FARTrack: Fast Autoregressive Visual Tracking with High Performance
Title (reference translation): FARTrack: Fast Autoregressive Visual Tracking with High Performance
Authors: Guijie Wang, Tong Lin, Yifan Bai, Anjia Cao, Shiyi Liang, Wangbo Zhao, Xing Wei
Abstract summary: FARTrack is a fast autoregressive tracking framework. It delivers an AO of 70.6% on GOT-10k in real time. The fastest model reaches 343 FPS on the GPU and 121 FPS on the CPU.
Reference score (attention score computed independently by this site): 17.53171333786429
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Inference speed and tracking performance are two critical evaluation metrics in the field of visual tracking. However, high-performance trackers often suffer from slow processing speeds, making them impractical for deployment on resource-constrained devices. To alleviate this issue, we propose FARTrack, a Fast Auto-Regressive Tracking framework. Since autoregression emphasizes the temporal nature of the trajectory sequence, it can maintain high performance while achieving efficient execution across various devices. FARTrack introduces Task-Specific Self-Distillation and Inter-frame Autoregressive Sparsification, designed from the perspectives of shallow-yet-accurate distillation and redundant-to-essential token optimization, respectively. Task-Specific Self-Distillation achieves model compression by distilling task-specific tokens layer by layer, enhancing the model's inference speed while avoiding suboptimal manual assignment of teacher-student layer pairs. Meanwhile, Inter-frame Autoregressive Sparsification sequentially condenses multiple templates, avoiding additional runtime overhead while learning a temporally global optimal sparsification strategy. FARTrack demonstrates outstanding speed and competitive performance. It delivers an AO of 70.6% on GOT-10k in real time. Moreover, our fastest model achieves a speed of 343 FPS on the GPU and 121 FPS on the CPU.
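Abstract (reference translation): Inference speed and tracking performance are two important evaluation metrics in the field of visual tracking. However, high-performance trackers often suffer from slow processing speeds, which makes them impractical to deploy on resource-constrained devices. To alleviate this problem, we propose FARTrack, a fast autoregressive tracking framework. Because autoregression emphasizes the temporal nature of the trajectory sequence, it can maintain high performance while executing efficiently across a variety of devices. FARTrack introduces Task-Specific Self-Distillation and Inter-frame Autoregressive Sparsification, designed from the perspectives of shallow-yet-accurate distillation and redundant-to-essential token optimization, respectively. Task-Specific Self-Distillation achieves model compression by distilling task-specific tokens layer by layer, which improves inference speed while avoiding suboptimal manual assignment of teacher-student layer pairs. Meanwhile, Inter-frame Autoregressive Sparsification sequentially condenses multiple templates, avoiding additional runtime overhead while learning a temporally global optimal sparsification strategy. FARTrack demonstrates outstanding speed and competitive performance. It delivers an AO of 70.6% on GOT-10k in real time. In addition, the fastest model achieves 343 FPS on the GPU and 121 FPS on the CPU.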
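The AO (average overlap) figure quoted in the abstract is, in general terms, the mean intersection-over-union between predicted and ground-truth boxes across frames. The following is a minimal sketch of that idea only; it is not the FARTrack evaluation code, and the official GOT-10k toolkit may differ in details such as how sequences are aggregated. The `[x, y, w, h]` box layout and the function names `iou` and `average_overlap` are assumptions made for illustration.

```python
from typing import Sequence

# Assumed box layout for this sketch: [x, y, w, h] with (x, y) the top-left corner.
Box = Sequence[float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ax1, ay1, aw, ah = a
    bx1, by1, bw, bh = b
    ax2, ay2 = ax1 + aw, ay1 + ah
    bx2, by2 = bx1 + bw, by1 + bh
    # Width and height of the overlap region, clamped at zero when disjoint.
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def average_overlap(preds: Sequence[Box], gts: Sequence[Box]) -> float:
    """Mean IoU over paired per-frame predictions and ground-truth boxes."""
    if len(preds) != len(gts):
        raise ValueError("preds and gts must have the same number of frames")
    if not preds:
        return 0.0
    return sum(iou(p, g) for p, g in zip(preds, gts)) / len(preds)
```

For example, `average_overlap([[0, 0, 2, 2], [0, 0, 2, 2]], [[0, 0, 2, 2], [1, 1, 2, 2]])` averages one perfect match (IoU 1) with one partial overlap (IoU 1/7).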

Related papers