Fugu-MT 論文翻訳(概要): SAMOFT: Robust Multi-Object Tracking via Region and Flow

論文の概要: SAMOFT: Robust Multi-Object Tracking via Region and Flow

arxiv url: http://arxiv.org/abs/2605.09417v1
Date: Sun, 10 May 2026 08:39:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.23768
Title: SAMOFT: Robust Multi-Object Tracking via Region and Flow
Title（参考訳）: SAMOFT: 領域とフローによるロバストなマルチオブジェクト追跡
Authors: Yanchao Wang, Dawei Zhang, Chengzhuan Yang, Wei Liu, Minglu Li, Hua Wang, Zhonglong Zheng, Ming-Hsuan Yang,
Abstract要約: マルチオブジェクトトラッキング(MOT)は、コンピュータビジョンにおける基本的なタスクであり、複数のターゲットを継続的に追跡する必要がある。既存のアプローチのほとんどは、軌道関連のためのインスタンスレベルのオブジェクト機能に依存しています。我々は,複雑な動作シナリオ下でのロバスト性向上のために,画素レベルのキューを活用するロバストトラッカーSAMOFTを提案する。
参考スコア（独自算出の注目度）: 42.6206516982406
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multi-object tracking (MOT) is a fundamental task in computer vision that requires continuously tracking multiple targets while maintaining consistent identities across frames. However, most existing approaches primarily rely on instance-level object features for trajectory association, which often leads to degraded performance under challenging conditions such as object deformation, nonlinear motion, and occlusion. In this work, we propose SAMOFT, a robust tracker that leverages pixel-level cues to improve robustness under complex motion scenarios. Specifically, we introduce a Pixel Motion Matching (PMM) module that integrates the Segment Anything Model (SAM) with dense optical flow to refine Kalman filter-based motion prediction using instantaneous foreground pixel motion. To further enhance robustness under unreliable detections, we design a Centroid Distance Matching (CDM) module that performs flexible mask-based centroid matching for low-confidence or partially occluded observations. Moreover, a Distribution-Based Correction (DBC) module models long-tailed motion patterns in a training-free manner using historical optical flow statistics and dynamically corrects trajectory states online. We also incorporate a Cluster-Aware ReID (CA-ReID) strategy to improve the stability and discriminative power of trajectory appearance features. Extensive experiments on the DanceTrack and MOTChallenge benchmarks demonstrate that SAMOFT consistently improves baseline trackers and achieves competitive performance compared with recent state-of-the-art methods, validating the effectiveness of leveraging pixel-level cues for robust multi-object tracking.
Abstract（参考訳）: マルチオブジェクト追跡(MOT)はコンピュータビジョンにおける基本的なタスクであり、フレーム間の一貫したアイデンティティを維持しながら、複数のターゲットを継続的に追跡する必要がある。しかし、既存のほとんどのアプローチは、主に軌道アソシエーションのインスタンスレベルのオブジェクト特徴に依存しており、しばしばオブジェクトの変形、非線形運動、閉塞といった困難な条件下での劣化性能をもたらす。本研究では,複雑な動作シナリオ下でのロバスト性向上のために画素レベルのキューを活用するロバストトラッカーSAMOFTを提案する。具体的には,Segment Anything Model (SAM) と高密度光流を統合したPixel Motion Matching (PMM) モジュールを導入する。信頼性の低い検出において、より堅牢性を高めるために、低信頼または部分的に排除された観察のためにフレキシブルマスクベースのセントロイドマッチングを実行するCentroid Distance Matching (CDM) モジュールを設計する。さらに、分散ベース補正(DBC)モジュールは、歴史的光学的フロー統計を用いて、トレーニング不要な長い尾運動パターンをモデル化し、オンラインで軌道状態の動的修正を行う。また,CA-ReID(Cluster-Aware ReID)戦略を取り入れ,軌道外観特徴の安定性と識別力を向上させる。 DanceTrackとMOTChallengeベンチマークの大規模な実験により、SAMOFTはベースライントラッカーを一貫して改善し、最近の最先端手法と比較して競争性能を向上し、堅牢なマルチオブジェクトトラッキングにピクセルレベルのキューを活用することの有効性を検証した。

論文の概要: SAMOFT: Robust Multi-Object Tracking via Region and Flow

関連論文リスト