Fugu-MT 論文翻訳(概要): FSMC-Pose: Frequency and Spatial Fusion with Multiscale Self-calibration for Cattle Mounting Pose Estimation

論文の概要: FSMC-Pose: Frequency and Spatial Fusion with Multiscale Self-calibration for Cattle Mounting Pose Estimation

arxiv url: http://arxiv.org/abs/2603.16596v1
Date: Tue, 17 Mar 2026 14:42:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.347355
Title: FSMC-Pose: Frequency and Spatial Fusion with Multiscale Self-calibration for Cattle Mounting Pose Estimation
Title（参考訳）: FSMC-Pose:マルチスケール自己校正による牛馬場推定のための周波数・空間融合
Authors: Fangjing Li, Zhihai Wang, Xinxin Ding, Haiyang Liu, Ronghua Gao, Rong Wang, Yao Zhu, Ming Jin,
Abstract要約: 乗馬姿勢は乳牛のエストロスを視覚的に表す重要な指標である。本稿では,軽量な周波数空間バックボーンであるCattleMountNetと,大規模自己校正ヘッドであるSC2Headを統合したFSMC-Poseを提案する。 FSMC-Poseは複雑な環境下での牛の姿勢を効果的に把握し,推定する。
参考スコア（独自算出の注目度）: 27.324966368385773
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Mounting posture is an important visual indicator of estrus in dairy cattle. However, achieving reliable mounting pose estimation in real-world environments remains challenging due to cluttered backgrounds and frequent inter-animal occlusion. We present FSMC-Pose, a top-down framework that integrates a lightweight frequency-spatial fusion backbone, CattleMountNet, and a multiscale self-calibration head, SC2Head. Specifically, we design two algorithmic components for CattleMountNet: the Spatial Frequency Enhancement Block (SFEBlock) and the Receptive Aggregation Block (RABlock). SFEBlock separates cattle from cluttered backgrounds, while RABlock captures multiscale contextual information. The Spatial-Channel Self-Calibration Head (SC2Head) attends to spatial and channel dependencies and introduces a self-calibration branch to mitigate structural misalignment under inter-animal overlap. We construct a mounting dataset, MOUNT-Cattle, covering 1176 mounting instances, which follows the COCO format and supports drop-in training across pose estimation models. Using a comprehensive dataset that combines MOUNT-Cattle with the public NWAFU-Cattle dataset, FSMC-Pose achieves higher accuracy than strong baselines, with markedly lower computational and parameter costs, while maintaining real-time inference on commodity GPUs. Extensive experiments and qualitative analyses show that FSMC-Pose effectively captures and estimates cattle mounting pose in complex and cluttered environments. Dataset and code are available at https://github.com/elianafang/FSMC-Pose.
Abstract（参考訳）: 乗馬姿勢は乳牛のエストロスを視覚的に表す重要な指標である。しかし, 背景が散らばり, 動物間閉塞が頻発しているため, 実世界の環境において, 信頼性の高いポーズ推定を実現することは依然として困難である。本稿では,軽量な周波数空間融合バックボーンであるCattleMountNetと,大規模自己校正ヘッドであるSC2Headを統合したトップダウンフレームワークFSMC-Poseを紹介する。具体的には、CattleMountNetのための2つのアルゴリズムコンポーネント、空間周波数拡張ブロック(SFEBlock)と受容集約ブロック(RABlock)を設計する。 SFEBlockは牛を乱雑な背景から切り離し、RABlockはマルチスケールのコンテキスト情報をキャプチャする。 Space-Channel Self-Calibration Head (SC2Head) は、空間的およびチャネル的依存関係に参画し、アニマル間重なりの下で構造的不整合を軽減する自己校正分岐を導入する。我々は、COCOフォーマットに従い、ポーズ推定モデル全体でドロップイントレーニングをサポートする1176インスタンスをカバーする実装データセットMOUNT-Cattleを構築した。 MOUNT-CattleとパブリックなNWAFU-Cattleデータセットを組み合わせた包括的なデータセットを使用することで、FSMC-Poseは強力なベースラインよりも高い精度を実現し、計算コストとパラメータコストを著しく低減し、コモディティGPUのリアルタイム推論を維持している。大規模な実験と定性的分析により、FSMC-Poseは複雑で散在した環境下での牛の装着効果を効果的に捉え、推定することを示した。データセットとコードはhttps://github.com/elianafang/FSMC-Pose.comで入手できる。

論文の概要: FSMC-Pose: Frequency and Spatial Fusion with Multiscale Self-calibration for Cattle Mounting Pose Estimation

関連論文リスト