Fugu-MT 論文翻訳(概要): Closed-Loop Transfer for Weakly-supervised Affordance Grounding

論文の概要: Closed-Loop Transfer for Weakly-supervised Affordance Grounding

arxiv url: http://arxiv.org/abs/2510.17384v1
Date: Mon, 20 Oct 2025 10:21:35 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 00:56:39.405873
Title: Closed-Loop Transfer for Weakly-supervised Affordance Grounding
Title（参考訳）: 弱教師付き集合住宅における閉ループ移動
Authors: Jiajin Tang, Zhengxuan Wei, Ge Zheng, Sibei Yang,
Abstract要約: LoopTransは、知識をエキソセントリックからエゴセントリックに移行する、新しいクローズドループフレームワークである。 LoopTrans内では、クロスモーダルな局所化や知識蒸留の認知など、いくつかの革新的なメカニズムが導入されている。実験によると、LoopTransは画像とビデオのベンチマークですべてのメトリクスで一貫した改善を実現している。
参考スコア（独自算出の注目度）: 35.34120640245943
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Humans can perform previously unexperienced interactions with novel objects simply by observing others engage with them. Weakly-supervised affordance grounding mimics this process by learning to locate object regions that enable actions on egocentric images, using exocentric interaction images with image-level annotations. However, extracting affordance knowledge solely from exocentric images and transferring it one-way to egocentric images limits the applicability of previous works in complex interaction scenarios. Instead, this study introduces LoopTrans, a novel closed-loop framework that not only transfers knowledge from exocentric to egocentric but also transfers back to enhance exocentric knowledge extraction. Within LoopTrans, several innovative mechanisms are introduced, including unified cross-modal localization and denoising knowledge distillation, to bridge domain gaps between object-centered egocentric and interaction-centered exocentric images while enhancing knowledge transfer. Experiments show that LoopTrans achieves consistent improvements across all metrics on image and video benchmarks, even handling challenging scenarios where object interaction regions are fully occluded by the human body.
Abstract（参考訳）: 人間は、他者との関わりを観察するだけで、新しい物体との未経験の相互作用を行うことができる。弱教師付きアベイランスグラウンドディングはこのプロセスを模倣し、画像レベルのアノテーションとエゴセントリックな相互作用画像を用いて、エゴセントリックなイメージ上のアクションを可能にするオブジェクト領域を見つけることを学習する。しかしながら、エキソセントリックな画像のみから手頃な知識を抽出し、一方的にエゴセントリックな画像に移すことは、複雑な相互作用シナリオにおける過去の研究の適用性を制限している。この研究は、新しいクローズドループフレームワークであるLoopTransを紹介した。このフレームワークは、知識をエゴセントリックからエゴセントリックに転送するだけでなく、エゴセントリックな知識抽出を強化するために再転送する。 LoopTrans内では、クロスモーダルなローカライゼーションとデノナイズドナレッジ蒸留を含むいくつかの革新的なメカニズムを導入し、知識伝達を高めながら、オブジェクト中心のエゴセントリック画像と相互作用中心のエゴセントリック画像の間の領域ギャップを橋渡しする。実験によると、LoopTransは画像とビデオのベンチマークのすべての指標で一貫した改善を実現している。

論文の概要: Closed-Loop Transfer for Weakly-supervised Affordance Grounding

関連論文リスト