Fugu-MT paper translation (abstract): HODN: Disentangling Human-Object Feature for HOI Detection

Paper summary: HODN: Disentangling Human-Object Feature for HOI Detection

arxiv url: http://arxiv.org/abs/2308.10158v2
Date: Thu, 7 Dec 2023 08:04:48 GMT
Status: Translation complete
Last updated in system: 2023-12-08 18:38:46.271931
Title: HODN: Disentangling Human-Object Feature for HOI Detection
Title (reference translation): HODN: Disentangling Human-Object Features for HOI Detection
Authors: Shuman Fang, Zhiwen Lin, Ke Yan, Jie Li, Xianming Lin, Rongrong Ji
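Abstract summary: This paper proposes the Human and Object Disentangling Network (HODN), which explicitly models Human-Object Interaction (HOI) relationships. Considering that human features contribute more to interactions, it proposes a Human-Guide Linking method to make sure the interaction decoder focuses on human-centric regions. The proposed method achieves competitive performance on the V-COCO and HICO-Det datasets.
Reference score (independently computed attention score): 51.48164941412871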
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The task of Human-Object Interaction (HOI) detection is to detect humans and their interactions with surrounding objects, where transformer-based methods show dominant advances currently. However, these methods ignore the relationship among humans, objects, and interactions: 1) human features are more contributive than object ones to interaction prediction; 2) interactive information disturbs the detection of objects but helps human detection. In this paper, we propose a Human and Object Disentangling Network (HODN) to model the HOI relationships explicitly, where humans and objects are first detected by two disentangling decoders independently and then processed by an interaction decoder. Considering that human features are more contributive to interaction, we propose a Human-Guide Linking method to make sure the interaction decoder focuses on the human-centric regions with human features as the positional embeddings. To handle the opposite influences of interactions on humans and objects, we propose a Stop-Gradient Mechanism to stop interaction gradients from optimizing the object detection but to allow them to optimize the human detection. Our proposed method achieves competitive performance on both the V-COCO and the HICO-Det datasets. It can be combined with existing methods easily for state-of-the-art results.
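Abstract (reference translation): The task of human-object interaction (HOI) detection is to detect humans and their interactions with surrounding objects. However, current transformer-based methods ignore the relationships among humans, objects, and interactions: 1) human features contribute more to interaction prediction than object features; 2) interaction information disturbs object detection but helps human detection. This paper proposes the Human and Object Disentangling Network (HODN), which explicitly models HOI relationships. Considering that human features contribute more to interactions, we propose a Human-Guide Linking method that focuses the interaction decoder on human-centric regions using human features. To handle the opposite influences of interactions on humans and objects, we propose a Stop-Gradient Mechanism that stops interaction gradients from optimizing object detection while allowing them to optimize human detection. The proposed method achieves competitive performance on the V-COCO and HICO-Det datasets. It can easily be combined with existing methods for state-of-the-art results.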
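
The Stop-Gradient Mechanism described in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: HODN uses transformer decoders, whereas this example uses scalars and a tiny hand-written autodiff class. The names `Value`, `toy_gradients`, `w_h`, `w_o`, and `w_i` are hypothetical and chosen only for illustration. The single idea it tries to show is that the interaction loss may update the human branch but not the object branch, while the object branch still learns from its own detection loss.

```python
class Value:
    """Tiny scalar reverse-mode autodiff node (hypothetical helper, illustration only)."""

    def __init__(self, data, parents=(), local_grads=()):
        self.data = float(data)
        self.grad = 0.0
        self._parents = parents          # nodes this value was computed from
        self._local_grads = local_grads  # d(self)/d(parent), one entry per parent

    def __add__(self, other):
        return Value(self.data + other.data, (self, other), (1.0, 1.0))

    def __mul__(self, other):
        return Value(self.data * other.data, (self, other), (other.data, self.data))

    def detach(self):
        """Stop-gradient: keep the value, drop the link to the graph."""
        return Value(self.data)

    def backward(self):
        # Post-order DFS gives parents before children; reverse it for backprop.
        order, seen = [], set()

        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            for parent, local in zip(node._parents, node._local_grads):
                parent.grad += local * node.grad


def toy_gradients(stop_object_gradient):
    """Return gradients of (interaction loss + object loss) for the toy weights."""
    x = Value(2.0)                                        # toy stand-in for image features
    w_h, w_o, w_i = Value(0.5), Value(-1.0), Value(1.5)   # assumed toy weights
    human = w_h * x                                       # "human branch" output
    obj = w_o * x                                         # "object branch" output

    # Assumption: the interaction branch consumes both features, but the
    # object feature is detached so interaction gradients cannot reach w_o.
    obj_for_interaction = obj.detach() if stop_object_gradient else obj
    interaction = w_i * (human + obj_for_interaction)

    interaction_loss = interaction * interaction          # toy squared loss
    object_loss = obj * obj                                # toy detection loss
    (interaction_loss + object_loss).backward()
    return {"w_h": w_h.grad, "w_o": w_o.grad, "w_i": w_i.grad}


if __name__ == "__main__":
    print(toy_gradients(stop_object_gradient=True))
    print(toy_gradients(stop_object_gradient=False))
```

With the stop-gradient enabled, `w_o` receives a gradient of -8.0, which comes from the object loss alone. With it disabled, the interaction loss contributes as well and the gradient becomes -17.0. The gradient for `w_h` is -9.0 in both cases, since the human branch is always allowed to learn from interactions. In a deep-learning framework the same role would typically be played by a built-in operation such as `Tensor.detach()` in PyTorch or `jax.lax.stop_gradient` in JAX.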

Related papers

Prototype Embedding Optimization for Human-Object Interaction Detection in Livestreaming [14.838579323779914]
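We propose Prototype Embedding Optimization for Human-Object Interaction detection (PeO-HOI). Prototype embedding optimization is adopted to mitigate the effect of object bias on HOI. The results show the proposed method's accuracy to be 37.19%@full, 51.42%@non-rare, and 26.20%@rare.
Reference translation of paper (metadata) (2025-05-28T06:19:37Z)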
Visual-Geometric Collaborative Guidance for Affordance Learning [63.038406948791454]
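This paper proposes a visual-geometric collaborative guided affordance learning network that incorporates visual and geometric cues. The method outperforms representative models in objective metrics and visual quality.
Reference translation of paper (metadata) (2024-10-15T07:35:51Z)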
Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection [70.96299509159981]
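Human-Object Interaction (HOI) detection is a core task of human-centric image understanding. Recent one-stage methods adopt a transformer decoder to collect image-wide cues that are useful for interaction prediction. Traditional two-stage methods benefit significantly from their ability to compose interaction features in a disentangled and explainable manner.
Reference translation of paper (metadata) (2023-12-04T08:02:59Z)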
Learn to Predict How Humans Manipulate Large-sized Objects from Interactive Motions [82.90906153293585]
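This paper proposes HO-GCN, a graph neural network that fuses motion data with dynamic descriptors. We show that the network consuming the dynamic descriptors achieves state-of-the-art prediction results and helps the network generalize to unseen objects.
Reference translation of paper (metadata) (2022-06-25T09:55:39Z)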
Detecting Human-to-Human-or-Object (H2O) Interactions with DIABOLO [29.0200561485714]
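We propose a new interaction dataset that deals with two types of interaction, Human-to-Human-or-Object (H2O). In addition, we introduce a new taxonomy of verbs, intended to be closer to a description of human body attitude in relation to the surrounding targets of interaction. We also propose DIABOLO, an efficient subject-centric single-shot method that detects all interactions in one forward pass.
Reference translation of paper (metadata) (2022-01-07T11:00:11Z)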
GTNet:Guided Transformer Network for Detecting Human-Object Interactions [10.809778265707916]
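The human-object interaction (HOI) detection task is to localize humans, localize objects, and predict the interaction between each human-object pair. To detect HOIs, it is important to exploit relative spatial configurations and object semantics to locate the relevant spatial regions of the image. This problem is addressed by GTNet, a self-attention-based guided transformer network.
Reference translation of paper (metadata) (2021-08-02T02:06:33Z)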
Human Object Interaction Detection using Two-Direction Spatial Enhancement and Exclusive Object Prior [28.99655101929647]
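Human-Object Interaction (HOI) detection aims to detect visual relationships between humans and objects in images. Non-interactive human-object pairs are easily mis-grouped and classified as an action. This paper proposes a spatial enhancement approach that strengthens spatial constraints from two directions.
Reference translation of paper (metadata) (2021-05-07T07:18:27Z)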
HOTR: End-to-End Human-Object Interaction Detection with Transformers [26.664864824357164]
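We propose HOTR, a novel framework that directly predicts a set of <human, object, interaction> triplets from an image. The proposed algorithm achieves state-of-the-art performance on two HOI detection benchmarks, with an inference time under 1 ms after object detection.
Reference translation of paper (metadata) (2021-04-28T10:10:29Z)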
DRG: Dual Relation Graph for Human-Object Interaction Detection [65.50707710054141]
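We tackle the problem of human-object interaction (HOI) detection. Existing methods either recognize the interaction of each human-object pair in isolation or perform joint inference based on complex appearance-based features. This paper leverages an abstract spatial-semantic representation to describe each human-object pair and aggregates the contextual information of the scene via a dual relation graph.
Reference translation of paper (metadata) (2020-08-26T17:59:40Z)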
Learning Human-Object Interaction Detection using Interaction Points [140.0200950601552]
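We propose a novel fully convolutional approach that directly detects the interactions between humans and objects. Our network predicts interaction points, which directly localize and classify the interaction. Experiments are performed on two popular benchmarks, V-COCO and HICO-DET.
Reference translation of paper (metadata) (2020-03-31T08:42:06Z)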

The related-papers list is generated automatically from the titles and abstracts of papers on this site.

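The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any results arising from the use of this site (including all information and translations).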