Chairs Can be Stood on: Overcoming Object Bias in Human-Object
  Interaction Detection
- URL: http://arxiv.org/abs/2207.02400v1
- Date: Wed, 6 Jul 2022 01:55:28 GMT
- Title: Chairs Can be Stood on: Overcoming Object Bias in Human-Object
  Interaction Detection
- Authors: Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli
- Abstract summary: Detecting Human-Object Interaction (HOI) in images is an important step towards high-level visual comprehension.
We propose a novel plug-and-play Object-wise Debiasing Memory (ODM) method for re-balancing the distribution of interactions under detected objects.
Our method brings consistent and significant improvements over baselines, especially on rare interactions under each object.
- Score: 22.3445174577181
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract:   Detecting Human-Object Interaction (HOI) in images is an important step
towards high-level visual comprehension. Existing work often focuses on
improving either human and object detection, or interaction recognition.
However, due to the limitation of datasets, these methods tend to fit well on
frequent interactions conditioned on the detected objects, yet largely ignore
the rare ones, which is referred to as the object bias problem in this paper.
In this work, we, for the first time, uncover the problem from two aspects:
unbalanced interaction distribution and biased model learning. To overcome the
object bias problem, we propose a novel plug-and-play Object-wise Debiasing
Memory (ODM) method for re-balancing the distribution of interactions under
detected objects. Equipped with carefully designed read and write strategies,
the proposed ODM allows rare interaction instances to be more frequently
sampled for training, thereby alleviating the object bias induced by the
unbalanced interaction distribution. We apply this method to three advanced
baselines and conduct experiments on the HICO-DET and HOI-COCO datasets. To
quantitatively study the object bias problem, we advocate a new protocol for
evaluating model performance. As demonstrated in the experimental results, our
method brings consistent and significant improvements over baselines,
especially on rare interactions under each object. In addition, when evaluating
under the conventional standard setting, our method achieves new
state-of-the-art on the two benchmarks.
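
The abstract does not spell out how ODM's read and write strategies work, so the following is only a minimal sketch of the general idea of object-wise re-balancing, not the authors' method. It assumes a hypothetical `ObjectWiseMemory` class that stores a few instances per (object, interaction) pair on write and, on read, samples interactions under an object with probability inversely proportional to how often they were seen. The class name, the slot size, and the inverse-frequency weighting are all illustrative assumptions.

```python
import random
from collections import defaultdict, deque


class ObjectWiseMemory:
    """Toy per-object memory (hypothetical; NOT the paper's ODM)."""

    def __init__(self, slots_per_interaction=4, seed=0):
        # memory[obj][interaction] -> bounded queue of stored instances
        self.memory = defaultdict(
            lambda: defaultdict(lambda: deque(maxlen=slots_per_interaction))
        )
        # counts[obj][interaction] -> how often this pair has been written
        self.counts = defaultdict(lambda: defaultdict(int))
        self.rng = random.Random(seed)

    def write(self, obj, interaction, instance):
        """Record one training instance; oldest entries are evicted."""
        self.counts[obj][interaction] += 1
        self.memory[obj][interaction].append(instance)

    def read(self, obj, k=1):
        """Sample k (interaction, instance) pairs under `obj`,
        weighting each interaction by 1 / its observed count
        (an assumed re-balancing rule)."""
        interactions = list(self.memory[obj])
        if not interactions:
            return []
        weights = [1.0 / self.counts[obj][i] for i in interactions]
        picked = self.rng.choices(interactions, weights=weights, k=k)
        return [(i, self.rng.choice(list(self.memory[obj][i]))) for i in picked]


# Usage: "sit_on" is frequent and "stand_on" is rare for the object "chair".
mem = ObjectWiseMemory()
for n in range(95):
    mem.write("chair", "sit_on", f"sit_{n}")
for n in range(5):
    mem.write("chair", "stand_on", f"stand_{n}")

draws = mem.read("chair", k=1000)
rare_share = sum(1 for name, _ in draws if name == "stand_on") / len(draws)
print(f"share of rare 'stand_on' draws: {rare_share:.2f}")
```

Under this assumed weighting, the rare interaction is drawn far more often from memory than its share of the raw data would suggest, which is the re-balancing effect the abstract describes at a high level.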
 
      
Related papers
        - A Review of Human-Object Interaction Detection [6.1941885271010175]
 Human-object interaction (HOI) detection plays a key role in high-level visual understanding.
This paper systematically summarizes and discusses the recent work in image-based HOI detection.
 arXiv  Detail & Related papers  (2024-08-20T08:32:39Z)
- A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap [50.079224604394]
 We present a novel model-agnostic framework called Context-Enhanced Feature Alignment (CEFA).
CEFA consists of a feature alignment module and a context enhancement module.
Our method can serve as a plug-and-play module to improve the detection performance of HOI models on rare categories.
 arXiv  Detail & Related papers  (2024-07-31T08:42:48Z)
- Disentangled Interaction Representation for One-Stage Human-Object
  Interaction Detection [70.96299509159981]
 Human-Object Interaction (HOI) detection is a core task for human-centric image understanding.
Recent one-stage methods adopt a transformer decoder to collect image-wide cues that are useful for interaction prediction.
Traditional two-stage methods benefit significantly from their ability to compose interaction features in a disentangled and explainable manner.
 arXiv  Detail & Related papers  (2023-12-04T08:02:59Z)
- HODN: Disentangling Human-Object Feature for HOI Detection [51.48164941412871]
 We propose a Human and Object Disentangling Network (HODN) to model the Human-Object Interaction (HOI) relationships explicitly.
Considering that human features contribute more to interaction, we propose a Human-Guide Linking method to make sure the interaction decoder focuses on the human-centric regions.
Our proposed method achieves competitive performance on both the V-COCO and the HICO-Det datasets.
 arXiv  Detail & Related papers  (2023-08-20T04:12:50Z)
- Distance Matters in Human-Object Interaction Detection [22.3445174577181]
 We propose a novel two-stage method for better handling distant interactions in HOI detection.
One essential component in our method is a novel Far Near Distance Attention module.
Besides, we devise a novel Distance-Aware loss function which leads the model to focus more on distant yet rare interactions.
 arXiv  Detail & Related papers  (2022-07-05T08:06:05Z)
- ACP++: Action Co-occurrence Priors for Human-Object Interaction
  Detection [102.9428507180728]
 A common problem in the task of human-object interaction (HOI) detection is that numerous HOI classes have only a small number of labeled examples.
We observe that there exist natural correlations and anti-correlations among human-object interactions.
We present techniques to learn these priors and leverage them for more effective training, especially on rare classes.
 arXiv  Detail & Related papers  (2021-09-09T06:02:50Z)
- Detecting Human-Object Interaction via Fabricated Compositional Learning [106.37536031160282]
 Human-Object Interaction (HOI) detection is a fundamental task for high-level scene understanding.
Humans have an extremely powerful compositional perception ability to recognize rare or unseen HOI samples.
We propose Fabricated Compositional Learning (FCL) to address the problem of open long-tailed HOI detection.
 arXiv  Detail & Related papers  (2021-03-15T08:52:56Z)
- Detecting Human-Object Interactions with Action Co-occurrence Priors [108.31956827512376]
 A common problem in the human-object interaction (HOI) detection task is that numerous HOI classes have only a small number of labeled examples.
We observe that there exist natural correlations and anti-correlations among human-object interactions.
We present techniques to learn these priors and leverage them for more effective training, especially in rare classes.
 arXiv  Detail & Related papers  (2020-07-17T02:47:45Z)
- Learning Human-Object Interaction Detection using Interaction Points [140.0200950601552]
 We propose a novel fully-convolutional approach that directly detects the interactions between human-object pairs.
Our network predicts interaction points, which directly localize and classify the interaction.
Experiments are performed on two popular benchmarks: V-COCO and HICO-DET.
 arXiv  Detail & Related papers  (2020-03-31T08:42:06Z)
       
     