Diagnosing Human-object Interaction Detectors
- URL: http://arxiv.org/abs/2308.08529v2
- Date: Fri, 1 Dec 2023 18:57:27 GMT
- Title: Diagnosing Human-object Interaction Detectors
- Authors: Fangrui Zhu, Yiming Xie, Weidi Xie, Huaizu Jiang
- Abstract summary: We introduce a diagnosis toolbox to provide detailed quantitative break-down analysis of HOI detection models.
We analyze eight state-of-the-art HOI detection models and provide valuable diagnosis insights to foster future research.
- Score: 42.283857276076596
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We have witnessed significant progress in human-object interaction (HOI)
detection. The reliance on mAP (mean Average Precision) scores as a summary
metric, however, does not provide sufficient insight into the nuances of model
performance (e.g., why one model is better than another), which can hinder
further innovation in this field. To address this issue, in this paper, we
introduce a diagnosis toolbox to provide detailed quantitative break-down
analysis of HOI detection models, inspired by the success of object detection
diagnosis toolboxes. We first conduct holistic investigations in the pipeline
of HOI detection. By defining a set of errors and the oracles to fix each of
them, we can have a quantitative analysis of the significance of different
errors according to the mAP improvement obtained from fixing each error. We
then delve into two sub-tasks of HOI detection: human-object pair detection and
interaction classification, respectively. For the first detection task, we
compute the coverage of ground-truth human-object pairs as well as the
noisiness level in the detection results. For the second classification task,
we measure a model's performance of differentiating positive and negative
detection results and also classifying the actual interactions when the
human-object pairs are correctly detected. We analyze eight state-of-the-art
HOI detection models and provide valuable diagnosis insights to foster future
research. For instance, our diagnosis shows that state-of-the-art model RLIPv2
outperforms others mainly because it significantly improves the multi-label
interaction classification accuracy. Our toolbox is applicable for different
methods across different datasets and available at
https://github.com/neu-vi/Diag-HOI.
Related papers
- Unsupervised Model Diagnosis [49.36194740479798]
This paper proposes Unsupervised Model Diagnosis (UMO) to produce semantic counterfactual explanations without any user guidance.
Our approach identifies and visualizes changes in semantics, and then matches these changes to attributes from wide-ranging text sources.
arXiv Detail & Related papers (2024-10-08T17:59:03Z) - Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech [60.08015780474457]
Alzheimer's Disease (AD) detection has emerged as a promising research area that employs machine learning classification models.
We identify within-class variation as a critical challenge in AD detection: individuals with AD exhibit a spectrum of cognitive impairments.
We propose two novel methods: Soft Target Distillation (SoTD) and Instance-level Re-balancing (InRe), targeting two problems respectively.
arXiv Detail & Related papers (2024-09-22T02:06:05Z) - Systematic Review: Anomaly Detection in Connected and Autonomous Vehicles [0.0]
This systematic review focuses on anomaly detection for connected and autonomous vehicles.
The most commonly used Artificial Intelligence (AI) algorithms employed in anomaly detection are neural networks like LSTM, CNN, and autoencoders, alongside one-class SVM.
There is a need for future research to investigate the deployment of anomaly detection to a vehicle to assess its performance on the road.
arXiv Detail & Related papers (2024-05-04T18:31:38Z) - Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark [101.23684938489413]
Anomaly detection (AD) is often focused on detecting anomalies for industrial quality inspection and medical lesion examination.
This work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field.
Inspired by the metrics in the segmentation field, we propose several more practical threshold-dependent AD-specific metrics.
arXiv Detail & Related papers (2024-04-16T17:38:26Z) - Diagnosing Errors in Video Relation Detectors [46.792264699927436]
Video relation detection forms a new and challenging problem in computer vision.
Overall performance is still marginal and it remains unclear what the key factors are towards solving the problem.
We introduce a diagnostic tool for analyzing the sources of detection errors.
arXiv Detail & Related papers (2021-10-25T17:04:08Z) - Confidence-based Out-of-Distribution Detection: A Comparative Study and
Analysis [17.398553230843717]
We assess the capability of various state-of-the-art approaches for confidence-based OOD detection.
First, we leverage a computer vision benchmark to reproduce and compare multiple OOD detection methods.
We then evaluate their capabilities on the challenging task of disease classification using chest X-rays.
arXiv Detail & Related papers (2021-07-06T12:10:09Z) - DecAug: Augmenting HOI Detection via Decomposition [54.65572599920679]
Current algorithms suffer from insufficient training samples and category imbalance within datasets.
We propose an efficient and effective data augmentation method called DecAug for HOI detection.
Experiments show that our method brings up to 3.3 mAP and 1.6 mAP improvements on V-COCO and HICODET dataset.
arXiv Detail & Related papers (2020-10-02T13:59:05Z) - DRG: Dual Relation Graph for Human-Object Interaction Detection [65.50707710054141]
We tackle the challenging problem of human-object interaction (HOI) detection.
Existing methods either recognize the interaction of each human-object pair in isolation or perform joint inference based on complex appearance-based features.
In this paper, we leverage an abstract spatial-semantic representation to describe each human-object pair and aggregate the contextual information of the scene via a dual relation graph.
arXiv Detail & Related papers (2020-08-26T17:59:40Z) - An Extensive Study on Cross-Dataset Bias and Evaluation Metrics
Interpretation for Machine Learning applied to Gastrointestinal Tract
Abnormality Classification [2.985964157078619]
Automatic analysis of diseases in the GI tract is a hot topic in computer science and medical-related journals.
A clear understanding of evaluation metrics and machine learning models with cross datasets is crucial to bring research in the field to a new quality level.
We present comprehensive evaluations of five distinct machine learning models that can classify 16 different GI tract conditions.
arXiv Detail & Related papers (2020-05-08T08:59:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.