Self-Classification Enhancement and Correction for Weakly Supervised Object Detection
- URL: http://arxiv.org/abs/2505.16294v1
- Date: Thu, 22 May 2025 06:45:58 GMT
- Title: Self-Classification Enhancement and Correction for Weakly Supervised Object Detection
- Authors: Yufei Yin, Lechao Cheng, Wengang Zhou, Jiajun Deng, Zhou Yu, Houqiang Li,
- Abstract summary: weakly supervised object detection (WSOD) has attracted much attention due to its low labeling cost.<n>In this work, we introduce a novel WSOD framework to ameliorate these two issues.<n>For one thing, we propose a self-classification enhancement module that integrates intra-class binary classification (ICBC) to bridge the gap between the two distinct MCC tasks.<n>For another, we propose a self-classification correction algorithm during inference, which combines the results of both MCC tasks to effectively reduce the mis-classified predictions.
- Score: 113.51483527300496
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, weakly supervised object detection (WSOD) has attracted much attention due to its low labeling cost. The success of recent WSOD models is often ascribed to the two-stage multi-class classification (MCC) task, i.e., multiple instance learning and online classification refinement. Despite achieving non-trivial progresses, these methods overlook potential classification ambiguities between these two MCC tasks and fail to leverage their unique strengths. In this work, we introduce a novel WSOD framework to ameliorate these two issues. For one thing, we propose a self-classification enhancement module that integrates intra-class binary classification (ICBC) to bridge the gap between the two distinct MCC tasks. The ICBC task enhances the network's discrimination between positive and mis-located samples in a class-wise manner and forges a mutually reinforcing relationship with the MCC task. For another, we propose a self-classification correction algorithm during inference, which combines the results of both MCC tasks to effectively reduce the mis-classified predictions. Extensive experiments on the prevalent VOC 2007 & 2012 datasets demonstrate the superior performance of our framework.
Related papers
- Clustering-Guided Multi-Layer Contrastive Representation Learning for Citrus Disease Classification [17.627760587507737]
Citrus is one of the most economically important fruit crops globally.<n> Accurate disease detection and classification serve as critical prerequisites for implementing targeted control measures.<n>Recent advancements in artificial intelligence, particularly deep learning-based computer vision algorithms, have substantially decreased time and labor requirements.
arXiv Detail & Related papers (2025-07-15T10:22:52Z) - Decoupled Competitive Framework for Semi-supervised Medical Image Segmentation [2.146676124065199]
Semi-supervised medical image segmentation (SSMIS) is a promising solution for insufficiently annotated samples in medical domain.<n>Most approaches following the Mean Teacher (MT) or Dual Students (DS) architecture have achieved commendable results.<n>A Decoupled Competitive Framework (DCF) is elaborated in this work, which utilizes a straightforward competition mechanism for the update of EMA.<n>The DCF undergoes rigorous validation on three publicly accessible datasets, which encompass both 2D and 3D datasets.
arXiv Detail & Related papers (2025-05-30T14:56:00Z) - Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation [2.7855886538423182]
This work addresses the task of completely weakly supervised class-incremental learning for semantic segmentation.<n>To achieve this, we propose to generate robust pseudo-labels by combining pseudo-labels from a localizer and a sequence of foundation models.<n>We also introduce an exemplar-guided data augmentation method that generates diverse images containing both previous and novel classes with guidance.
arXiv Detail & Related papers (2025-05-16T01:43:36Z) - Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification [1.757077789361314]
We propose a novel Cross-Hierarchical Bidirectional Consistency Learning (CHBC) framework to improve classification accuracy and consistency.<n>The CHBC framework extracts discriminative features across various hierarchies using a specially designed module to decompose and enhance attention masks.<n> Experiments on three widely used FGVC datasets validate the effectiveness of the CHBC framework.
arXiv Detail & Related papers (2025-04-18T10:30:17Z) - CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation [75.35841972192684]
Effective Class Incremental (CIS) requires simultaneously mitigating catastrophic forgetting and ensuring sufficient plasticity to integrate new classes.<n>We introduce a novel approach, Conflict Mitigation via Branched Optimization(CoMBO)<n>Within this approach, we present the Query Conflict Reduction module, designed to explicitly refine queries for new classes.
arXiv Detail & Related papers (2025-04-05T12:34:51Z) - Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning [49.65841002338575]
This paper focuses on the challenging yet practical multi-label class-incremental learning (MLCIL) problem.<n>We propose a novel class-independent incremental network (CINet) to extract multiple class-level embeddings for multi-label samples.<n>It learns and preserves the knowledge of different classes by constructing class-specific tokens.
arXiv Detail & Related papers (2025-03-01T14:40:52Z) - Joint Input and Output Coordination for Class-Incremental Learning [84.36763449830812]
We propose a joint input and output coordination (JIOC) mechanism to address these issues.
This mechanism assigns different weights to different categories of data according to the gradient of the output score.
It can be incorporated into different incremental learning approaches that use memory storage.
arXiv Detail & Related papers (2024-09-09T13:55:07Z) - Constrained Multi-Layer Contrastive Learning for Implicit Discourse Relationship Recognition [9.644808110078833]
We adapt a supervised contrastive learning (CL) method, label- and instance-centered CL, to enhance representation learning.
Experimental results on PDTB 2.0 and PDTB 3.0 show that our approach can significantly improve the performance on both multi-class classification and binary classification.
arXiv Detail & Related papers (2024-09-07T17:55:41Z) - Cross-supervised Dual Classifiers for Semi-supervised Medical Image
Segmentation [10.18427897663732]
Semi-supervised medical image segmentation offers a promising solution for large-scale medical image analysis.
This paper proposes a cross-supervised learning framework based on dual classifiers (DC-Net)
Experiments on LA and Pancreas-CT dataset illustrate that DC-Net outperforms other state-of-the-art methods for semi-supervised segmentation.
arXiv Detail & Related papers (2023-05-25T16:23:39Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - Few-Shot Fine-Grained Action Recognition via Bidirectional Attention and
Contrastive Meta-Learning [51.03781020616402]
Fine-grained action recognition is attracting increasing attention due to the emerging demand of specific action understanding in real-world applications.
We propose a few-shot fine-grained action recognition problem, aiming to recognize novel fine-grained actions with only few samples given for each class.
Although progress has been made in coarse-grained actions, existing few-shot recognition methods encounter two issues handling fine-grained actions.
arXiv Detail & Related papers (2021-08-15T02:21:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.