Deep Learning for Weakly-Supervised Object Detection and Object
Localization: A Survey
- URL: http://arxiv.org/abs/2105.12694v1
- Date: Wed, 26 May 2021 17:15:53 GMT
- Title: Deep Learning for Weakly-Supervised Object Detection and Object
Localization: A Survey
- Authors: Feifei Shao, Long Chen, Jian Shao, Wei Ji, Shaoning Xiao, Lu Ye,
Yueting Zhuang, Jun Xiao
- Abstract summary: Weakly-Supervised Object Detection (WSOD) and localization (WSOL) are long-standing and challenging CV tasks.
With the success of deep neural networks in object detection, both WSOD and WSOL have received unprecedented attention.
Hundreds of WSOD and WSOL methods and numerous techniques have been proposed in the deep learning era.
- Score: 37.545083067084306
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e.,
detecting multiple and single instances with bounding boxes in an image using
image-level labels, are long-standing and challenging tasks in the CV
community. With the success of deep neural networks in object detection, both
WSOD and WSOL have received unprecedented attention. Hundreds of WSOD and WSOL
methods and numerous techniques have been proposed in the deep learning era. To
this end, in this paper, we consider WSOL is a sub-task of WSOD and provide a
comprehensive survey of the recent achievements of WSOD. Specifically, we
firstly describe the formulation and setting of the WSOD, including the
background, challenges, basic framework. Meanwhile, we summarize and analyze
all advanced techniques and training tricks for improving detection
performance. Then, we introduce the widely-used datasets and evaluation metrics
of WSOD. Lastly, we discuss the future directions of WSOD. We believe that
these summaries can help pave a way for future research on WSOD and WSOL.
Related papers
- Beyond Few-shot Object Detection: A Detailed Survey [25.465534270637523]
Researchers have introduced few-shot object detection (FSOD) approaches that merge few-shot learning and object detection principles.
These approaches play a vital role in reducing the reliance on extensive labeled datasets.
This survey paper aims to provide a comprehensive understanding of the above-mentioned few-shot settings and explore the methodologies for each FSOD task.
arXiv Detail & Related papers (2024-08-26T13:09:23Z) - A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning [51.7818820745221]
Underwater image enhancement (UIE) presents a significant challenge within computer vision research.
Despite the development of numerous UIE algorithms, a thorough and systematic review is still absent.
arXiv Detail & Related papers (2024-05-30T04:46:40Z) - Few-Shot Object Detection: Research Advances and Challenges [15.916463121997843]
Few-shot object detection (FSOD) combines few-shot learning and object detection techniques to rapidly adapt to novel objects with limited annotated samples.
This paper presents a comprehensive survey to review the significant advancements in the field of FSOD in recent years.
arXiv Detail & Related papers (2024-04-07T03:37:29Z) - Remote Sensing Object Detection Meets Deep Learning: A Meta-review of
Challenges and Advances [51.70835702029498]
This review aims to present a comprehensive review of the recent achievements in deep learning based RSOD methods.
We identify five main challenges in RSOD, including multi-scale object detection, rotated object detection, weak object detection, tiny object detection, and object detection with limited supervision.
We also review the widely used benchmark datasets and evaluation metrics within the field of RSOD, as well as the application scenarios for RSOD.
arXiv Detail & Related papers (2023-09-13T06:48:32Z) - Improved Region Proposal Network for Enhanced Few-Shot Object Detection [23.871860648919593]
Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches.
We develop a semi-supervised algorithm to detect and then utilize unlabeled novel objects as positive samples during the FSOD training stage.
Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception of the object detection model for large objects.
arXiv Detail & Related papers (2023-08-15T02:35:59Z) - Recent Few-Shot Object Detection Algorithms: A Survey with Performance
Comparison [54.357707168883024]
Few-Shot Object Detection (FSOD) mimics the humans' ability of learning to learn.
FSOD intelligently transfers the learned generic object knowledge from the common heavy-tailed, to the novel long-tailed object classes.
We give an overview of FSOD, including the problem definition, common datasets, and evaluation protocols.
arXiv Detail & Related papers (2022-03-27T04:11:28Z) - Activation to Saliency: Forming High-Quality Labels for Unsupervised
Salient Object Detection [54.92703325989853]
We propose a two-stage Activation-to-Saliency (A2S) framework that effectively generates high-quality saliency cues.
No human annotations are involved in our framework during the whole training process.
Our framework reports significant performance compared with existing USOD methods.
arXiv Detail & Related papers (2021-12-07T11:54:06Z) - A Survey of Deep Learning for Low-Shot Object Detection [44.20187548691372]
Low-Shot Object Detection (LSOD) is an emerging research topic of detecting objects from a few or even no annotated samples.
This survey provides a comprehensive review of LSOD methods.
arXiv Detail & Related papers (2021-12-06T06:56:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.