Recent Few-Shot Object Detection Algorithms: A Survey with Performance
Comparison
- URL: http://arxiv.org/abs/2203.14205v2
- Date: Wed, 12 Apr 2023 05:26:58 GMT
- Title: Recent Few-Shot Object Detection Algorithms: A Survey with Performance
Comparison
- Authors: Tianying Liu, Lu Zhang, Yang Wang, Jihong Guan, Yanwei Fu, Jiajia
Zhao, Shuigeng Zhou
- Abstract summary: Few-Shot Object Detection (FSOD) mimics the humans' ability of learning to learn.
FSOD intelligently transfers the learned generic object knowledge from the common heavy-tailed, to the novel long-tailed object classes.
We give an overview of FSOD, including the problem definition, common datasets, and evaluation protocols.
- Score: 54.357707168883024
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The generic object detection (GOD) task has been successfully tackled by
recent deep neural networks, trained by an avalanche of annotated training
samples from some common classes. However, it is still non-trivial to
generalize these object detectors to the novel long-tailed object classes,
which have only few labeled training samples. To this end, the Few-Shot Object
Detection (FSOD) has been topical recently, as it mimics the humans' ability of
learning to learn, and intelligently transfers the learned generic object
knowledge from the common heavy-tailed, to the novel long-tailed object
classes. Especially, the research in this emerging field has been flourishing
in recent years with various benchmarks, backbones, and methodologies proposed.
To review these FSOD works, there are several insightful FSOD survey articles
[58, 59, 74, 78] that systematically study and compare them as the groups of
fine-tuning/transfer learning, and meta-learning methods. In contrast, we
review the existing FSOD algorithms from a new perspective under a new taxonomy
based on their contributions, i.e., data-oriented, model-oriented, and
algorithm-oriented. Thus, a comprehensive survey with performance comparison is
conducted on recent achievements of FSOD. Furthermore, we also analyze the
technical challenges, the merits and demerits of these methods, and envision
the future directions of FSOD. Specifically, we give an overview of FSOD,
including the problem definition, common datasets, and evaluation protocols.
The taxonomy is then proposed that groups FSOD methods into three types.
Following this taxonomy, we provide a systematic review of the advances in
FSOD. Finally, further discussions on performance, challenges, and future
directions are presented.
Related papers
- Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions [62.12545440385489]
Large language models (LLMs) have brought substantial advancements in text generation, but their potential for enhancing classification tasks remains underexplored.
We propose a framework for thoroughly investigating fine-tuning LLMs for classification, including both generation- and encoding-based approaches.
We instantiate this framework in edit intent classification (EIC), a challenging and underexplored classification task.
arXiv Detail & Related papers (2024-10-02T20:48:28Z) - Beyond Few-shot Object Detection: A Detailed Survey [25.465534270637523]
Researchers have introduced few-shot object detection (FSOD) approaches that merge few-shot learning and object detection principles.
These approaches play a vital role in reducing the reliance on extensive labeled datasets.
This survey paper aims to provide a comprehensive understanding of the above-mentioned few-shot settings and explore the methodologies for each FSOD task.
arXiv Detail & Related papers (2024-08-26T13:09:23Z) - Deep Learning-Based Object Pose Estimation: A Comprehensive Survey [73.74933379151419]
We discuss the recent advances in deep learning-based object pose estimation.
Our survey also covers multiple input data modalities, degrees-of-freedom of output poses, object properties, and downstream tasks.
arXiv Detail & Related papers (2024-05-13T14:44:22Z) - Few-Shot Object Detection: Research Advances and Challenges [15.916463121997843]
Few-shot object detection (FSOD) combines few-shot learning and object detection techniques to rapidly adapt to novel objects with limited annotated samples.
This paper presents a comprehensive survey to review the significant advancements in the field of FSOD in recent years.
arXiv Detail & Related papers (2024-04-07T03:37:29Z) - Fine-Grained Zero-Shot Learning: Advances, Challenges, and Prospects [84.36935309169567]
We present a broad review of recent advances for fine-grained analysis in zero-shot learning (ZSL)
We first provide a taxonomy of existing methods and techniques with a thorough analysis of each category.
Then, we summarize the benchmark, covering publicly available datasets, models, implementations, and some more details as a library.
arXiv Detail & Related papers (2024-01-31T11:51:24Z) - Continual Learning with Pre-Trained Models: A Survey [61.97613090666247]
Continual Learning aims to overcome the catastrophic forgetting of former knowledge when learning new ones.
This paper presents a comprehensive survey of the latest advancements in PTM-based CL.
arXiv Detail & Related papers (2024-01-29T18:27:52Z) - A Survey on Few-Shot Class-Incremental Learning [11.68962265057818]
Few-shot class-incremental learning (FSCIL) poses a significant challenge for deep neural networks to learn new tasks.
This paper provides a comprehensive survey on FSCIL.
FSCIL has achieved impressive achievements in various fields of computer vision.
arXiv Detail & Related papers (2023-04-17T10:15:08Z) - A Comparative Review of Recent Few-Shot Object Detection Algorithms [0.0]
Few-shot object detection, learning to adapt to the novel classes with a few labeled data, is an imperative and long-lasting problem.
Recent studies have explored how to use implicit cues in extra datasets without target-domain supervision to help few-shot detectors refine robust task notions.
arXiv Detail & Related papers (2021-10-30T07:57:11Z) - A Survey on Text Classification: From Shallow to Deep Learning [83.47804123133719]
The last decade has seen a surge of research in this area due to the unprecedented success of deep learning.
This paper fills the gap by reviewing the state-of-the-art approaches from 1961 to 2021.
We create a taxonomy for text classification according to the text involved and the models used for feature extraction and classification.
arXiv Detail & Related papers (2020-08-02T00:09:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.