Towards Open World Detection: A Survey
- URL: http://arxiv.org/abs/2508.16527v1
- Date: Fri, 22 Aug 2025 16:49:52 GMT
- Title: Towards Open World Detection: A Survey
- Authors: Andrei-Stefan Bulzan, Cosmin Cernazanu-Glavan,
- Abstract summary: Open World Detection (OWD) is an umbrella term we propose to unify class-agnostic and generally applicable detection models in the vision domain.<n>We start from the history of foundational vision and cover key concepts, methodologies and datasets making up today's state-of-the-art landscape.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: For decades, Computer Vision has aimed at enabling machines to perceive the external world. Initial limitations led to the development of highly specialized niches. As success in each task accrued and research progressed, increasingly complex perception tasks emerged. This survey charts the convergence of these tasks and, in doing so, introduces Open World Detection (OWD), an umbrella term we propose to unify class-agnostic and generally applicable detection models in the vision domain. We start from the history of foundational vision subdomains and cover key concepts, methodologies and datasets making up today's state-of-the-art landscape. This traverses topics starting from early saliency detection, foreground/background separation, out of distribution detection and leading up to open world object detection, zero-shot detection and Vision Large Language Models (VLLMs). We explore the overlap between these subdomains, their increasing convergence, and their potential to unify into a singular domain in the future, perception.
Related papers
- A Survey on Video Anomaly Detection via Deep Learning: Human, Vehicle, and Environment [2.3349787245442966]
Video Anomaly Detection (VAD) has emerged as a pivotal task in computer vision, with broad relevance across multiple fields.<n>Recent advances in deep learning have driven significant progress in this area, yet the field remains fragmented across domains and learning paradigms.<n>This survey offers a comprehensive perspective on VAD, systematically organizing the literature across various supervision levels.
arXiv Detail & Related papers (2025-08-19T18:50:49Z) - Vision Generalist Model: A Survey [87.49797517847132]
We provide a comprehensive overview of the vision generalist models, delving into their characteristics and capabilities within the field.<n>We take a brief excursion into related domains, shedding light on their interconnections and potential synergies.
arXiv Detail & Related papers (2025-06-11T17:23:41Z) - Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook [85.43403500874889]
Retrieval-augmented generation (RAG) has emerged as a pivotal technique in artificial intelligence (AI)<n>Recent advancements in RAG for embodied AI, with a particular focus on applications in planning, task execution, multimodal perception, interaction, and specialized domains.
arXiv Detail & Related papers (2025-03-23T10:33:28Z) - Towards Visual Grounding: A Survey [87.37662490666098]
Since 2021, visual grounding has witnessed significant advancements, with emerging new concepts such as grounded pre-training.<n>This survey is designed to be suitable for both beginners and experienced researchers, serving as an invaluable resource for understanding key concepts and tracking the latest research developments.
arXiv Detail & Related papers (2024-12-28T16:34:35Z) - Open World Object Detection: A Survey [16.839310066730533]
Open world object detection (OWOD) is an emerging area of research that adapts this principle to explore new knowledge.<n>This paper offers a thorough review of the OWOD domain, covering essential aspects, including problem definitions, benchmark datasets, source codes, evaluation metrics, and a comparative study of existing methods.<n>The paper concludes by addressing the limitations and challenges faced by current OWOD algorithms and proposes directions for future research.
arXiv Detail & Related papers (2024-10-15T05:46:00Z) - Deepfake Generation and Detection: A Benchmark and Survey [134.19054491600832]
Deepfake is a technology dedicated to creating highly realistic facial images and videos under specific conditions.
This survey comprehensively reviews the latest developments in deepfake generation and detection.
We focus on researching four representative deepfake fields: face swapping, face reenactment, talking face generation, and facial attribute editing.
arXiv Detail & Related papers (2024-03-26T17:12:34Z) - Generalized Video Anomaly Event Detection: Systematic Taxonomy and
Comparison of Deep Models [33.43062232461652]
Video Anomaly Detection (VAD) serves as a pivotal technology in the intelligent surveillance systems.
This survey extends the conventional scope of VAD beyond unsupervised methods, encompassing a broader spectrum termed Generalized Video Anomaly Event Detection (GVAED)
arXiv Detail & Related papers (2023-02-10T07:11:37Z) - Universal Object Detection with Large Vision Model [79.06618136217142]
This study focuses on the large-scale, multi-domain universal object detection problem.
To address these challenges, we introduce our approach to label handling, hierarchy-aware design, and resource-efficient model training.
Our method has demonstrated remarkable performance, securing a prestigious second-place ranking in the object detection track of the Robust Vision Challenge 2022.
arXiv Detail & Related papers (2022-12-19T12:40:13Z) - Fine-Grained Image Analysis with Deep Learning: A Survey [146.22351342315233]
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition.
This paper attempts to re-define and broaden the field of FGIA by consolidating two fundamental fine-grained research areas -- fine-grained image recognition and fine-grained image retrieval.
arXiv Detail & Related papers (2021-11-11T09:43:56Z) - Towards Open World Object Detection [68.79678648726416]
ORE: Open World Object Detector is based on contrastive clustering and energy based unknown identification.
We find that identifying and characterizing unknown instances helps to reduce confusion in an incremental object detection setting.
arXiv Detail & Related papers (2021-03-03T18:58:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.