Dynamic Proposals for Efficient Object Detection
- URL: http://arxiv.org/abs/2207.05252v1
- Date: Tue, 12 Jul 2022 01:32:50 GMT
- Title: Dynamic Proposals for Efficient Object Detection
- Authors: Yiming Cui, Linjie Yang, Ding Liu
- Abstract summary: We propose a simple yet effective method which is adaptive to different computational resources by generating dynamic proposals for object detection.
Our method achieves significant speed-up across a wide range of detection models including two-stage and query-based models.
- Score: 48.66093789652899
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection is a basic computer vision task to loccalize and categorize
objects in a given image. Most state-of-the-art detection methods utilize a
fixed number of proposals as an intermediate representation of object
candidates, which is unable to adapt to different computational constraints
during inference. In this paper, we propose a simple yet effective method which
is adaptive to different computational resources by generating dynamic
proposals for object detection. We first design a module to make a single
query-based model to be able to inference with different numbers of proposals.
Further, we extend it to a dynamic model to choose the number of proposals
according to the input image, greatly reducing computational costs. Our method
achieves significant speed-up across a wide range of detection models including
two-stage and query-based models while obtaining similar or even better
accuracy.
Related papers
- Appearance-based Refinement for Object-Centric Motion Segmentation [95.80420062679104]
We introduce an appearance-based refinement method that leverages temporal consistency in video streams to correct inaccurate flow-based proposals.
Our approach involves a simple selection mechanism that identifies accurate flow-predicted masks as exemplars.
Its performance is evaluated on multiple video segmentation benchmarks, including DAVIS, YouTubeVOS, SegTrackv2, and FBMS-59.
arXiv Detail & Related papers (2023-12-18T18:59:51Z) - Uncertainty Aware Active Learning for Reconfiguration of Pre-trained
Deep Object-Detection Networks for New Target Domains [0.0]
Object detection is one of the most important and fundamental aspects of computer vision tasks.
To obtain training data for object detection model efficiently, many datasets opt to obtain their unannotated data in video format.
Annotating every frame from a video is costly and inefficient since many frames contain very similar information for the model to learn from.
In this paper, we proposed a novel active learning algorithm for object detection models to tackle this problem.
arXiv Detail & Related papers (2023-03-22T17:14:10Z) - D\'etection d'Objets dans les documents num\'eris\'es par r\'eseaux de
neurones profonds [0.0]
We study multiple tasks related to document layout analysis such as the detection of text lines, the splitting into acts or the detection of the writing support.
We propose two deep neural models following two different approaches.
arXiv Detail & Related papers (2023-01-27T14:45:45Z) - Object Detection with Deep Reinforcement Learning [0.0]
We implement a novel active object localization algorithm based on deep reinforcement learning.
We compare two different action settings for this MDP: a hierarchical method and a dynamic method.
arXiv Detail & Related papers (2022-08-09T02:34:53Z) - Attentional Prototype Inference for Few-Shot Segmentation [128.45753577331422]
We propose attentional prototype inference (API), a probabilistic latent variable framework for few-shot segmentation.
We define a global latent variable to represent the prototype of each object category, which we model as a probabilistic distribution.
We conduct extensive experiments on four benchmarks, where our proposal obtains at least competitive and often better performance than state-of-the-art prototype-based methods.
arXiv Detail & Related papers (2021-05-14T06:58:44Z) - Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with
Attentive Feature Alignment [33.446875089255876]
Few-shot object detection (FSOD) aims to detect objects using only few examples.
We propose a meta-learning based few-shot object detection method by transferring meta-knowledge learned from data-abundant base classes to data-scarce novel classes.
arXiv Detail & Related papers (2021-04-15T19:01:27Z) - Ensembling object detectors for image and video data analysis [98.26061123111647]
We propose a method for ensembling the outputs of multiple object detectors for improving detection performance and precision of bounding boxes on image data.
We extend it to video data by proposing a two-stage tracking-based scheme for detection refinement.
arXiv Detail & Related papers (2021-02-09T12:38:16Z) - Part-aware Prototype Network for Few-shot Semantic Segmentation [50.581647306020095]
We propose a novel few-shot semantic segmentation framework based on the prototype representation.
Our key idea is to decompose the holistic class representation into a set of part-aware prototypes.
We develop a novel graph neural network model to generate and enhance the proposed part-aware prototypes.
arXiv Detail & Related papers (2020-07-13T11:03:09Z) - Adaptive Object Detection with Dual Multi-Label Prediction [78.69064917947624]
We propose a novel end-to-end unsupervised deep domain adaptation model for adaptive object detection.
The model exploits multi-label prediction to reveal the object category information in each image.
We introduce a prediction consistency regularization mechanism to assist object detection.
arXiv Detail & Related papers (2020-03-29T04:23:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.