IterDet: Iterative Scheme for Object Detection in Crowded Environments
- URL: http://arxiv.org/abs/2005.05708v2
- Date: Fri, 29 Jan 2021 07:26:14 GMT
- Title: IterDet: Iterative Scheme for Object Detection in Crowded Environments
- Authors: Danila Rukhovich, Konstantin Sofiiuk, Danil Galeev, Olga Barinova,
Anton Konushin
- Abstract summary: Deep learning-based detectors usually produce a redundant set of object bounding boxes.
These boxes are filtered using non-maximum suppression (NMS) in order to select exactly one bounding box per object of interest.
This greedy scheme is simple and provides sufficient accuracy for isolated objects but often fails in crowded environments.
In this work we develop an alternative iterative scheme, where a new subset of objects is detected at each iteration.
- Score: 6.2997667081978825
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep learning-based detectors usually produce a redundant set of object
bounding boxes including many duplicate detections of the same object. These
boxes are then filtered using non-maximum suppression (NMS) in order to select
exactly one bounding box per object of interest. This greedy scheme is simple
and provides sufficient accuracy for isolated objects but often fails in
crowded environments, since one needs to both preserve boxes for different
objects and suppress duplicate detections. In this work we develop an
alternative iterative scheme, where a new subset of objects is detected at each
iteration. Detected boxes from the previous iterations are passed to the
network at the following iterations to ensure that the same object would not be
detected twice. This iterative scheme can be applied to both one-stage and
two-stage object detectors with just minor modifications of the training and
inference procedures. We perform extensive experiments with two different
baseline detectors on four datasets and show significant improvement over the
baseline, leading to state-of-the-art performance on CrowdHuman and WiderPerson
datasets. The source code and the trained models are available at
https://github.com/saic-vul/iterdet.
Related papers
- Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation [58.37525311718006]
We put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD)
We propose CastDet, a CLIP-activated student-teacher detection framework that serves as the first OVAD detector specifically designed for the challenging aerial scenario.
Our framework integrates a robust localization teacher along with several box selection strategies to generate high-quality proposals for novel objects.
arXiv Detail & Related papers (2024-11-04T12:59:13Z) - Bayesian Detector Combination for Object Detection with Crowdsourced Annotations [49.43709660948812]
Acquiring fine-grained object detection annotations in unconstrained images is time-consuming, expensive, and prone to noise.
We propose a novel Bayesian Detector Combination (BDC) framework to more effectively train object detectors with noisy crowdsourced annotations.
BDC is model-agnostic, requires no prior knowledge of the annotators' skill level, and seamlessly integrates with existing object detection models.
arXiv Detail & Related papers (2024-07-10T18:00:54Z) - Object-Centric Multiple Object Tracking [124.30650395969126]
This paper proposes a video object-centric model for multiple-object tracking pipelines.
It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module.
Benefited from object-centric learning, we only require sparse detection labels for object localization and feature binding.
arXiv Detail & Related papers (2023-09-01T03:34:12Z) - Linear Object Detection in Document Images using Multiple Object
Tracking [58.720142291102135]
Linear objects convey substantial information about document structure.
Many approaches can recover some vector representation, but only one closed-source technique introduced in 1994.
We propose a framework for accurate instance segmentation of linear objects in document images using Multiple Object Tracking.
arXiv Detail & Related papers (2023-05-26T14:22:03Z) - Real-time Multi-Object Tracking Based on Bi-directional Matching [0.0]
This study offers a bi-directional matching algorithm for multi-object tracking.
A stranded area is used in the matching algorithm to temporarily store the objects that fail to be tracked.
In the MOT17 challenge, the proposed algorithm achieves 63.4% MOTA, 55.3% IDF1, and 20.1 FPS tracking speed.
arXiv Detail & Related papers (2023-03-15T08:38:08Z) - CASAPose: Class-Adaptive and Semantic-Aware Multi-Object Pose Estimation [2.861848675707602]
We present a new single-stage architecture called CASAPose.
It determines 2D-3D correspondences for pose estimation of multiple different objects in RGB images in one pass.
It is fast and memory efficient, and achieves high accuracy for multiple objects.
arXiv Detail & Related papers (2022-10-11T10:20:01Z) - W2N:Switching From Weak Supervision to Noisy Supervision for Object
Detection [64.10643170523414]
We propose a novel WSOD framework with a new paradigm that switches from weak supervision to noisy supervision (W2N)
In the localization adaptation module, we propose a regularization loss to reduce the proportion of discriminative parts in original pseudo ground-truths.
Our W2N outperforms all existing pure WSOD methods and transfer learning methods.
arXiv Detail & Related papers (2022-07-25T12:13:48Z) - Few-shot Object Counting and Detection [25.61294147822642]
We tackle a new task of few-shot object counting and detection. Given a few exemplar bounding boxes of a target object class, we seek to count and detect all objects of the target class.
This task shares the same supervision as the few-shot object counting but additionally outputs the object bounding boxes along with the total object count.
We introduce a novel two-stage training strategy and a novel uncertainty-aware few-shot object detector: Counting-DETR.
arXiv Detail & Related papers (2022-07-22T10:09:18Z) - Multi-Grid Redundant Bounding Box Annotation for Accurate Object
Detection [0.0]
YOLOv3 is a state-of-the-art one-shot detector that takes in an input image and divides it into an equal-sized grid matrix.
This paper presents a new mathematical approach that assigns multiple grids per object for accurately tight-fit bounding box prediction.
arXiv Detail & Related papers (2022-01-05T23:01:55Z) - Decoupled Adaptation for Cross-Domain Object Detection [69.5852335091519]
Cross-domain object detection is more challenging than object classification.
D-adapt achieves a state-of-the-art results on four cross-domain object detection tasks.
arXiv Detail & Related papers (2021-10-06T08:43:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.