Segmentation-Based Bounding Box Generation for Omnidirectional
Pedestrian Detection
- URL: http://arxiv.org/abs/2104.13764v3
- Date: Sun, 4 Jun 2023 01:20:22 GMT
- Title: Segmentation-Based Bounding Box Generation for Omnidirectional
Pedestrian Detection
- Authors: Masato Tamura, Tomoaki Yoshinaga
- Abstract summary: We propose a segmentation-based bounding box generation method for omnidirectional pedestrian detection.
Due to the wide angle of view, omnidirectional cameras are more cost-effective than standard cameras.
Standard pedestrian detectors are likely to be substantially degraded because pedestrians' appearance in omnidirectional images may be rotated to any angle.
- Score: 8.122270502556374
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a segmentation-based bounding box generation method for
omnidirectional pedestrian detection that enables detectors to tightly fit
bounding boxes to pedestrians without omnidirectional images for training. Due
to the wide angle of view, omnidirectional cameras are more cost-effective than
standard cameras and hence suitable for large-scale monitoring. The problem of
using omnidirectional cameras for pedestrian detection is that the performance
of standard pedestrian detectors is likely to be substantially degraded because
pedestrians' appearance in omnidirectional images may be rotated to any angle.
Existing methods mitigate this issue by transforming images during inference.
However, the transformation substantially degrades the detection accuracy and
speed. A recently proposed method obviates the transformation by training
detectors with omnidirectional images, which instead incurs huge annotation
costs. To obviate both the transformation and annotation works, we leverage an
existing large-scale object detection dataset. We train a detector with rotated
images and tightly fitted bounding box annotations generated from the
segmentation annotations in the dataset, resulting in detecting pedestrians in
omnidirectional images with tightly fitted bounding boxes. We also develop
pseudo-fisheye distortion augmentation, which further enhances the performance.
Extensive analysis shows that our detector successfully fits bounding boxes to
pedestrians and demonstrates substantial performance improvement.
Related papers
- Zone Evaluation: Revealing Spatial Bias in Object Detection [69.59295428233844]
A fundamental limitation of object detectors is that they suffer from "spatial bias"
We present a new zone evaluation protocol, which measures the detection performance over zones.
For the first time, we provide numerical results, showing that the object detectors perform quite unevenly across the zones.
arXiv Detail & Related papers (2023-10-20T01:44:49Z) - Traditional methods in Edge, Corner and Boundary detection [0.0]
There are many real-world applications of edge, corner, and boundary detection methods.
In modern innovations like autonomous vehicles, edge detection and segmentation are the most crucial things.
Real-world images are used to validate detector performance and limitations.
arXiv Detail & Related papers (2022-08-12T22:26:05Z) - Cross-Camera Trajectories Help Person Retrieval in a Camera Network [124.65912458467643]
Existing methods often rely on purely visual matching or consider temporal constraints but ignore the spatial information of the camera network.
We propose a pedestrian retrieval framework based on cross-camera generation, which integrates both temporal and spatial information.
To verify the effectiveness of our method, we construct the first cross-camera pedestrian trajectory dataset.
arXiv Detail & Related papers (2022-04-27T13:10:48Z) - End-to-End Instance Edge Detection [29.650295133113183]
Edge detection has long been an important problem in the field of computer vision.
Previous works have explored category-agnostic or category-aware edge detection.
In this paper, we explore edge detection in the context of object instances.
arXiv Detail & Related papers (2022-04-06T15:32:21Z) - ARPD: Anchor-free Rotation-aware People Detection using Topview Fisheye
Camera [3.0868856870169625]
We propose ARPD, a single-stage anchor-free fully convolutional network to detect arbitrarily rotated people in fish-eye images.
Our method competes favorably with state-of-the-art algorithms while running significantly faster.
arXiv Detail & Related papers (2022-01-25T05:49:50Z) - Pedestrian Detection: Domain Generalization, CNNs, Transformers and
Beyond [82.37430109152383]
We show that, current pedestrian detectors poorly handle even small domain shifts in cross-dataset evaluation.
We attribute the limited generalization to two main factors, the method and the current sources of data.
We propose a progressive fine-tuning strategy which improves generalization.
arXiv Detail & Related papers (2022-01-10T06:00:26Z) - Cross-Camera Feature Prediction for Intra-Camera Supervised Person
Re-identification across Distant Scenes [70.30052164401178]
Person re-identification (Re-ID) aims to match person images across non-overlapping camera views.
ICS-DS Re-ID uses cross-camera unpaired data with intra-camera identity labels for training.
Cross-camera feature prediction method to mine cross-camera self supervision information.
Joint learning of global-level and local-level features forms a global-local cross-camera feature prediction scheme.
arXiv Detail & Related papers (2021-07-29T11:27:50Z) - SADet: Learning An Efficient and Accurate Pedestrian Detector [68.66857832440897]
This paper proposes a series of systematic optimization strategies for the detection pipeline of one-stage detector.
It forms a single shot anchor-based detector (SADet) for efficient and accurate pedestrian detection.
Though structurally simple, it presents state-of-the-art result and real-time speed of $20$ FPS for VGA-resolution images.
arXiv Detail & Related papers (2020-07-26T12:32:38Z) - Deep Soft Procrustes for Markerless Volumetric Sensor Alignment [81.13055566952221]
In this work, we improve markerless data-driven correspondence estimation to achieve more robust multi-sensor spatial alignment.
We incorporate geometric constraints in an end-to-end manner into a typical segmentation based model and bridge the intermediate dense classification task with the targeted pose estimation one.
Our model is experimentally shown to achieve similar results with marker-based methods and outperform the markerless ones, while also being robust to the pose variations of the calibration structure.
arXiv Detail & Related papers (2020-03-23T10:51:32Z) - Detecting Lane and Road Markings at A Distance with Perspective
Transformer Layers [5.033948921121557]
In existing approaches, the detection accuracy often degrades with the increasing distance.
This is due to the fact that distant lane and road markings occupy a small number of pixels in the image.
Inverse Perspective Mapping can be used to eliminate the perspective distortion, but the inherent can lead to artifacts.
arXiv Detail & Related papers (2020-03-19T03:22:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.